I came across a nice interview about AMD's upcoming Llano and thought I should share.
AMD Llano - The First Accelerated Processing Unit
Source
AMD has started promoting its new Fusion concept, which, among other things, encompasses central and graphics processor on the same piece of silicon. This approach is significantly different from the existing integration of the competitive company, and we came in contact with Samuel Naffziger, AMD Senior Fellow, who explained how they thought this whole system and which are the important innovations comparing to the current approach.
IHW: So what was the exact name? Llano, if we understand correctly?
Samuel Naffziger: Yes, it's Llano, with double "L". It is the codename for our first APU (accelerated processing unit), which is the description they're using for our first products that integrate CPU and GPU on a single piece of silicon. It's actually four CPU cores on the first APU product. Here we talk about new era of processor performance, and the message here is at the same time complex and very simple. If you look at the history of architectures, specificaly CPU going back to the single core, and how advancements were made to performance, it was very much driven by frequency increase. Over time that has come less and less effective as power became more of a concern. Then we moved into a multi core era which is live and kicking today and is pretty much the central AMD CPU strategy, but even then it is increasingly requiering additional processing performance to handle some of the more popular applications which are more consuming. Even business suites use more and more graphical type of applications in their busineses. That's kind of the very simplistic high level of view of how we landed on direction which we took with Llano and APUs, and how we bring in the more paralel type of processing alongside the CPU to handle also something that is not strictly dedicated to graphics, where programmers can use the paralel architecture for more traditional CPU sequential tasks.
IHW: Ok, so what about the power consumption? As we can see from the promotional slides, power consumption is 2.5 and 25 W, it's a quite difference between minimum and maximum and also it's quite low comparing to CPUs that are in use right now.
SN: For one, the 32nm technology provides power efficiency which helps drive the power consumption down, and this is a mobile focus design, so we enthisise low voltage operation. As you may know these voltages are one of the most efficienly in terms of improving the performance per wat. The other thing is this is the CPU core power consumption, it's not the entire APU. And the range of usage of this APU core is extremely broad, so it's the key feature providing great response time, good consumer experience when the performance is needed, and when it's not needed, it can deliver the good battery life
IHW: For example, right now the HD 5000 series, their best is about 2.2 billions of transistors per chip. It's much much more than 35 milions of transistors in Llano core, so can you share with us exactly what number of transistors will the end core have?
SN: We're not disclosing details of the chip level right now.
IHW: You are aware that Intel already introduced the CPU and the GPU on the same socket, and they managed to put two different manufacturing processes in the same package. Do you care to explain what is the difference in your aproach and their aproach?
SN: What we're doing here is very different than Intel's aproach. They're trying the Larabee path and and essentialy they're validating our approach, but what they have done is just a low-end integrated graphics processor slapped in a multi chip module. That's very different of what we're doing with the Llano APU where we are taking industru leading graphics technology and fully integrating it with the CPU. Our solution shares the same memory subsystem and the integrated bus, which have significant advantages over just a low-end IGP attached in the CPU module.
IHW: When you say industry leading GPU technology is used in Llano, do you mean that you will be using the same Stream technology which exists right now in DirectX 11 discrete accelerators, or you will take some other approach?
SN: Well, it's the DirectX 11 capable GPU, and that's one of the largest differences, but we're not getting into the specifics of its capability versus what Intel's integrating in its CPU, but suffice to say it's fully capable DirectX 11 GPU which is different from what is integrated in Core i3 and Core i5 products
IHW: Let's go back to power management, what is the largest advancement in Llano?
SN: As you can imagine the level of integration the four cores with advanced graphics processor here, all in a mobile form factor, it requires a very advanced power management. One of those types of managing this kind of paralel processing environment is to be able to power gate any unit that's not needed at the particular time. And what we're done here is exploit some of the unique features of our process technology, Silicon on Insulator (SOI), and enabling efficient power gating approach on this x86 cores. So when the workload requires one, two or three cores and not all four, the unused cores burn a neglible amount of power. The goal of this is that we provide the best performance per wat across the broad range of consumer workload.
IHW: When you talk about SOI technology, and at the same time about 32nm technology with immersional litography, what exactly are the benefits when you have this two technologies at the same time on this manufacturing process size?
SN: The aspect of SOI here, that works synergisticly with our power gating, is that there is no source stream junction substrate. So instead of gating the ground, in the bulk technology power has to be gated. But we can just flood the ground and if there's no dyas connected to substrates, there is no leakage and it becomes a lot more efficient system.
IHW: What about the other innovations in Llano?
SN: The second inovation here is that we are enabling very efficient chip-level power management approach by using a digital power-meter integrated in each one of this cores. So in a nutshell, we sampled nearly one hundred signals until we determined core-level power consumption. And it real-time monitors the core operation and execution and we have maintained a running number of power consumption at the chip level which can be used for a variety of power management functions. So we found that this digital aproach not only provides better accuracy, so that provides full repeatability as the subject of environmental variations, that comes along with other aproaches used in industry that involves temperature sensors and AV-meters.
IHW: I don't know if you can discuss this, but when you said that the power management is important to you, for example there is one or more cores that is not in use at the moment. How much power will they consume in this lowest power state?
SN: If they're power gated? Well all we're saying right now is, less than 10% of the leakage power. So if you're familiar with the litography processor design, the leakage is typicaly 20-30% of the total power consumption. Our implementation is that the leakage component by power gating reduced by factor of 10.
IHW: When you are saying that GPU will be integrated in the same architecture with the CPU, will there be some overlapping between GPU and CPU functionality, will they share for example the same memory controller or it will be different, will the clocks be different, e.g. different frequency domains?
SN: Well, some of your questions require more details than we're ready, but we're making it clear that in this synergy the GPU and the CPU share the same memory subsystem and that provides many levels of improvement for both components and they also have a high speed on-die bus for improved communications between CPU and GPU.
IHW: Llano x86 core is basically a 32nm version of Phenom II core, if we heard correct. So is it a straight-forward shrink, or are there some significant changes when we are talking about new core?
SN: It's not a rebranded core. I wouldn't say it's significant architectural overhaul but there are basicaly some feature tweaks that address performance oportunity, so it will be a higher performing version of the legacy core, but it's the same fundamentals.
IHW: When you said about immersion litography, we know that right now there are some problems about yield levels at your TSMC partner on 40nm GPU side. What kind of yield level can we expect from Llano platform?
SN: Well, TSMC is not the manufacturing company for Llano, it's Global Foundries, but obviosly our expectations are very high, I mean that's the critical part of what we're doing here so you are acknoledging the challenges we're bringing, and of the GPU which was previosly made in bulk process by the company, and the SOI process by the different fab, but what we're working now is confident, Global Foundries will deliver it's promise, that's really all I can say at this point.
IHW: When you are saying that it's different type of manufacturing, that it's a bulk process, on the CPU side that is completely different, what will you do about GPU side? Will you still use the bulk technology, or you will somehow merge it with SOI technology?
SN: Well that's not feasible, there's only one way because it's a single die, it's a single chip. Discrete equivalent of the DX11 GPU will continue to be produced in bulk technology. We're not moving the discrete GPUs to SOI. But for Llano, it's a single integrated die so that GPU is being produced on the SOI process.
IHW: Well if you can just clarify something for us, when we are talking about 32nm manufacturing process, the aproach that AMD has and Intel has, can you just give us the highlights of biggest differences between these two aproaches?
SN: Beyond the obvious SOI versus bulk technology, we weel we have additional ground of experience because of our second generation of immersion lithography process, they are different, but that is area of expertise of Global Foundries.
IHW: Can you maybe reach some higher frequencies than your competitors?
SN: The frequencies acheved are very specific design driven, so I thing it's efficient to say we believe we have better processors and more efficient in this Global Foundry technologies than anyone elses technologies.



LinkBack URL
About LinkBacks





Reply With Quote
















Bookmarks