AMD Launches Mobile Kaveri APUs
by Jarred Walton on June 4, 2014 12:01 AM ESTAMD Mobile Kaveri SKUs
The previous generation Trinity/Richland APUs came in four variants: at the top were the highest performance 35W TDP standard voltage (SV) parts, and then we dropped into the ultra-low voltage (ULV) range with several variations: 25W, 19W, and 17W TDPs were all available. With Kaveri, AMD is mostly consolidating their lineup into two classes: SV 35W TDP parts, and ULV 19W TDP parts. AMD did mention at one point that the 19W parts can be configured to target different TDPs, however, allowing the laptop OEM to target anywhere from 15W to 25W if desired. There's also at least one 17W TDP part, which we'll get to in a moment.
One other item that AMD emphasized was their new idea of "Compute Cores", which is a way for them to compare performance potential of CPU and GPU cores. GPUs typically have hundreds of cores that are good for very specific tasks whereas CPUs have a few cores that are general purpose, but the GPU functionality is becoming increasingly complex so AMD wanted a way to compare them. For the purposes of Kaveri (and GCN graphics cards), AMD has defined a Compute Core as being "any core capable of running at least one process in its own context and virtual memory space, independently from other cores." What that means is each GCN Compute Unit counts as a Compute Core, and each CPU thread (two per Steamroller module) counts as a Compute Core.
AMD 35W Standard Voltage Mobile APUs
|
||||||
Trinity
|
Richland
|
Kaveri
|
||||
Model | A10-4600M | A8-5557M | A10-5757M | A8-7200P | A10-7400P | FX-7600P |
Core Name | Trinity | Richland | Richland | Kaveri | Kaveri | Kaveri |
Microarch | Piledriver | Piledriver | Piledriver | Steamroller | Steamroller | Steamroller |
Modules/Cores | 2/4 | 2/4 | 2/4 | 2/4 | 2/4 | 2/4 |
CPU Base Freq | 2300 | 2100 | 2500 | 2400 | 2500 | 2700 |
Max Turbo | 3200 | 3100 | 3500 | 3300 | 3400 | 3600 |
TDP | 35W | 35W | 35W | 35W | 35W | 35W |
L1 Cache |
128KB I$ 64 KB D$ |
128KB I$ 64 KB D$ |
128KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
L2 Cache | 2x2MB | 2x2MB | 2x2MB | 2x2MB | 2x2MB | 2x2MB |
Graphics | HD 7660G | HD 8550G | HD 8650G | R5 | R6 | R7 |
GPU Cores | 384 | 256 | 384 | 256 | 384 | 512 |
GPU Clock | 685 | 720 | 720 | 626 | 654 | 686 |
Max DDR3 | 1600 | 1600 | 1600 | 1866 | 1866 | 2133 |
Starting at the top with the 35W APUs, these will be the highest performance mobile parts. At launch there will be three 35W APUs: the "entry" A8-7200P, "mainstream" A10-7400P, and "enthusiast" FX-7600P. All three APUs use the P suffix to indicate that they’re 35W parts. And right away, we see some interesting changes from the Trinity/Richland lineup.
First, you're sure to notice the use of the FX branding. Make no mistake: this is the same APU as the other Kaveri parts and it has no relation to the desktop FX processors; AMD marketing simply feels the FX brand has a good reputation among enthusiasts and consumers and they wanted to carry that over into the mobile world. Of course this also solves the question of what to call the new highest-end APU; Llano had A4/A6/A8 while Trinity used A6/A8/A10; Kaveri will use A6/A8/A10/FX (so no A12 for now; sorry).
Looking at the core clocks, the base clocks haven't changed too much (2.4-2.7GHz with Kaveri compared to 2.1-2.5GHz on Richland), but Turbo Core clocks have gone up quite a bit. AMD noted that a lot of work went into tuning the mobile Kaveri APUs for power, with one of only four Corporate Fellows at AMD being in charge of that area. The result is that Kaveri should run closer to its Turbo clock in many situations, and the maximum clock speeds have increased from 3.0-3.5GHz on Trinity/Richland to 3.3-3.6GHz on Kaveri. That might not seem like a big deal at first, but keep in mind a couple of points. First, at the same clock speed the Steamroller cores in Kaveri should be about 15-20% faster than the Piledriver cores in Trinity/Richland, thanks to architectural improvements. Second, on the desktop Richland actually topped out at 4.1/4.4GHz while Kaveri only reaches 3.7/4.0GHz, so while Kaveri still enjoys architectural improvements it had to be clocked slower; we don’t see that with the mobile parts.
AMD's Joe Macri (Corp VP and Product CTO Global Business Unit) discussed some of the design decisions that went into Kaveri, noting that choosing the right type of transistor for an APU is different than building a pure CPU. At a high level, there are “V” shaped transistors frequently used with CPUs that can run at higher clock speeds, and “T” shaped transistors that tend to work better in the highly parallel design of GPU cores. From what I could gather, AMD used "V" transistors in Trinity/Richland but has switched to "T" transistors for Kaveri, which explains the drop in maximum clock speed. Joe also noted that 47% of the Kaveri core is dedicated to GPU, again highlighting the importance of the transistor choice.
Moving over to the GPU side of things, each of the 35W APUs comes with a different GPU configuration, using 256, 384, or 512 stream processors for the A8/A10/FX parts, respectively. GPU clock speeds top out at 686MHz with Kaveri compared to 720MHz with Richland, so this is another small step back but GCN’s architectural efficiency and increase in SPUs in the FX-7600P should more than compensate. That said, feeding these increasingly powerful GPUs becomes an increasingly difficult task, which is why maximum memory clock speeds are up to 1866MHz for the A8 and A10, and meanwhile the FX processor goes one further to 2133MHz. Given just how hard it is to feed a fully enabled APU like the FX-7600P – a problem we’ve already seen on the similarly configured desktop SKUs – the memory bandwidth increase is a welcome sight.
AMD 17W Ultra-Low Voltage Mobile APUs
|
||||
Trinity
|
Richland
|
Kaveri
|
||
Model | A4-4455M | A4-4145M | A6-5345M | A6-7000 |
Core Name | Trinity | Richland | Richland | Kaveri |
Microarch | Piledriver | Piledriver | Piledriver | Steamroller |
Modules/Cores | 1/2 | 1/2 | 1/2 | 1/2 |
CPU Base Freq | 2100 | 2000 | 2200 | 2200 |
Max Turbo | 2600 | 2600 | 2800 | 3000 |
TDP | 17W | 17W | 17W | 17W |
L1 Cache |
128KB I$ 64 KB D$ |
128KB I$ 64 KB D$ |
128 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
L2 Cache | 2MB | 1MB | 1MB | 1MB |
Graphics | HD 7500G | HD 8130G | HD 8410G | R4 |
GPU Cores | 256 | 128 | 192 | 192 |
GPU Clock | 424 | 554 | 600 | 553 |
Max DDR3 | 1333 | 1333 | 1333 | 1600 |
AMD 19W Ultra-Low Voltage Mobile APUs
|
|||||
Trinity
|
Richland
|
Kaveri
|
|||
Model | A8-4555M | A8-5545M | A8-7100 | A10-7300 | FX-7500 |
Core Name | Trinity | Richland | Kaveri | Kaveri | Kaveri |
Microarch | Piledriver | Piledriver | Steamroller | Steamroller | Steamroller |
Modules/Cores | 2/4 | 2/4 | 2/4 | 2/4 | 2/4 |
CPU Base Freq | 1600 | 1700 | 1800 | 1900 | 2100 |
Max Turbo | 2400 | 2700 | 3000 | 3200 | 3300 |
TDP | 19W | 19W | 19W | 19W | 19W |
L1 Cache |
128KB I$ 64 KB D$ |
128KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
L2 Cache | 2x2MB | 2x2MB | 2x2MB | 2x2MB | 2x2MB |
Graphics | HD 7600G | HD 8510G | R5 | R6 | R7 |
GPU Cores | 384 | 384 | 256 | 384 | 384 |
GPU Clock | 424 | 554 | 514 | 533 | 553 |
Max DDR3 | 1333 | 1333 | 1600 | 1600 | 1600 |
Moving on to the 17W/19W parts, there's apparently a single 17W APU, the A6-7000, along with three 19W APUs. (AMD didn't provide details on the A6-7000 at the briefing, but we've since confirmed the above specifications, and there's also a Pro equivalent -- see below.) Richland had two 17W parts and one 19W part, so AMD has sort of flipped roles here. As expected the 17W A6-7000 is a rather lean chip; it has one Steamroller module and 192 GCN steaming processors (for a total of five Compute Cores, if you're counting), with a base/turbo CPU clock speed of 2.2GHz/3.0GHz while the GPU turbo clock stands at 553MHz. Compared to the previous generation Richland processors, the A6-7000 gains all of Kaveri’s architectural improvements along with an additional 200MHz for the maximum CPU turbo clock. GPU clock speeds on the other hand take a hit, but this is offset by GCN’s greater performance and a badly needed increase in the maximum DDR3 memory clock speed. In fact with mobile Kaveri, DDR3-1600 is now the baseline, with all processors supporting 1600 or better.
Meanwhile the 19W parts all have two Steamroller modules, and maximum base/turbo clock speeds only differ by 300MHz; most of the differentiation comes in the GPU department. The A8-7100 includes four GCN CUs, the A10-7300 has six CUs with slightly higher GPU clocks, and the FX-7500 also has six CUs with another moderate bump in clock speed. AMD has also brought along the "R-series" branding for the GPUs, so the A6-7000 gets an R4 GPU, the A8 is an R5, A10 gets an R6, and the FX has an R7 GPU. Maximum GPU clocks are again down in some cases compared with Richland, though architectural difference should more than cover any loss in clock speed.
AMD 17/19W Commercial Pro Series ULV APUs
|
|||
Model | A6 Pro-7050B | A8 Pro-7150B | A10 Pro-7350B |
Core Name | Kaveri | Kaveri | Kaveri |
Microarch | Steamroller | Steamroller | Steamroller |
Modules/Cores | 1/2 | 2/4 | 2/4 |
CPU Base Freq | 2200 | 1900 | 2100 |
Max Turbo | 3000 | 3200 | 3300 |
TDP | 17W | 19W | 19W |
L1 Cache |
192 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
192 KB I$ 64 KB D$ |
L2 Cache | 1MB | 2x2MB | 2x2MB |
Graphics | R4 | R5 | R6 |
GPU Cores | 192 | 384 | 384 |
GPU Clock | 533 | 533 | 553 |
Max DDR3 | 1600 | 1600 | 1600 |
There's one final category of APUs, which appear to be AMD's equivalent of Intel's SIPP (Stable Image Platform Program) CPUs: the AMD Pro Series. There are three Pro APUs, the A10 Pro-7350B, A8 Pro-7150B, and A6 Pro-7050B. These APUs are functionally equivalent to the FX-7500, A10-7300, and A6-7000 respectively. The Pro series targets business customers with a message of commercial stability and management. AMD guarantees that these APUs will remain available for an extended period of time, so enterprise customers won't need to worry about validating new hardware for a couple years.
Besides offering different levels of performance, AMD is also differentiating their Kaveri APUs based on other features. Eyefinity and TrueAudio support will be limited to the A10 and FX APUs; the A6/A8 APUs lose this functionality. Similarly, the A6 does not have Dual Graphics functionality; I didn't have much luck with Dual Graphics on Richland/Trinity laptops, but with the iGPU and dGPU both being GCN architectures now there's at least more potential to extract additional performance through CrossFire. Finally, note that the A6 APUs get ARM TrustZone functionality, whereas the higher-end A8, A10, and FX APUs do not.
All of the above was discussed previously (and then pulled), but we have something new to add with today's launch: a performance preview.
125 Comments
View All Comments
beggerking@yahoo.com - Wednesday, June 4, 2014 - link
icrap and affordable? what a joke! loldon't forget outdated low res screen, no pen input, no touch screen, and still more expensive.
Surface Pro is much better and much more affordable.
Galatian - Wednesday, June 4, 2014 - link
Exactly how is the 13" and 15" rMBP screen outdated and low res?Oh and: not everybody shopping for a notebook needs a touch display nor pen input. I even give you a reason why the 12" Surface 3 will not replace either notebooks or tablets: 750g.
nerd1 - Wednesday, June 4, 2014 - link
1st gen iPad 3G weighed 730gr and everyone said it's as light as a feather.Oh, and macbook still looks and weighs exactly same since then.
Galatian - Wednesday, June 4, 2014 - link
Your point? They are light now. I owned the 1st gen iPad and it was way to heavy to comfortably hold in my hands to read. I since switched to the iPad mini for its lightness and small form factor, nothing the Surface can provide me with. So I still tuck along my notebook to write stuff and use the iPad for media consumption. The surface simply doesn't work for me, but your mileage might vary. Just don't make it sound it's the holy grail that truly combines both tablets and notebooks, because it isn't. At least not for all.Morawka - Wednesday, June 4, 2014 - link
well for one it's not a IPS Screen, and 2, it's a low res panelGalatian - Wednesday, June 4, 2014 - link
The iPad mini with retina? You need to check your facts.ShieTar - Wednesday, June 4, 2014 - link
The 35W AMD APU is giving you about 75% of the mobile gaming performance that you get with a 15W Intel CPU + 50W nVidia GPU; but over 200% of the 15W Intel APU performance. That's a very decent result in my opinion. Sure it's slower as a pure CPU, but I don't think it will hit its 35W TDP limit when the GPU part is dormant, either.Another interesting comparison might have been to see how FX-7600P fairs against something like an i7-4558U, i.e. an Intel APU with a ~30W TDP and an Iris GPU.
Roland00Address - Wednesday, June 4, 2014 - link
While I agree with you perfectly on what you said, I want to bring up another point. The gt750m is a kepler part with 384 stream processors (2 SMX). We now have maxwell on the market and maxwell is very good. With maxwell you get 640 stream processors (5 SMM, 1 SMM is roughly 90% of an SMX) in roughly the same tdp that as the gt750m. Thus nvidia+intel is able to get a large increase in number of calculation units in roughly the same form factor.So in sum AMD is able compete with intel and intel+nvidia only on price and/or time to market. Intel and Intel+Nvidia can meet AMD on graphic efficiency, form factors, and Intel is faster in cpu tasks. Sadly I wish AMD was doing better than they are, not because AMD is bad but I want more competition and more competition is always good for the end user. Problem is AMD may bring their B+ game often but when you have Intel and Nvidia as competitors you need to bring your A+ game to win for your competitors are just as talented as you are and they have far more design resources due to greater revenue and cash on hand which means more money spent on engineers.
Krysto - Wednesday, June 4, 2014 - link
Intel cheats with Turbo-Boost, though, being able to win 5-minute benchmarks because of Turbo-Boost, while also claiming long battery life because Turbo-Boost barely gets any use, so the chip is actually running at lower performance than advertised in the long term.Galatian - Wednesday, June 4, 2014 - link
And AMD doesn't turbo? o.O