... In the Cloud

In the early days of the ARM server hype, the term "micro server" was used a lot. Then the term became associated with "wimpy cores" and marketing people avoided it at almost any cost. But the word might make a comeback, as developers are writing more and more micro services: a way of breaking complex software down into small components that each perform a distinct task.

One of the cool things micro services make possible is letting software scale out horizontally very quickly when necessary, while running on instances/virtual machines/servers with modest resources when it is not. This helps a lot to keep costs down and performance high when you are running on top of a public cloud. In other words, public clouds are encouraging this kind of development.

Granted, at the moment micro services mostly run inside virtual machines on top of the brawnier Xeon E5s. But it is not too far-fetched to say that some of the I/O-intensive micro services could find a home in a cheaper and "right-sized" physical server. All of these micro services need low-latency network connections, as one of the disadvantages of micro services is that software components talk to each other over the network instead of exchanging data and messages in RAM.
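
To make that last point concrete, here is a minimal, self-contained sketch (our own illustration, not tied to any product mentioned here) of a trivial "lookup" micro service: the same function is first called in-process and then queried over HTTP on an arbitrary local port, and the timings show roughly how much a network round trip adds compared to a call in RAM.

# Minimal sketch: in-process call vs. the same request over HTTP.
# Everything here (service name, port, payload) is made up for illustration.
import json
import threading
import time
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

def lookup(item_id: str) -> dict:
    # The "business logic": trivially cheap when called in-process.
    return {"item": item_id, "in_stock": True}

class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        body = json.dumps(lookup(self.path.strip("/"))).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)
    def log_message(self, *args):
        pass  # keep the demo quiet

# Port 8080 is arbitrary; pick another if it is already in use.
server = HTTPServer(("127.0.0.1", 8080), Handler)
threading.Thread(target=server.serve_forever, daemon=True).start()

N = 1000
t0 = time.perf_counter()
for _ in range(N):
    lookup("widget-42")
t1 = time.perf_counter()
for _ in range(N):
    urllib.request.urlopen("http://127.0.0.1:8080/widget-42").read()
t2 = time.perf_counter()

print(f"in-process call: {(t1 - t0) / N * 1e6:8.1f} us")
print(f"HTTP round trip: {(t2 - t1) / N * 1e6:8.1f} us")
server.shutdown()

Note that this runs over loopback; in a real cluster the NICs and switches add latency on top, which is exactly why a low-latency fabric matters for this style of software.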

And of course, web farms moved to this kind of architecture long before the rise of micro services: caching servers, static and dynamic web servers, and databases all run on separate machines. The distributed architecture of these web farms craves fast, low-latency networking.

The Silver Lining

Remember our coverage of the first ARM-based server, the Calxeda-based server? The single-threaded performance was pretty bad, the pricing was too high, and the 32-bit A9 cores limited each server node to a paltry 4 GB. But the low-power, high-performance network fabric and the server node technology delivered a pretty amazing performance-per-watt ratio and very low network latency. Calxeda's fabric simply came too early, as the ARM SoCs were not good enough at that time. An A15-based ECX-2000 was developed as a stopgap measure, but Calxeda ran out of money. That was not the end of the story, however.

Yes, Silver Lining has bought up Calxeda's IP. The current offering is still based upon the ECX-2000 (A15 cores). Once they adopt the Opteron A1100, the "Calxeda fabric" will finally be freed from its old 32-bit ARM shackles.

And we don't have to wait for a "fully clustered server": Silver Lining also has the FIA-2100 fabric switch available as a PCIe card. Basically, you can now have a Calxeda-style cluster, but at rack level.

You buy one top-of-rack (ToR) switch (the light blue bar above) and 12 FIA fabric switches to make a cluster of 12 servers. You connect only one out of every three servers to the ToR switch and interconnect the other servers via the FIA-2100 NICs. The (programmable) intelligence of the FIA-2100 fabric then takes over and gives you a computer cluster with very low latency, redundancy, and failover at a much lower cost than traditional networking, just like the good old Calxeda-based clusters. At least, that is what Silver Lining claims, and we give them the benefit of the doubt: it is definitely an elegant way to solve the networking problem.
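
As a quick sanity check on the cabling argument, the back-of-the-envelope sketch below (mirroring the 12-server example above with our own numbers, not vendor figures) counts how many ToR switch ports the two approaches consume when only one in three servers needs an uplink.

# Back-of-the-envelope sketch of the cabling claim above. The cluster size and
# uplink ratio mirror the 12-server example in the text; everything else is
# our own illustration, not a vendor figure.
import math

SERVERS = 12        # servers in the example cluster
UPLINK_RATIO = 3    # one out of every three servers gets a ToR uplink

traditional_tor_ports = SERVERS                        # every server cabled straight to the ToR switch
fabric_tor_ports = math.ceil(SERVERS / UPLINK_RATIO)   # the rest reach the ToR through the FIA-2100 fabric

print(f"ToR ports needed, traditional:     {traditional_tor_ports}")
print(f"ToR ports needed, with the fabric: {fabric_tor_ports}")
print(f"10GbE ToR ports saved:             {traditional_tor_ports - fabric_tor_ports}")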

The FIA-2100 NIC is supported on the new A1100 platform. However, it is not all good news for AMD and ARM. This technology used to be limited to ARM SoCs, but now that the Calxeda fabric is PCIe technology, it will also work with Intel x86 servers. There is a good chance that the first "Calxeda fabric based cluster in a rack" will be powered by Xeon Ds.

We may assume, though, that Silver Lining's "non-rack" or "inside one server" product will most likely be A1100-based, as their current product is also ARM-based.

So there is a chance that the AMD A1100 will find a home in its own "Moonshot-like" chassis. A silver lining in the dark clouds of delays.

Conclusion

So how do we feel about the A1100? It is late, very late. The expected performance and power consumption are most likely not competitive with what Intel has available, let alone what Intel will launch in a few months. But at last, AMD has managed to launch a 64-bit ARM server SoC that has the support of all major Linux distributions and that can benefit from all the progress the Linux community makes, instead of relying on a specially adapted distribution.

The most important things, like ACPI and PCI Express support, seem to be working. AMD has paid a high "time to market" price for being a 64-bit ARM server pioneer: the A1100's schedule suffered from the teething problems of the ARM server ecosystem. Still, the A1100 might be a good way to finally kickstart the ARM server market. Thanks to the Linaro "96Boards Enterprise Edition", a $300-400 SoC + board should be available soon, making it much cheaper to build software for the 64-bit ARM ecosystem. And thanks to Silver Lining, complete clusters of A1100 servers might get the attention of the cloud providers.

This may pay off in the near future, on the condition that the K12 core is delivered in a timely manner (2017). Because at the end of the day, there are no excuses left for AMD or ARM: if ARM servers are to be successful, they will finally have to deliver instead of promising dreamt-up server market share percentages at ever-receding future dates.

Comments

  • Lonyo - Thursday, January 14, 2016

    In terms of manufacturing and performance: since Samsung recently inked a deal to fab some Qualcomm chips, and GloFo/Samsung have a partnership, is there not potential for a somewhat rapid improvement in process, using an advanced, up-and-running, and potentially available 16nm process from Samsung within a short period of time (<18 mo), to bring them closer to the new Atoms that are due?
  • Kevin G - Thursday, January 14, 2016

    In 18 months we should be looking at the arrival of AMD's K12 chip if all goes according to plan.
  • nandnandnand - Thursday, January 14, 2016

    K12 is expected to be on a 14/16nm process, right?
  • medi03 - Friday, January 15, 2016

    Most likely 14nm (Samsung's process, with GloFo on board)
  • Jtaylor1986 - Thursday, January 14, 2016

    Why did you ever bother to write this article? I hope 2 pages of bashing AMD got whatever grievance you had out of your system. I will look forward to more professional articles in the future.
  • Lonyo - Thursday, January 14, 2016

    I could 1 page of bashing AMD for being late and non-competitive in raw number terms. Then 2 pages of explaining potential use cases.
  • Lonyo - Thursday, January 14, 2016

    *Count.
  • Jtaylor1986 - Thursday, January 14, 2016

    Begs the question why cover it at all if this is a totally uncompetitive product that is effectively dead on arrival. Why spend resources covering a product nobody will ever buy when there are numerous products lots of people buy that they haven't covered?
  • eanazag - Thursday, January 14, 2016

    Cover it because it is finally done. Plus 14 SATA 3 connectors and dual 10 GigE is nothing to scoff at. Add to that a good size of DDR4 ECC RAM and we have something to think about. I suggest you price shop dual 10 GigE for less than $400 and see what you get. I think those items make it worth considering despite the proc being uncompetitive on several fronts. Don't be confused; this is a niche system. This is not a jack of all trades setup.
  • beginner99 - Friday, January 15, 2016

    Still, even for real use (business) or just for myself for fiddling around, I would go with the Xeon D. It's just so much more powerful. Yes, it costs more, but not much. Even if it's $500 vs. $1000, the software you run on it will cost 10x that amount.
