Nvidia launched a monster box yesterday called the HGX-2, and it’s the stuff that geek dreams are made of. It’s a cloud server that’s supposed to be so powerful it combines high performance computing with artificial intelligence requirements in one exceptionally compelling package.
You know you want to know the specs, so let’s get to it: It starts with 16 Nvidia Tesla V100 GPUs. That’s good for 2 petaFLOPS for AI with low precision, 250 teraFLOPS for medium precision and 125 teraFLOPS for those times when you need the highest precision. It comes standard with half a terabyte of memory and 12 Nvidia NVSwitches, which enable GPU-to-GPU communications at 300 GB per second. Nvidia has doubled the capacity from the HGX-1 released last year.
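For readers who want to sanity-check those totals, here is a minimal sketch of the arithmetic. The per-GPU figures are assumptions that do not come from this article: they are the peak numbers Nvidia publishes for the 32 GB Tesla V100 SXM part, and the mapping of "low", "medium" and "high" precision onto Tensor Core, FP32 and FP64 is likewise an interpretation. The function names are hypothetical.

```python
# Back-of-the-envelope check of the HGX-2 aggregate figures.
# ASSUMPTION: per-GPU peaks below are Nvidia's published numbers for the
# 32 GB Tesla V100 SXM part; they are not stated in the article itself.
GPU_COUNT = 16  # from the article

PER_GPU_TFLOPS = {
    "low": 125.0,    # assumed: Tensor Core mixed precision
    "medium": 15.7,  # assumed: FP32 single precision
    "high": 7.8,     # assumed: FP64 double precision
}
PER_GPU_MEMORY_GB = 32  # assumed: 32 GB HBM2 variant


def aggregate_tflops(gpu_count=GPU_COUNT):
    """Return the summed peak throughput per precision level, in teraFLOPS."""
    return {level: gpu_count * tflops for level, tflops in PER_GPU_TFLOPS.items()}


def aggregate_memory_gb(gpu_count=GPU_COUNT):
    """Return the total GPU memory across the box, in gigabytes."""
    return gpu_count * PER_GPU_MEMORY_GB


if __name__ == "__main__":
    for level, tflops in aggregate_tflops().items():
        print(f"{level} precision: about {tflops:.0f} teraFLOPS")
    print(f"GPU memory: {aggregate_memory_gb()} GB")
```

Under those assumptions the sums land on roughly 2,000, 251 and 125 teraFLOPS and 512 GB, which lines up with the rounded figures quoted above. These are theoretical peaks, not measured performance.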
Paresh Kharya, group product marketing manager for Nvidia’s Tesla data center products, says this communication speed enables them to treat the GPUs essentially as one giant, single GPU. “And what that allows [developers] to do is not just access that massive compute power, but also access that half a terabyte of GPU memory as a single memory block in their programs,” he explained.
Unfortunately, you won’t be able to buy one of these boxes. In fact, Nvidia is distributing them strictly to resellers, who will likely package these babies up and sell them to hyperscale datacenters and cloud providers. The beauty of this approach for cloud resellers is that when they buy it, they have the entire range of precision in a single box, Kharya said.
“The benefit of the unified platform is as companies and cloud providers are building out their infrastructure, they can standardize on a single unified architecture that supports the entire range of high performance workloads. So whether it’s AI, or whether it’s high performance simulations, the entire range of workloads is now possible in just a single platform,” Kharya explained.
He points out this is particularly important in large-scale datacenters. “In hyperscale companies or cloud providers, the main benefit that they’re providing is the economies of scale. If they can standardize on the fewest possible architectures, they can really maximize the operational efficiency. And what HGX allows them to do is to standardize on that single unified platform,” he added.
As for developers, they can write programs that take advantage of the underlying technologies and program in the exact level of precision they require from a single box.
The HGX-2 powered servers will be available later this year from partner resellers including Lenovo, QCT, Supermicro and Wiwynn.