Amazon has already shifted about 80% of Alexa processing onto Elastic Compute Cloud (EC2) Inf1 instances, which use the new AWS Inferentia chips. Compared to the G4 instances, which used traditional GPUs, the Inf1 instances push throughput up by 30% and costs down by 45%. Amazon reckons that they're the...
https://www.techspot.com/news/87603-amazon-ditching-nvidia-gpus-favor-their-own-silicon.html?utm_source=dlvr.it&utm_medium=blogger
https://www.techspot.com/news/87603-amazon-ditching-nvidia-gpus-favor-their-own-silicon.html?utm_source=dlvr.it&utm_medium=blogger