Naveen Rao, Intel | AWS re:Invent 2019
Naveen Rao, GM of AI at Intel, sits down with Justin Warren & Dave Vellante for AWS re:Invent 2019 at the Sands Expo & Convention Center in Las Vegas, NV. #reInvent #Intel #theCUBE https://siliconangle.com/2020/01/07/intels-focus-ai-maturation-guides-data-center-strategy-reinvent-guestoftheweek/ Intel’s focus on AI maturation guides its data center strategy When it comes to advancing the field of artificial intelligence, the ultimate prize is still clear. The goal is to come as close as possible to the power of the human brain. For researchers at the forefront of AI development, such as Naveen Rao (pictured), vice president and general manager of the artificial intelligence products group at Intel Corp., achieving near parity with a human’s cognitive ability remains a long way off. “Back in 2013, there were 10 million or 20 million parameters, which was very large for a machine-learning model,” Rao said. “Now they’re in the billions. The human brain is 300 trillion to 500 trillion models, so we’re still pretty far away from that; we’ve got a long way to go.” Rao spoke with Dave Vellante, host of theCUBE, SiliconANGLE Media’s mobile livestreaming studio, and guest host Justin Warren, chief analyst at PivotNine Pty Ltd., during Amazon Web Services Inc.’s re:Invent conference in Las Vegas. They discussed the role of Intel’s processor technology in machine learning, intelligence for cloud and edge computing, the impact of recent neural network training tools, AI for good and the future of autonomous cars. (* Disclosure below.) This week, theCUBE features Naveen Rao as its Guest of the Week. Research advances AI While the processing gap remains sizable, Rao and Intel are working on several projects to move AI forward. It is work not only fundamental to the field, but also integral to Intel’s own business strategy and long-term future. Intel’s PC chip business still accounts for approximately half of its total revenue, but the second-largest segment revolves around the data center where AI is having the greatest impact. The company has been adjusting its powerful Xeon central processing unit chips to handle complex machine-learning tasks, most recently adding DL Boost to facilitate neural net performance. As developers and data scientists iterate large data sets to generate a series of outcomes, the inference of how results are rolled out and deployed becomes more significant. “Inference is all about the best performance per watt,” Rao explained. “How much processing can I shove into a particular time and power budget? On the training side, it’s much more about what kind of flexibility I have for exploring different types of models and training them very fast.” Acquisitions bolster portfolio One indicator of how seriously Intel is taking its role as a provider of AI processing in the data center can be found in its acquisition of Israel-based Habana Labs Ltd. for $2 billion in December. Habana Labs’ Goya AI Inference Processor is currently used by Facebook Inc. for its own machine-learning compiler. Intel’s acquisition of Habana Labs followed other moves the company has made in the AI space since 2016 when it purchased Nervana Inc., where Rao served as co-founder and chief executive officer. In 2018, Intel bought Vertex.Ai and its platform-agnostic AI model technology and last year open-sourced its deep neural network framework, nGraph. In November, Intel introduced its Nervana Neural Network Processors for training and inference, designed to accelerate AI system deployment from cloud to edge. “From its very inception, the machine was really meant to be something that recapitulated intelligence,” Rao said. “Everything we do is impacted by AI and will be in service of building better AI platforms for intelligence at the edge, intelligence in the cloud, and everything in between.” Training neural networks Building better AI platforms will require training deep neural networks to run more powerfully in data centers today. But for these networks to become truly effective, they must be able to generalize to understand a range of possibilities while improving overall intelligence. Welcome to the world of the 16-bit brain floating point and generative adversarial networks, or GANs. Intel will soon begin leveraging the floating point, or “bfloat16,” instruction for its Cooper Lake Xeon processors and Nervana-based training models. AI researchers have found that bfloat16 worked well across workloads and can be used for vision, speech and language applications, which explains why Intel is moving boldly down that path. The instruction is also useful in helping move key learning networks forward, such as GANs. Using generative adversarial networks has been called “the most interesting idea in the last 10 years in machine learning” by Yann LeCun, director of AI research at Facebook. ...