Name: Wesley Kerr | Spark Summit 2017
Uploaded: 2017-06-06T18:06:00.000Z
Duration: 10 min 46 s

Wesley Kerr | Spark Summit 2017

Wesley Kerr from RIOT games joins CUBE hosts David Goad & George Gilbert are live from Spark Summit 2017 at the Moscone West in San Francisco CA #SparkSummit #theCUBE https://siliconangle.com/2017/07/20/game-data-science-analytics-architecture-behind-riot-games-sparksummit/ A game of data science: the analytics architecture behind Riot Games With the help of modern analytics, Riot Games Inc. developed a highly successful computer game called League of Legends, in which players form teams of champions and compete with other players around the world. Wesley Kerr (pictured), senior data scientist at Riot Games, explained how his organization is leveraging data science to improve player experience and weed out unsavory behavior. “[In] about 2 percent of our games there is some form of serious abuse that comes in the form of hate speech, racism and sexism, things that have no place in the game.” Kerr said. “Right now it’s purely based on things said in chat, but we’re investigating other ways of measuring that behavior.” Kerr gave a keynote speech at this year’s Spark Summit in San Francisco, California, and afterwards spoke with David Goad (@davidgoad) and George Gilbert (@ggilbert41), co-hosts of theCUBE, SiliconANGLE Media’s mobile live streaming studio, to dive into more detail about Riot Game’s data science stack. (* Disclosure below.) A DataBricks power player experience engine Kerr described what is under the hood at Riot Game’s data science organization. “We rely on DataBricks for all of our deployments. We do many different clusters and have about 14 different data scientists that work with us. Each one is able to manage their own cluster, spin them up tear them down, find their data and work with it through DataBricks,” Kerr explained. Kerr went on to explain the configuration of the data warehouse itself and how they manage the sheer scale of data being processed. “We’re able to leverage the power of our players; we have 100 million. … All the data flows into a hive data warehouse stored in S3. We have two different ways of interacting with it. We can run queries against Hive, which tends to be a little slower for our use cases. Our data scientists tend to access to all of that data through DataBricks and Spark, which runs much quicker for our use cases.” Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of Spark Summit 2017. (* Disclosure: DataBricks Inc. sponsored this Spark Summit 2017 segment on SiliconANGLE Media’s theCUBE. Neither DataBricks nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Share this session