Wesley Kerr of Riot Games joins theCUBE hosts David Goad and George Gilbert live from Spark Summit 2017 at Moscone West in San Francisco, CA.
#SparkSummit #theCUBE
https://siliconangle.com/2017/07/20/game-data-science-analytics-architecture-behind-riot-games-sparksummit/
A game of data science: the analytics architecture behind Riot Games
With the help of modern analytics, Riot Games Inc. developed a highly successful computer game called League of Legends, in which players form teams of champions and compete with other players around the world. Wesley Kerr (pictured), senior data scientist at Riot Games, explained how his organization is leveraging data science to improve player experience and weed out unsavory behavior.
“[In] about 2 percent of our games there is some form of serious abuse that comes in the form of hate speech, racism and sexism, things that have no place in the game,” Kerr said. “Right now it’s purely based on things said in chat, but we’re investigating other ways of measuring that behavior.”
Kerr gave a keynote speech at this year’s Spark Summit in San Francisco, California, and afterwards spoke with David Goad (@davidgoad) and George Gilbert (@ggilbert41), co-hosts of theCUBE, SiliconANGLE Media’s mobile live streaming studio, to dive into more detail about Riot Games’ data science stack. (* Disclosure below.)
A DataBricks-powered player experience engine
Kerr described what is under the hood at Riot Games’ data science organization. “We rely on DataBricks for all of our deployments. We do many different clusters and have about 14 different data scientists that work with us. Each one is able to manage their own cluster, spin them up, tear them down, find their data and work with it through DataBricks,” Kerr explained.
Kerr went on to explain the configuration of the data warehouse itself and how they manage the sheer scale of data being processed.
“We’re able to leverage the power of our players; we have 100 million. … All the data flows into a Hive data warehouse stored in S3. We have two different ways of interacting with it. We can run queries against Hive, which tends to be a little slower for our use cases. Our data scientists tend to access all of that data through DataBricks and Spark, which runs much quicker for our use cases.”
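To make the two access paths concrete, here is a minimal sketch of how the same SQL could be pointed at a Hive-registered table through Spark. It is an illustration under stated assumptions, not Riot Games’ actual code: the table name `analytics.games`, the columns `region` and `game_date`, and both helper functions are hypothetical. The Spark call is imported lazily, so the query-building part loads even where PySpark is not installed.

```python
def build_daily_games_query(table: str, day: str) -> str:
    """Build a SQL query counting games per region for one day.

    The table and column names (region, game_date) are hypothetical.
    Values are interpolated directly, so this is only suitable for
    trusted inputs in a notebook-style sketch.
    """
    return (
        "SELECT region, COUNT(*) AS games "
        f"FROM {table} "
        f"WHERE game_date = '{day}' "
        "GROUP BY region"
    )


def run_with_spark(query: str):
    """Run the query through Spark SQL against Hive-registered tables.

    Assumes pyspark is installed and the cluster is already configured
    with access to the warehouse; neither is set up by this sketch.
    """
    from pyspark.sql import SparkSession  # lazy import: optional dependency

    spark = (
        SparkSession.builder
        .appName("warehouse-query-sketch")
        .enableHiveSupport()  # lets Spark read tables registered in Hive
        .getOrCreate()
    )
    return spark.sql(query)  # returns a Spark DataFrame


# Build the query text; the same string could also be submitted to Hive itself.
query = build_daily_games_query("analytics.games", "2017-06-06")
```

On a configured cluster, `run_with_spark(query).show()` would print the result; the query string is unchanged whichever engine executes it, which is the point Kerr makes about having two ways to reach the same data.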
Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of Spark Summit 2017. (* Disclosure: DataBricks Inc. sponsored this Spark Summit 2017 segment on SiliconANGLE Media’s theCUBE. Neither DataBricks nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)