Riot Games: A New Data Ingest Pipeline for League of Legends using MSK

Riot Games ingests about 20 TB of data every day on AWS.  This data powers a wide range of products including match-making, personalization, analytics, security, anti-cheat and player behavior.  Until recently, this data was only queryable hours after it was produced (in some cases up to 6).  With AWS MSK, the team was able to bring that time down to 5 minutes and unlock use cases that require stream processing.  MSK also allowed us to lower our TCO and deprecate an aging, Map-Reduce based pipeline. In this session, we will talk about the before state, migration path, current state and future state of our data ingestion pipeline.