How Roblox Reduces Spark Join Query Costs With Machine Learning Optimized Bloom Filters

Abstract Every day on Roblox, 65.5 million users engage with millions of experiences, totaling 14.0 billion hours quarterly. This interaction generates a petabyte-scale data lake, which is enriched for analytics and machine learning (ML) purposes. It’s resource-intensive to join fact and dimension tables in our data lake, so to optimize this and reduce data shuffling, […]

Continue Reading

How Roblox Avatar Tech Is Evolving

We currently support two different tech stacks for avatars: A legacy tech stack (R6) that supports older avatars and experiences; and a newer tech stack (R15) that supports all avatar styles and capabilities. To ensure that any avatar style will work in any experience and everyone can access the latest features, we’re working to unify […]

Continue Reading