Senior Software Engineer, Application Observability
San Mateo, CA, United States
Roblox
Roblox is the ultimate virtual universe that lets you create, share experiences with friends, and be anything you can imagine. Join millions of people and discover an infinite variety of immersive experiences created by a global community!
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a Senior Software Engineer, Application Observability, you will collaborate with data scientists, product managers, and leaders across the company to ensure app quality by working across multiple teams to build and scale a robust anomaly detection system for Roblox. Your work will lay the essential foundation for app quality, driving a great user experience and supporting Roblox's business growth.
You Will:
- Tailor and consolidate real-time detection and root-cause analysis solutions as needed, covering every stage from code merge, build, and deploy to rollout, full release, and ongoing monitoring.
- Collaborate with pods and teams to define and standardize key metrics, their definitions, and the dimensions essential to release and app health observability.
- Operationalize and scale metrics monitoring by enabling quick slicing and dicing of data across different types of releases and rollouts, as well as root-cause analysis.
- Work with Data Scientists and ML Modelers to fine-tune severity thresholds for production incidents based on business impact insights from experiments and causal learning.
- Collaborate with engineers to detect and investigate abnormal trends, while proactively and continuously identifying new opportunities to improve visibility, alerting, tooling, and processes throughout the entire release and app health observability lifecycle.
- Be a technical leader for the team, mentor junior engineers, and help recruit future talent for the team.
You Have:
- Expertise in back-end software engineering with 6+ years of experience building scalable, distributed systems.
- Experience with monitoring and observability for large, consumer-facing applications, ideally with a focus on client-side monitoring.
- Comfort rolling up your sleeves and diving into client and application code when needed.
- Experience with defining the correct charts, alerts and queries to understand the health of large applications and systems.
- Proficiency in the incident response process.
- Excitement to collaborate with multiple teams and build long-term solutions for all of Roblox.
- An understanding of statistics; familiarity with deploying and running ML models at large scale is a strong plus.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.
Annual Salary Range: $233,840 to $283,780 USD
Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).
You’ll Love:
- Industry-leading compensation package
- Excellent medical, dental, and vision coverage
- A rewarding 401k program
- Flexible vacation policy (varies by exemption status)
- Roflex - Flexible and supportive work policy
- Roblox Admin badge for your avatar
- At Roblox HQ:
  - Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
  - Onsite fitness center and fitness program credit
  - Annual CalTrain Go Pass
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.
Tags: Distributed Systems Engineering Machine Learning ML models Statistics
Perks/benefits: Career development Equity / stock options Flex hours Flex vacation Health care Unlimited paid time off