Software Engineer - XR Codec Interactions and Avatars Team
Pittsburgh, PA | Redmond, WA
Meta
Giving people the power to build community and bring the world closer together
XR Codec Interactions and Avatars (XRCIA) brings together a highly interdisciplinary team of researchers and engineers to create the future of augmented and virtual reality. On the Research Oriented Cluster Foundations team, you’ll work on building and maintaining tools, libraries, and frameworks that will help researchers collaborate with each other and empower their research towards the generation of Codec Interactions and Avatars. Our team cultivates an honest and considerate environment where self-motivated individuals thrive. We encourage ownership and embrace the ambiguity that comes with working on the frontiers of research.
In this software engineer role, you will serve as the point of contact for Meta's research GPU super clusters. You are a hybrid software/systems/infrastructure engineer who ensures that Meta’s Research Super Clusters run smoothly and have the capacity for future growth.
Our team is composed of people with varied levels of experience and backgrounds. Relevant industry experience is important (Software Engineer, Site Reliability Engineer (SRE), Systems Engineer, DevOps Engineer, Network Engineer, or similar role), but ultimately less so than your demonstrated attitude. We sail into uncharted waters every day at Meta’s large-scale ML model training GPU clusters, and we are always learning.
$70.67/hour to $208,000/year + bonus + equity + benefits
Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.
Software Engineer - XR Codec Interactions and Avatars Team Responsibilities
- Leverage the scale and complexity of the larger Meta infrastructure to accelerate our Codec Interaction and Avatars projects
- Influence outcomes within your immediate team, peer engineering teams, and with cross-functional stakeholders
- Work independently, handle large projects simultaneously, and prioritize team roadmap and deliverables by balancing required effort with resulting impact
- Own Research Super Cluster back-end services which handle fleet management, infrastructure components that drive Meta’s advances in AI, core services which are used by every team at XRCIA, networking systems, and everything in between
- Author and review code, develop documentation and capacity plans, and debug the hardest problems, all live, on some of the largest and most complex systems in the world
- Share an on-call rotation with your engineering team and serve as an escalation contact for service incidents. Lead incident root cause analysis across multiple data engineering layers (compute, storage, network) for GPU clusters and act as the final escalation point
Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
- 3+ years of experience in UNIX/Linux and a clear understanding of TCP/IP network fundamentals
- 5+ years of experience coding in at least one of the following languages: C++, Python, or Rust
- Experience with software development practices such as source control, code reviews, unit testing, debugging and profiling
- Experience with Internet service architecture capacity planning and/or handling needs for urgent capacity augmentation
- Knowledge of common web technologies and/or Internet service architectures (such as LAMP or MEAN stacks, CDN, Load Balancing techniques, etc.)
- Experience configuring and running infrastructure level applications, such as Kubernetes, Terraform, MySQL, SLURM, etc.
- Thorough understanding of Linux operating system, including the networking subsystem
- Experience in distributed system performance measurement, logging, and optimization
- Experience with Python library management systems such as Conda
- Prior experience in cluster oncall operations, including troubleshooting server/scheduler/storage errors, maintaining compute/storage environments/libraries/tools, helping onboard users to the cluster, and answering general questions from users
- Prior experience in cluster coordination and strategy planning, including collecting/understanding needs of users, developing tools to improve user experience, providing guidance on best practices, forecasting compute/storage needs, and developing long-term user experience/compute/storage strategies
- Prior experience building tooling for monitoring and telemetry
- Prior experience in developing/managing distributed network file systems
- Prior experience in network security
Category:
Engineering Jobs
Tags: Architecture Computer Science DevOps Engineering GPU Kubernetes Linux Machine Learning Model training MySQL Physics Python Research Rust Security Terraform Testing VR
Perks/benefits: Career development Equity / stock options Health care Salary bonus
Region:
North America
Country:
United States