Software Engineer, Systems ML - Model Optimization (PhD)

Seattle, WA | Burlingame, CA | New York, NY

Apply now Apply later

Meta is looking for software engineers to play a pivotal role designed to further enhance and elevate our AI inference infrastructure. As a member of our team, you will play a significant role in improving the latency and power consumption of our AI models and in building user facing APIs for our ML engineers. Your expertise will enable us to reach new heights in enabling efficient model inference. The position requires a combination of expertise in machine learning and software engineering.Software Engineer, Systems ML - Model Optimization (PhD) Responsibilities
  • Fine tune, quantize and deploy ML models on-device across phones, AR and VR devices.
  • Optimize models for latency and power consumption.
  • Enable efficient inference on GPUs.
  • Build tooling to develop and deploy efficient models for inference.
  • Partner with teams across meta reality labs to optimize key inference workloads.
Minimum Qualifications
  • Currently has or is in the process of obtaining a PhD in the field of Computer Science, Computer Engineering or equivalent. Degree must be completed prior to joining Meta.
  • Specialized experience in the following machine learning/deep learning domains: model quantization, compression, on-device inference, GPU inference, PyTorch.
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta.
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment.
Preferred Qualifications
  • Proven record of training, fine tuning, and optimizing models.
  • 3+ years of experience on accelerating deep learning models for on-device inference.
  • Optimizing machine learning model inference on NVIDIA GPUs.
  • Familiarity with on-device inference platforms (ARM, Qualcomm DSP).
  • Experience with CUDA/Triton.
For those who live in or expect to work from California if hired for this position, please click here for additional information. LocationsAbout Meta Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics. Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com. $56.25/hour to $173,000/year + bonus + equity + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Apply now Apply later
  • Share this job via
  • 𝕏
  • or
Job stats:  1  0  0

Tags: APIs Computer Science CUDA Deep Learning Engineering GPU Machine Learning ML models Model inference PhD Physics PyTorch VR

Perks/benefits: Career development Equity / stock options Health care Salary bonus

Region: North America
Country: United States

More jobs like this