AI Training Infrastructure Engineer

Sunnyvale, CA

Figure

About the job

Figure is an AI Robotics company developing a general-purpose humanoid. Our humanoid robot, Figure 02, is designed for commercial tasks and the home. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. It’s time to build.

Figure’s vision is to deploy autonomous humanoids at a global scale. Our AI team is looking for an experienced Training Infrastructure Engineer to take our infrastructure to the next level. This role is focused on managing the training cluster and implementing distributed training algorithms, data loaders, and developer tools for AI researchers. The ideal candidate has experience building tools and infrastructure for large-scale deep learning systems.

Responsibilities

  • Design and implement software tools used to train deep neural networks and deploy them on humanoid robots
  • Maintain a reliable and performant training cluster
  • Implement distributed training algorithms
  • Implement developer tools for AI researchers

Requirements

  • Bachelor's or Master's degree in Computer Science, Robotics, Engineering, or a related field
  • Experience with Python and PyTorch
  • Experience managing HPC clusters for deep neural network training
  • Minimum of 4 years of professional, full-time experience building reliable backend systems

Bonus Qualifications

  • Experience managing cloud infrastructure (AWS, Azure, GCP)
  • Experience with job scheduling / orchestration tools (SLURM, Kubernetes, LSF, etc.)
  • Experience with configuration management tools (Ansible, Terraform, Puppet, Chef, etc.)

The US base salary range for this full-time position is $140,000 to $220,000 annually.

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.




Tags: Ansible AWS Azure Computer Science Deep Learning Engineering GCP HPC Kubernetes Puppet Python PyTorch Robotics Terraform

Perks/benefits: Career development

Region: North America
Country: United States
