ML Infrastructure Engineer
Tasks
- Architect cloud and on prem ML training and evaluation infrastructure
- Build data management pipelines from ingestion to training and evaluation
- Create processes for research to production model transition balancing cost
- Deliver projects end-to-end
- Deploy and evaluate models on test and production vehicles
- Develop cloud data infrastructure for machine learning
- Implement model deployment monitoring and MLOps
- Optimize model training using improved storage formats and techniques
- Own ML infrastructure for autonomous vehicle model development and deployment
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Airflow | Apache Spark | Argo Workflows | Cloud platform | Data Management | Data Pipelines | Distributed Systems | Distributed Training | Google Cloud | Google Cloud Platform | Infrastructure as Code | Jetson | Kubernetes | MLOps | Machine Learning | Node.js | ONNX | Python | TensorFlow | TensorRT | Terraform | Workflow Orchestration | “as-code”
Education
N/A
Related jobs
-
Automation Testing | CI/CD | CSS | Cypress | Feature DevelopmentMedical, dental & vision coverage | Paid time off | Parental leave | Reimbursement programs | Retirement planMid-levelRaleigh, United States R10d ago
-
Partner Engineering GenAI - US USD 133K-203KAPIs | Artificial Intelligence | C plus plus | Claude | Cloud ComputingSenior-level Full TimeMenlo Park, CA | Seattle, WA …2h ago
-
Machine Learning Performance Modeling Architect USD 173K-249KC# | C++ | Data Visualization | Heterogeneous computing | Image qualitySenior-level Full TimeSunnyvale, CA2h ago
-
Mid-level Full TimeSunnyvale, CA | Burlingame, CA2h ago
-
Robotics Engineer - Logistics and Material Flow USD 170K-240KAGV | Automation | Branching | C++ | Computer ScienceSenior-level Full TimeFremont, CA2h ago
-
Software Developer, Scaled Ops AI Acceleration Team USD 147K-203KAI infrastructure | Data Mining | Fine Tuning | Hack | JavaScriptSenior-level Full TimeSunnyvale, CA | Austin, TX | …2h ago
-
Automated testing | C++ | CSS | Debugging | GraphQLSenior-level Full TimeMenlo Park, CA2h ago
-
Robotics Control Engineer - Manipulation USD 170K-240KABB Rapid | AI Motion Planning | Adaptive Control | C++ | Cause analysisSenior-level Full TimeMenlo Park, CA | Fremont, CA2h ago
-
Robotics Manipulation Engineer USD 170K-240KAdaptive Control | Automation | C++ | Deep learning | GazeboSenior-level Full TimeFremont, CA2h ago
-
Software Engineer - Language (Technical Leadership) USD 213K-293KASR | Benchmarking | C# | C++ | Conversational AISenior-level Full TimeMenlo Park, CA | Seattle, WA …2h ago
-
Code review | Contamination Checking | Data Generation | Data Pipelines | Data ProcessingEntry-level Full TimeMenlo Park, CA2h ago
-
Business Support Engineer USD 136K-197KCall Support | Cloud Computing | Data Analysis | Data Mining | Docker24x7 on-call rotationEntry-level Full TimeMenlo Park, CA2h ago
-
Business Support Engineer USD 159K-223KCloud Computing | Data Analysis | Data Mining | Distributed Systems | Docker24x7 on-call rotation | Cross-functional team collaboration | Global partner supportSenior-level Full TimeMenlo Park, CA2h ago
-
Senior-level Full TimeMenlo Park, CA | New York, …2h ago
-
Research Engineer, Media Data Research - MSL FAIR USD 170K-251KComputer Vision | Data Curation | Data Generation | Data Scaling Laws | Data mixingSenior-level Full TimeMenlo Park, CA2h ago
-
Staff Software Engineer, Torch TPU USD 207K-300KCUDA | Computer Vision | Data Processing | Debugging | Distributed SystemsSenior-level Full TimeSunnyvale, CA, USA2h ago
-
C++ | Compilers | Custom Kernels | Data Processing | Data StructuresSenior-level Full TimeMountain View, CA, USA2h ago
-
Technical Solutions Engineer, Cloud AI, Google Cloud USD 150K-218KAI Model Training | AI model | Apache Beam | Apache Hadoop | Apache SparkSenior-level Full TimeSunnyvale, CA, USA; Austin, TX, USA2h ago
-
Artificial Intelligence | Machine Learning | Marketing | Product Development | Product ManagementCoaching | Community access | Relocation support | Startup hiring support | Weekly founder sparringMid-level ContractAustin, United States R6h ago
-
Principal AI Engineer - Core Platform USD 250K-290KAWS | Agents SDK | Anomaly Detection | Automated testing | Classification401k match | Company-provided phone | Health insurance | Hybrid work | PTOSenior-level Full TimeNew York, New York, United States9h ago
-
Engineer, AI Dev Tools USD 90K-120KAPI Integration | Agent architecture | Artificial Intelligence | Containerization | Data Modeling401k | Dental insurance | Health insurance | Hybrid work | Paid HolidaysMid-level Full TimeMinnetonka, MN, US12h ago
-
Engineer, AI Dev Tools USD 90K-120KAPI Integration | Agent architecture | Containerization | Data pipeline | Docker401k | Dental insurance | Health insurance | Paid Holidays | Paid time offMid-level Full TimeFoxboro, MA, US12h ago
-
Senior AI Engineer USD 170K-200KAgent systems | Agentic AI | Anthropic API | Automated evals | Backend architecture401k | Company-provided equipment | Comprehensive medical, dental and vision coverage | Disability insurance | Flexible vacation policySenior-level Full TimeRemote (United States) R12h ago
-
Checkpointing | Cloud Networking | Failure recovery | Golang | Human Feedback401k match | Cell phone stipend | Commuter benefits | Dental insurance | HSA employer contributionsSenior-level Full TimeSan Francisco, CA - US14h ago
-
Principal Engineer, AI Model LifeCycle USD 260K-326KAdapters | Checkpointing | DPO | DeepSpeed | Distributed TrainingCell phone stipend | Commuter benefits | Dental insurance | Health insurance | Mental health wellness supportSenior-level Full TimeSan Francisco, CA - US14h ago