ML Infrastructure Engineer
Tasks
- Build ML training and data generation pipelines
- Create CI/CD workflows for data pipelines
- Design training data versioning system
- Develop tooling for launching monitoring debugging training jobs
- Implement deployment rollback and monitoring
- Optimize real time model inference services
- Profile latency and throughput
- Refactor ML training scripts
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | AWS CDK | Airflow | Artifact management | Batching | CI/CD | Containers | DVC | Data Versioning | Dataset Lineage | Docker | Inference Optimization | Infrastructure as Code | LakeFS | Latency optimization | Metaflow | Model Distillation | Model Serving | ONNX Runtime | Prefect | Profiling | PyTorch | Python | Quantization | TensorRT | Terraform | Throughput | Weights and Biases | “as-code”
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Regions
Countries
States
Related jobs
-
Software Engineer, Search AI Infra Performance USD 174K-252KData Processing | Debugging | Distributed Systems | Generative AI | Language ModelsMid-level Full TimeMountain View, CA, USA2h ago
-
Staff Software Engineer, YouTube Data Science USD 207K-300KBig Data | Data Structures | Data Structures and Algorithms | Data analytics | Distributed ComputingSenior-level Full TimeSan Bruno, CA, USA2h ago
-
Software Engineer III, BigLake OSS USD 147K-211KApache Arrow | Apache Iceberg | Apache Spark | C++ | Data StorageSenior-level Full TimeSeattle, WA, USA2h ago
-
Senior Staff AI Data Infrastructure Engineer USD 203K-344KApache Iceberg | Apache Spark | C++ | Concurrent programming | Data LakehouseSenior-level Full TimeSanta Clara, CA15h ago
-
Software Engineer - GPU Inference USD 165K-330KAPI | Async Scheduling | CLI | CUDA | Distributed Systems401k | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco16h ago
-
Cloud Machine Learning Engineer - US remote USD 150K-200KAWS CloudWatch | Accelerate | Amazon EC2 | Amazon S3 | Amazon SageMakerConference reimbursement | Flexible paid time off | Flexible working hours | Health, dental, and vision benefits | Parental leaveMid-level Full TimeUnited States - Remote R1d ago
-
Palantir Senior Data Engineer USD 135K-200KData Management | Data Processing | Data integration | Feature Engineering | Generative AISenior-level Full TimeAtlanta, Georgia, United States1d ago
-
Mid-level Full TimeMalvern, Pennsylvania, United States1d ago
-
Applied Research - Evals & Data USD 150K-300KAccelerate | Data Pipelines | Data Versioning | Distributed Systems | Distributed tracingConference attendance | Professional development budget | Relocation support | Remote work | Team offsitesSenior-level Full TimeSan Francisco1d ago
-
Staff Data Engineer USD 187K-245KAPI Gateway | Alerting | Amazon Redshift | Apache Airflow | BigQueryEquity | Flexible paid time off | Health insurance 100% paid premium | Lifestyle stipend | Parental leaveSenior-level Full TimeRemote, US R1d ago
-
Training: ML Framework Engineer USD 205K-445KDistributed Systems | Machine Learning | Performance optimization | Profiling | PythonHybrid work model | Relocation assistanceMid-level Full TimeSan Francisco1d ago
-
Staff AI engineer USD 125K-170KAI Evaluations | AWS | Agent Orchestration | Caching | Data PipelinesFlexible working hours | Hybrid work culture | Unlimited time offSenior-level Full TimeSan Francisco1d ago
-
Machine Learning Engineer: Perception and Planning USD 184K-275KAutomated testing | Behavior Prediction | C++ | Classification | Code reviewSenior-level Full TimeOakland, CA1d ago
-
Robotics System Engineer USD 110K-275KAutonomous Systems | C++ | Data Analysis | Metrics pipelines | PythonSenior-level Full TimeOakland, CA1d ago
-
ML Infrastructure Engineer USD 160K-230KAmazon SageMaker | Apache Airflow | Apache Spark | Argo Workflows | Cloud platformEntry-level Full TimeOakland, CA1d ago
-
Embedded Software Engineer II USD 129K-193KBootloader | C# | C++ | CI/CD | DebuggingDental insurance | Disability insurance | FSA | HSA | Health insuranceSenior-level Full TimeWestminster, CO1d ago
-
Sr. Back-End Software Engineer - Machine Learning USD 190K-250KC++ | Computer Vision | Distributed Systems | Language Processing | Linux401k matching | Commuter benefits | Dependent Family Medical Premium Coverage | Employee Medical Premium Coverage | Employee referral programSenior-level Full TimeSanta Clara, CA1d ago
-
Space Operations Engineer (Embedded Software) USD 100K-160KAPI Integration | ARM | C plus plus | C# | Command and controlMid-level Full TimeSan Francisco, CA1d ago
-
Director, Perception USD 253K-318K3D Modeling | CUDA | Computer Vision | Deep learning | GPU ComputingExecutive-level Full TimeFoster City, CA1d ago
-
Senior Quantum Embedded Engineer USD 150K-190K10 Gigabit Ethernet | AMD Xilinx | AMD Xilinx RFSoC | AMD Xilinx Zynq | BashHybrid work | Remote workSenior-level Full TimeNew Haven, CT1d ago
-
Senior Quantum Applications Engineer - QEC USD 120K-258KCUDA-Q | Decoder algorithms | Docker | End to End | End-to-End TestingSenior-level Full TimeNew Haven, CT1d ago
-
Quantum Engineer (Physicist) USD 136K-193KCircuit-QED | Control Systems | Cryogenics | Error correction | Low temperature physicsFast-paced environment | Interdisciplinary team | State-of-the-art facilitiesMid-level Full TimeNew Haven, CT1d ago
-
Associate Quantum Engineer USD 136K-193KCryogenic Systems | Data Analysis | Data acquisition | High vacuum systems | High-vacuumInterdisciplinary team collaboration | Mentorship | State-of-the-art facilitiesMid-level Full TimeNew Haven, CT1d ago
-
Scientific Data Engineer USD 120K-160KData Transformation | Data Warehousing | Data cleaning | ETL | Excel401k matching | Dental insurance | Health insurance | Paid time offMid-level Full TimeSan Francisco, New York1d ago
-
Platform Engineer - Generative AI USD 120K-160KAPI Development | Caching | Database Design | Fine Tuning | Flask401k employer contribution | Dental insurance | Health insuranceMid-level Full TimeNew York, San Francisco, Munich or …1d ago