Staff Machine Learning Engineer, Offline Infrastructure
Tasks
- Design and operate large scale data pipelines for training datasets
- Develop infrastructure for distributed training workflows
- Enable large scale experimentation and model iteration
- Enhance observability with monitoring and automated testing
- Improve reproducibility with dataset validation
- Integrate ML pipelines with workflow orchestration systems
- Lead architectural improvements for scalable reliable and cost efficient pipelines
- Optimize performance and resource utilization for distributed compute
Perks/Benefits
- Commute subsidy
- Comprehensive health life and disability insurance
- Employee assistance program
- Employee resource groups
- Employee stock ownership
- Generous vacation and personal days
- Mental health and wellbeing programs
- Office Food Snacks
- Parental leave and family care support
- Relocation support not available
- Retirement pension plans
- Training and development programs
- Volunteering and donation matching
Skills/Tech-stack
Apache Airflow | Apache Flink | Apache Spark | Automated testing | Data Lakes | Data Warehouses | Data pipeline | Data pipeline automation | Distributed Computing | Flyte | Machine Learning | Monitoring | Pipeline Automation | PyTorch | Python | Ray | Ray Data | Ray Train | Streaming Platforms | Workflow Orchestration
Education
N/A
Related jobs
-
AI Software Engineer USD 100K-200KDeep learning | Language Models | Large Language Models | Machine Learning | PyTorchMid-level Full TimeSan Francisco, CA, US / Palo …4h ago
-
Senior Confluent Kafka Lead USD 140K-213KAWS | Access Control | Access Control Lists | Apache Kafka | AvroSenior-level Full TimeColumbus, United States5h ago
-
Software Development Engineer - AI/LLM Network - Global Frontier Tech Research Program - 2027 Start USD 202K-368KC++ | Cause analysis | Fault Localization | High Availability | LinuxEntry-level Full TimeSeattle, Washington, United States5h ago
-
Causal Inference | Cross-modal fusion | Data Modeling | Direct Preference Optimization | Graph Neural NetworksEntry-level Full TimeSeattle, Washington, United States6h ago
-
AI/LLM Network Software Development Engineer USD 202K-368KAI communication | High Performance | High-performance networking | Monitoring | Network ArchitectureMid-level Full TimeSeattle, Washington, United States6h ago
-
Machine Learning Engineer Intern (E-commerce Governance Algorithms) - 2026 Summer (BS/MS) USD 122K-246KAlgorithm Design | Fraud Detection | Machine Learning | Python | Risk AssessmentDevelopment workshops | Hands-on experience | Industry exposure | Social eventsEntry-level InternshipSeattle, Washington, United States6h ago
-
Artificial Intelligence | Big Data | Data Processing | Distributed Systems | High PerformanceEntry-level InternshipSan Jose, California, United States6h ago
-
Staff Backend Engineer, Core Data Service USD 187K-280KAI | Active architecture | Active-active Architecture | Active/Active | Data ConsistencySenior-level Full TimeSan Jose, California, United States6h ago
-
Senior Backend Engineer, Core Data Service USD 187K-280KAI | Active architecture | Active-active Architecture | Active/Active | Anomaly DetectionSenior-level Full TimeSan Jose, California, United States6h ago
-
Software Engineer, Search AI Infra Performance USD 174K-252KData Processing | Debugging | Distributed Systems | Generative AI | Language ModelsMid-level Full TimeMountain View, CA, USA7h ago
-
Senior Data Engineer, YouTube Data Science USD 156K-226KApache Flume | Apache Spark | Automation | Business Intelligence | ComplianceSenior-level Full TimeSan Bruno, CA, USA7h ago
-
Staff Software Engineer, YouTube Data Science USD 207K-300KBig Data | Data Structures | Data Structures and Algorithms | Data analytics | Distributed ComputingSenior-level Full TimeSan Bruno, CA, USA7h ago
-
Software Engineer III, BigLake OSS USD 147K-211KApache Arrow | Apache Iceberg | Apache Spark | C++ | Data StorageSenior-level Full TimeSeattle, WA, USA7h ago
-
Senior Machine Learning Engineer USD 160K-250KCaching | Cloud Platforms | GPU Computing | Kubernetes | LLM InferenceSenior-level Full TimeIsrael, center, IL7h ago
-
ADLS Gen2 | API Gateway | AppDynamics | Autoscaling | AzurePaid time offSenior-level Full TimeAddison, United States18h ago
-
Senior Data Engineer USD 113K-188KApache Spark | Azure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage401k retirement plan | Adoption Assistance | Employee referral program | Health savings account | Parental leaveSenior-level Full TimeGH Office: San Antonio, TX (9903 …18h ago
-
Principal Associate, Data Scientist - Innovation Hub USD 147K-184KAWS | Apache Spark | Conda | Generative AI | H2OSenior-level Full TimeMcLean, VA, United States18h ago
-
Commercial Analytics - Senior Associate USD 80K-150KBigQuery | Hadoop | Microsoft Excel | Microsoft PowerPoint | Power BIEmployee benefits | Flexible work environment | Home-based option | Incentive eligibilitySenior-level Full Time127 Public Square, Cleveland, OH, United … R18h ago
-
Sr. Data Engineers USD 158K-173KData Mining | Data Modeling | Feature Selection | Language Processing | Machine Learning401k match | Healthcare benefits | Life insurance | Paid disability | Paid time offSenior-level Full TimeCA - Irvine, HQ, United States18h ago
-
Senior Staff AI Data Infrastructure Engineer USD 203K-344KApache Iceberg | Apache Spark | C++ | Concurrent programming | Data LakehouseSenior-level Full TimeSanta Clara, CA20h ago
-
Software Engineer - GPU Inference USD 165K-330KAPI | Async Scheduling | CLI | CUDA | Distributed Systems401k | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco21h ago
-
Senior-level Full TimeReston, VA1d ago
-
Forward Deployed AI Engineer USD 90K-140KArtificial Intelligence | Language Models | Large Language Models | Machine Learning | Neural NetworksAccess to cutting-edge tools | Dental insurance | Equity | Health insurance | Professional developmentMid-level Full TimeSan Francisco, New York1d ago
-
Communication Protocols | Computer Vision | Control Systems | Debugging | GRPC401k plan | Dental insurance | Medical insurance | Relocation benefits | Unlimited PTOMid-level Full TimeSan Francisco, CA1d ago
-
Cloud Machine Learning Engineer - US remote USD 150K-200KAWS CloudWatch | Accelerate | Amazon EC2 | Amazon S3 | Amazon SageMakerConference reimbursement | Flexible paid time off | Flexible working hours | Health, dental, and vision benefits | Parental leaveMid-level Full TimeUnited States - Remote R1d ago