Software Engineer, Machine Learning Platform
Tasks
- Build and maintain infrastructure as code
- Build and operate model training systems
- Design scalable ML infrastructure
- Develop data ingestion and streaming systems
- Develop distributed training and batch processing systems
- Enhance observability and reliability
- Improve ML CI CD workflows
- Optimize ML workload cost visibility
- Participate in on-call rotations
- Support feature store and feature pipelines
- Support real-time inference systems
Perks/Benefits
- 401k match
- Child elder pet care backup
- Commuter benefit
- Disability insurance
- Life insurance
- Medical, dental & vision coverage
- Paid parental leave
- Paid time off
- Wellness stipend
Skills/Tech-stack
AWS | Amazon Kinesis | Apache Flink | Apache Kafka | Apache Spark | CI/CD | CUDA | CloudFormation | Code review | Data Preprocessing | Docker | Feature Store | GPU Programming | Go | Infrastructure as Code | Java | Kubernetes | Model Deployment | Model Evaluation | Model Monitoring | Model Training | Observability | Python | Ray | Scala | Spark Streaming | Terraform | Testing | Version control | “as-code”
Education
N/A
Regions
Countries
States
Related jobs
-
Senior Applied AI Engineer USD 160K-210KAPI Design | AWS | CI/CD | Circuit Breakers | DockerDynamic work environment | Flexible working hoursSenior-level Full TimeUS - Remote, Canada - Remote R13h ago
-
Senior Data Scientist, Machine Learning USD 194K-218KAWS | Active Learning | Airflow | Amazon Redshift | Automated Labeling100% TelecommutingSenior-level Full TimeRedwood City, CA R1d ago
-
Principal Data Engineer USD 141K-166KAWS | Agile | Amazon Web Services | CI/CD | ConfluenceRemote work within United StatesSenior-level Full Time245 Summer St, Boston MA, United … R1d ago
-
Senior-level Full TimeUS - VA - Remote, United … R1d ago
-
Edge AI Engineer USD 100KBenchmarking | C++ | Core ML | DSPs | Edge AIFull-time W2 employment | H1B transfer support | Remote work | Technical coding assessmentSenior-level Full TimeUnited States - Remote R1d ago
-
AI Engineer USD 66K-145KAWS | Azure | CI/CD | Deep learning | DockerHealth benefits | Home-based work | Paid time off | Retirement contributionsMid-level Full TimeUS - VA - Remote, United … R1d ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Computer Vision | Deep learningCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Computer Vision | Data QualityCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Analytics Engineer, AV Safety Engineering USD 160K-250KAccess Management | CI/CD | Cloud Computing | Containerization | Data PipelinesFlexible work location with commute requirement | Remote workSenior-level Full TimeWork From Home - United States, … R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KActive Learning | Apache Beam | Apache Spark | CI/CD | CachingMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KApache Beam | CI/CD | Caching | Code review | CompressionBenefits | Career growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter methods | Benchmarking | DPO | Distributed Training | Efficient AttentionBenefits | RemoteMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapter-Tuning | Benchmarking | DPO | Deep Policy Optimization | Distributed TrainingCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AWS | Ansible | Azure | CAD Integration | CI/CDMid-level Full TimeUnited States - Remote R1d ago
-
LLM Fine-Tuning Engineer USD 100K-150KAdapters | Attention Optimization | DPO | Dataset curation | Distributed TrainingCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Prompt Engineering Architect USD 100K-150KAgentic Workflows | Embeddings | Evaluation Frameworks | Language Models | Large Language ModelsCareer growth | H1B transfer support | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Quantitative Developer (Fintech) USD 100K-150KAudit Reporting | Audit trails | Backtesting | C++ | Cloud ArchitectureCareer growth | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | CRUSH maps | CSI | Capacity Planning | CephRemote workSenior-level Full TimeUnited States - Remote R1d ago
-
Robotics Software Engineer USD 100K-150KBehavior Trees | C++ | Concurrent programming | Control Systems | DebuggingMid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | Continuous batching | Custom Kernel | Custom kernel development | CutlassCareer growth | H1B transfer support | Remote work | W2 employmentMid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | Continuous batching | DeepSpeed | Distributed Training | FSDPMid-level Full TimeUnited States - Remote R1d ago
-
Storage Engineer (NetApp / Pure / Ceph) USD 100K-150KAnsible | Backup | CRUSH | CSI | Capacity PlanningHealth benefits | Paid time off | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Engineer, Data USD 101K-144KApache Spark | Automated testing | Azure Data | Azure Data Factory | Azure Data LakeAnnual Incentives | Professional development | Quarterly incentives | Remote work | Retirement benefitsMid-level Full TimeWork at Home - Ohio - … R1d ago
-
Machine Learning Engineer USD 140K-190KApache Flink | Apache Kafka | Apache Spark | Bigtable | CI/CDMid-level Full TimeRemote - USA R1d ago