Senior Software Engineer, ML Platform Infrastructure
Tasks
- Automate resource provisioning with IaC
- Build ML platform abstraction
- Build data extraction and ETL pipelines
- Design workload orchestration and scheduling
- Implement feature caching and storage
- Integrate distributed training workloads
- Optimize hardware utilization and job wait times
Perks/Benefits
- N/A
Skills/Tech-stack
Apache Beam | Apache Spark | Ceph | Distributed Systems | Feast | Feature Store | Hopsworks | Infrastructure as Code | Kubernetes | Lustre | MLOps | NVMe | Networking | Pulumi | Ray | Redis | Slurm | Storage Systems | Terraform | “as-code”
Education
N/A
Regions
Countries
States
Related jobs
-
Mid-level Full TimeKing George, VA, United States3h ago
-
Data Engineer USD 130K-145KApache Spark | CI/CD | Cloud platform | Containerization | Data GovernancePublic trust clearance support | Remote workSenior-level Full TimeWork from home, VA, United States R3h ago
-
Senior Engineer, Big Data USD 90K-140KAmbari | Apache HBase | Apache Hive | Apache Impala | Apache KafkaSenior-level Full TimeUnited States7h ago
-
Senior Staff Software Engineer, AI/ML, Security USD 262K-365KAdversarial Machine Learning | Artificial Intelligence | Cloud Architecture | Cloud Computing | Data PrivacySenior-level Full TimeKirkland, WA, USA; Seattle, WA, USA7h ago
-
Senior Software Engineer, Google.org, Data Analytics USD 174K-252KArtificial Intelligence | Data Structures | Data Structures and Algorithms | Distributed Systems | Integration TestingSenior-level Full TimeSeattle, WA, USA7h ago
-
Software Engineer III, Storage for Analytics USD 147K-211KC++ | Data analytics | Databases | Distributed Systems | ExperimentationSenior-level Full TimeKirkland, WA, USA7h ago
-
Forward Deployed Engineer II, Applied AI, Cloud USD 127K-183KAPIs | Agent systems | Agentic Workflows | Conversational AI | DebuggingTravel 50% timeSenior-level Full TimeNew York, NY, USA; Atlanta, GA, …7h ago
-
Senior Software Engineer, Google Cloud Storage USD 174K-252KC++ | Data Structures | Data Structures and Algorithms | Distributed Systems | File systemsSenior-level Full TimeSeattle, WA, USA7h ago
-
Staff Engineer, Datacenter Server Lifecycle USD 320K-405KAWS | Asset tracking | Coreboot | Decommissioning | Firmware verificationFlexible working hours | Generous vacation | Hybrid work policy | Optional equity donation matching | Parental leaveSenior-level Full TimeSan Francisco, CA | New York …14h ago
-
Mid-level Full TimeTysons, VA, United States15h ago
-
Mid-level Full TimeRemote, United States R15h ago
-
Senior AI Infrastructure Engineer - Training Platform USD 216K-270KAWS | Admission controllers | C++ | CUDA | Custom ResourcesCommuter stipend | Comprehensive health, dental and vision coverage | Generous PTO | Learning and development stipend | Retirement benefitsSenior-level Full TimeSan Francisco, CA; Seattle, WA; New …15h ago
-
Software Engineer USD 100K-150KAmazon Simple Queue Service | Amazon Web Services | Bedrock | Data Pipelines | Distributed Systems401k retirement plan | Dental insurance | Health insurance | Unlimited vacation | Vision insuranceSenior-level Full TimeLos Angeles, CA, US18h ago
-
Data Engineer / Azure Fabric Engineer Consultant USD 119K-193KAI Search | Access Control | Automated Monitoring | Azure AI | Azure AI SearchSenior-level Full TimeMason, OH, US18h ago
-
Mid-level Full TimeScottsdale, AZ18h ago
-
Principal Engineer, Data & ML Platform USD 120K-180KAPIs | Automated testing | Batch Processing | Cloud platform | Data ModelingSenior-level Full TimeScottsdale, AZ18h ago
-
Principle Data Engineer USD 220K-235KAWS | Airflow | BigQuery | Capacity Planning | Compliance401k | Equity | Essential equipment | Flexible PTO | Fully remoteSenior-level Full TimeCleveland, OH R18h ago
-
Agent Frameworks | Deterministic systems | Distributed Systems | GraphQL | LLMDirect collaboration with executive leadership | High-ownership environment | Hybrid schedule | Relocation assistance | Remote flexibilitySenior-level Full TimeRemote; San Francisco, CA; United States R19h ago
-
Lead Data Engineer USD 122K-207KAgile | Apache NiFi | Apache Spark | ETL | Java401k match | Dental insurance | Disability insurance | Fitness reimbursement or facilities | Health insuranceSenior-level Full TimeO'Fallon, Missouri (Main Campus), United States19h ago
-
Technology Analyst Program - Data Engineer USD 71K-114KApache Spark | Azure Data | Azure Data Factory | CI/CD | Cloud DataflowDental insurance | Disability insurance | Employee resource groups | Employee stock purchase plan | Internal mobilityMid-level Full TimeAlpharetta, Georgia, United States19h ago
-
Summer Intern - Data Engineer USD 45K-62KApache Spark | Azure Data | Azure Data Factory | CI/CD | Cloud DataflowDental insurance | Disability insurance | Employee assistance program mental health support | Employee resource groups | Employee stock purchase planEntry-level Full Time InternshipAlpharetta, Georgia, United States19h ago
-
Senior Assistant Vice President USD 164K-220KAWS | Agent Orchestration | AgentOps | Asynchronous programming | AuditabilitySenior-level Full TimeUnited States19h ago
-
Software Engineer USD 153K-237KAPI | API Gateway | API Management | AWS | Apache Airflow401k employer match | Employer Disability Insurance | Employer health insurance | Employer life insurance | Paid HolidaysSenior-level Full TimeChantilly, VA20h ago
-
AMD Public-Dallas-Associate-Software Engineering USD 123K-188KAWS | AWS Glue | Amazon EMR | Amazon Redshift | Amazon S3Senior-level Full TimeDallas, Texas, United States21h ago
-
Data Engineer II USD 150K-180KAWS | Apache Airflow | Apache Kafka | Apache Spark | Argo Workflows401k match | CLEAR Plus membership | Catered lunches | Family building benefits | Flexible time offMid-level Full TimeNew York, NY, United States21h ago