Machine Learning Systems & Infrastructure Engineer
Tasks
- Build data ingestion pipelines
- Build machine learning training systems
- Debug distributed training across GPUs
- Develop workflow orchestration pipelines
- Enable distributed training
- Handle secrets IAM and network boundaries
- Implement monitoring, logging, and alerting
- Maintain CI/CD pipelines
- Maintain dataset loaders and checkpointing
- Manage infrastructure as code with Terraform
- Optimize GPU compute performance
- Orchestrate machine learning experiments
- Serve machine learning models in production
Perks/Benefits
- N/A
Skills/Tech-stack
AWS IAM | Airflow | Amazon Web Services | Azure Networking | BigQuery | CI/CD | CUDA | Cloud platform | DDP | Data Versioning | Distributed Training | Docker | Experiment tracking | FSDP | GCP IAM | GPU scheduling | Google Cloud | Google Cloud Platform | Grafana | IAM | Kubeflow Pipelines | Kubernetes | MLflow | Machine Learning | Microsoft Azure | Modal | NCCL | Object storage | Observability | OpenTelemetry | PostgreSQL | Prometheus | PyTorch | PyTorch distributed | Python | SQL | SQLite | Slurm | Snowflake | Terraform | Triton | Volcano | Web Services | Weights and Biases
Education
N/A
Related jobs
-
AI architecture | APIs | Agent systems | Cloud platform | Data PipelinesSenior-level Full TimeLondon, UK1d ago
-
C++ | Data Processing | Data debugging | Data extraction | Distributed ComputingSenior-level Full TimeLondon, UK1d ago
-
Machine Learning Engineer GBP 110K-145KCI/CD | Docker | Feature Engineering | Kubernetes | Model Deployment401k matching | Dental insurance | Flexible paid time off | Health and wellness stipend | Health insuranceSenior-level Full TimeUnited Kingdom R1d ago
-
Airflow | BigQuery | Computer Vision | Deep learning | EmbeddingsBike to work scheme | Family planning support | Flexible vacation | Gender-affirming care | Income replacement programsSenior-level Full TimeRemote - United Kingdom R1d ago
-
Staff Machine Learning Engineer, ML Efficiency GBP 81K-118KApache Spark | C++ | Caching | Cloud Cost Optimization | Cloud infrastructureBike to work scheme | Family planning support | Flexible vacation | Gender-affirming care | Group pension with employer matchSenior-level Full TimeRemote - United Kingdom R2d ago
-
Senior-level Full TimeLondon2d ago
-
Senior Data Engineer GBP 72K-88KApache Airflow | Async Processing | CI/CD | Cloud Build | Cloud ComposerCampus based work schedule | Global team collaboration | Hybrid working | Pathway to Lead Engineer | Support for certification and trainingSenior-level Full TimeDunton, Essex, United Kingdom2d ago
-
Data Business Engineer GBP 65K-80KAccess Control | Amazon Redshift | BigQuery | COGS Modeling | Data GovernanceMid-level Full TimeLondon, United Kingdom2d ago
-
Senior Robotics Research Engineer GBP 62K-78KC# | C++ | CI/CD | Concurrent programming | Continuous DeploymentAnnual leave | Cycle to work scheme | EAP | Employer pension matching | Exclusive discountsSenior-level Full TimeWelwyn Garden City, United Kingdom R2d ago
-
Senior Robotics Software Engineer GBP 62K-84KAgile | Automated testing | BigQuery | CI/CD | CUDAAnnual leave | Cycle to work scheme | Employee assistance program | Exclusive discounts | Free shuttle busSenior-level Full TimeHatfield, United Kingdom R2d ago
-
Data Infrastructure and AI Engineer GBP 35K-40KBenchmarking | C# | C++ | Cloud Native | Concurrency ControlEntry-level Full TimeEdinburgh, United Kingdom2d ago
-
Mid/Senior Solution Architect GBP 80K-110KAmazon Web Services | Azure Machine Learning | CI/CD | Cloud platform | DockerDiscounted lunch | Educational budget | Flexible working hours | Hybrid work | Language classesSenior-level Full TimeLondon, United Kingdom2d ago
-
Mid-level Full TimeManchester, England, United Kingdom2d ago
-
Data Engineer GCP (Remote) GBP 70K-80KCI/CD | Cloud Composer | Cloud Functions | Cloud Storage | Cloud platformRemote work environmentMid-level Full TimeStoke-on-Trent, England, United Kingdom R2d ago
-
Data Engineer (GCP) GBP 70K-80KCI/CD | Cloud Composer | Cloud Functions | Cloud Storage | Data pipelineMid-level Full TimeStoke-on-Trent, England, United Kingdom2d ago
-
Data Engineer GBP 45K-45KAPI | Alerting | Automation | CI/CD | Data AnalysisAgile working | Career development | Family-friendly policies | Holiday | Learning and developmentMid-level Full TimeNorthampton, United Kingdom2d ago
-
Senior Software Engineer, Embedded UI GBP 70K-85KAPI Design | Artificial Intelligence | BrightScript | Cloud Development | Content ManagementCommuter benefits | Disability insurance | Financial wellness support | Healthcare benefits | Life insuranceSenior-level Full TimeCambridge, United Kingdom2d ago
-
Bash | Cloud platform | Data Pipelines | Data Processing | DockerAsynchronous culture | Career growth opportunities | Remote workMid-level Full TimeCambridge, United Kingdom2d ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudAsynchronous culture | Friendly work environment | Inclusive workplace | Remote-friendly, distributed teamMid-level Full TimeReading, United Kingdom2d ago
-
Bash | Data Processing | Docker | GCP | LinuxAsynchronous work culture | Inclusive workplace | Remote work cultureMid-level Full TimeCardiff, United Kingdom2d ago
-
Junior Python Developer GBP 38K-38KAPI Development | AWS | Data Engineering | FastAPI | Machine LearningEntry-level Full TimeHolborn - London, United Kingdom2d ago
-
Mid-level Full TimeHolborn - London, United Kingdom2d ago
-
Senior Robotics & Lab Automation Engineer GBP 44K-53KAPIs | Automation | Autonomous Systems | Bioprocess | C#Annual leave | Comprehensive onboarding | Cycle to work scheme | Flexible work schedule | Free hot and cold drinksSenior-level Full TimeRoyston York Way, United Kingdom2d ago
-
Senior-level Full TimeBelfast, United Kingdom2d ago
-
Senior-level Full TimeBelfast, United Kingdom2d ago