Machine Learning Systems & Infrastructure Engineer
Tasks
- Build data ingestion pipelines
- Build machine learning training systems
- Debug distributed training across GPUs
- Develop workflow orchestration pipelines
- Enable distributed training
- Handle secrets IAM and network boundaries
- Implement monitoring, logging, and alerting
- Maintain CI/CD pipelines
- Maintain dataset loaders and checkpointing
- Manage infrastructure as code with Terraform
- Optimize GPU compute performance
- Orchestrate machine learning experiments
- Serve machine learning models in production
Perks/Benefits
- N/A
Skills/Tech-stack
AWS IAM | Airflow | Amazon Web Services | Azure Networking | BigQuery | CI/CD | CUDA | Cloud platform | DDP | Data Versioning | Distributed Training | Docker | Experiment tracking | FSDP | GCP IAM | GPU scheduling | Google Cloud | Google Cloud Platform | Grafana | IAM | Kubeflow Pipelines | Kubernetes | MLflow | Machine Learning | Microsoft Azure | Modal | NCCL | Object storage | Observability | OpenTelemetry | PostgreSQL | Prometheus | PyTorch | PyTorch distributed | Python | SQL | SQLite | Slurm | Snowflake | Terraform | Triton | Volcano | Web Services | Weights and Biases
Education
N/A
Related jobs
-
C plus plus | C# | Cloud Computing | Cloud platform | Data ProcessingSenior-level Full TimeLondon, UK6h ago
-
Machine Learning DSP Engineer GBP 50K-70KAudio Processing | C# | C++ | Digital Signal | Digital Signal ProcessingRelocation support | Visa sponsorshipSenior-level Full TimeMilton Keynes, United Kingdom8h ago
-
Data Engineer - Intermediate Level GBP 42K-50KAWS | Agile | Apache Flink | Apache Storm | AzureAgile environment | Hybrid work | Team cultureMid-level Full TimeLondon, United Kingdom10h ago
-
Data Engineer GBP 45K-50KApache Beam | BigQuery | CI/CD | Cloud Dataflow | Cloud FunctionsEmployee referral bonus | Hybrid work | Profit share bonus | Share purchase plan | Uncapped leaveSenior-level Full TimeBath, England, United Kingdom17h ago
-
.NET | AWS | Apache Flink | CI/CD | HarnessHybrid workSenior-level Full TimeLondon, United Kingdom17h ago
-
ARM | Buildroot | C# | C++ | Code CoverageDisability income protection | Employee assistance program | Life insurance | Pension plan | Private healthcareMid-level Full TimeFarnborough, Hampshire, United Kingdom23h ago
-
Senior Analytics Engineer GBP 70K-80KAccess Control | Airflow | Amazon Redshift | Automation | BigQuerySenior-level Full TimeUnited Kingdom, London1d ago
-
Senior AI Engineer GBP 75K-75KAWS | Agent systems | Artificial Intelligence | Azure | CI/CDAnnual bonus | Discounted gym membership | Electric vehicle leasing | Experience days | Hybrid workSenior-level Full TimeLondon, United Kingdom R1d ago
-
Data Engineer - Python GBP 60K-80KAgile | Data Architecture | Data Engineering | Data Pipelines | Data ScienceSenior-level Full TimeBelfast, Northern Ireland, United Kingdom1d ago
-
Applied AI Analytics Lead 14 Months FTC GBP 70K-90KAzure Foundry | Copilot Studio | DBT | Data Modeling | Microsoft FabricHybrid work modelSenior-level ContractLondon, England, United Kingdom R1d ago
-
Analytics Engineer GBP 55K-75KCI/CD | DBT | Data Modeling | Data Quality | Data WarehouseCompany wellbeing resources | Gym membership | Hybrid work | Life Event day | Life insuranceMid-level Full TimeLondon R1d ago
-
Forward Deployed Engineer - London GBP 111K-130KFull Stack | Full-Stack Development | Generative AI | JavaScript | LLMHybrid work model | Relocation assistance | Travel up to 50 percentMid-level Full TimeLondon, UK1d ago
-
Data Engineer - Brook Green Supply GBP 49K-55KApache Airflow | Apache Kafka | Apache Spark | CI/CD | DagsterMid-level Full TimeLondon, England, United Kingdom1d ago
-
Bash | Cloud platform | Data Ingestion | Data Processing | DockerAsynchronous culture | Career growth opportunity | Friendly work environment | Impact on consumer and enterprise products | Remote friendly 100% distributed settingMid-level Full TimeLondon, United Kingdom1d ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous culture | Entrepreneurial environment | Flexible management structure | Friendly laid-back atmosphereMid-level Full TimeBrighton, United Kingdom1d ago
-
Internal Audit AVP - Data Analytics GBP 47K-58KAWS | Alteryx | Audit Testing | Cloud Computing | DashboardsEntry-level Full TimeCanary Wharf, 1 Churchill Place, United …1d ago
-
Platform Data Engineer GBP 91K-110KAnalytical Databases | Batch Processing | ClickHouse | Compression | Data ModelingCompany retreats | Family leave | Flexible hours | Generous paid time off | Remote-first setupSenior-level Full TimeUnited Kingdom - Remote R1d ago
-
AI/ML/Data Science Engineer GBP 50K-60KAnomaly Detection | Cloud Platforms | Computer Vision | Computer Vision Video Analytics | Data PreprocessingAnnual leave | Bank Holiday Leave | Dental care | Discounts | Enhanced maternity/paternity leaveMid-level Full TimeBiggin Hill, United Kingdom1d ago
-
Alerting | Cloud infrastructure | Containerization | Deployment Pipelines | DockerFlexible schedule | Performance bonus program | Remote workMid-level FreelanceUnited Kingdom - Remote R1d ago
-
LLM Specialist / AI Developer GBP 59K-65KAPI Security | Azure OpenAI | Copilot Studio | Evaluation | GroundingFixed term position | Hybrid working | Working permit sponsorshipMid-level Full TimeLondon, Greater London, United Kingdom1d ago
-
Senior GenAI Software Engineer GBP 76K-100KA/B | A/B Testing | B testing | Debugging | Diffusion ModelsDental insurance | Equity | Health insurance | Vision insurance | Wellness stipendsSenior-level Full TimeLondon, UK R1d ago
-
Senior-level Full TimeUnited Kingdom (Remote) R1d ago
-
Data Science Lead - Logistics GBP 62K-80KArtificial Intelligence | Big Data | Cloud | IoT | JavaAnnual leave | Buy As You Earn Scheme | Cycle to work scheme | Employee assistance programme | Employee discountsSenior-level Full TimeLondon, United Kingdom R2d ago
-
Senior Data Engineer BI Hub GBP 62K-77KAWS | Agile | Alerting | Azure | CI/CDAnnual holiday allowance | Buy additional holiday | Colleague discount | Cycle to work scheme | DiscountsSenior-level Full TimeLondon, London, United Kingdom2d ago
-
AWS | Amazon Bedrock | Azure | Azure OpenAI | DockerMid-level Full TimeLONDON, LONDON, United Kingdom2d ago