Senior Site Reliability Engineer — Token Factory (Inference Platform)
Amsterdam, Netherlands; Berlin, Germany; London, United Kingdom; Prague, Czech Republic; Remote - Europe
R
GBP 225K-255K (estimate) Senior-level Full Time
Tasks
- Build infrastructure as code with Terraform
- Debug distributed backend failures
- Design telemetry pipelines metrics logs traces
- Detect isolate remediate incidents with runbooks
- Drive post-mortem culture
- Ensure performance and observability under extreme load
- Harden request-routing and retry logic
- Own inference stack reliability
- Tune Kubernetes autoscalers
Perks/Benefits
- Career growth
- Collaborative culture
- Flexibility
- Impactful AI projects
- International environment
- Learning opportunities
- Work-life balance
Skills/Tech-stack
Alert design | Alerting | Autoscaling | Bash | Distributed Systems | Grafana | Incident Management | Infrastructure as Code | Kubernetes | Logs | MLOps | Metrics | Observability | Prometheus | Python | Retry logic | SLO | Scripting | Telemetry | Terraform | Traces | “as-code”
Education
N/A
Regions
Countries
States
Related jobs
-
Applied AI Engineer GBP 85K-110KA/B | A/B Testing | Anthropic | B testing | ExperimentationFully remote | Global engineering collaboration | High ownership culture | Learning and development budgetMid-level Full TimeUnited Kingdom R1d ago
-
Lead AI Engineer (AI Systems & Automation) GBP 78K-109KAlerting | Anthropic | Distributed Systems | Docker | EmbeddingsFully remote | Global engineering collaboration | High ownership culture | Learning and development budgetSenior-level Full TimeUnited Kingdom R1d ago
-
Data Architect (m/w/d) EUR 56K-75KAWS | Analytics | Azure | CI/CD | ClaudeAnnual bonus | Childcare support | Coaching | Company bike leasing | Fitness programMid-level Full TimeSt. Georgen im Schwarzwald, Hannover, 100% … R1d ago
-
Consulting Systems Engineer, Data Management (EMEA) GBP 75K-101KAWS | Ansible | Apache Kafka | Azure | Cloud hybridCompany-sponsored team events | Flexible time off | Wellness resourcesSenior-level Full TimeRemote, United Kingdom R3d ago
-
Senior AI/ML engineer GBP 120K-150KAWS | CI/CD | Databricks | Deep learning | Delta LakeAccelerated professional growth | Enhanced parental leave | Female health leave | Fully paid sabbatical | Health pension wellbeing benefitsSenior-level Full TimeLondon R3d ago
-
Analytics Engineer GBP 75K-90KAmazon Redshift | Apache Airflow | CI/CD | DBT | Data ModellingEnglish language at office | In person 3 days per week | Relocation supportMid-level Full TimeLondon R3d ago
-
(Senior) Data Engineer (gn) EUR 50K-57KAWS | Airbyte | DBT | Data Modeling | Data PipelinesCorporate benefits | E GYM Wellpass Subsidy | Employee discounts | Flexible working hours | Paid time offSenior-level Full TimeDeutschland, remote R3d ago
-
(Senior) Data Engineer (gn) EUR 60K-78KAWS | Airbyte | DBT | Data Modeling | Data QualityCompany pension | Employee discounts | Fitness subsidy | Flexible working hours | HomeofficeSenior-level Full TimeDeutschland, remote R3d ago
-
Principal Data Engineer GBP 90K-103KCI | CI/CD | DBT | Data Contracts | Data ManagementCareer Development Programs | Collaborative culture | Flexible time off | Flexible work schedules | Health and wellness benefitsSenior-level Full TimeAntrim, Northern Ireland, United Kingdom; Remote- … R3d ago
-
ArgoCD | Artificial Intelligence | Bamboo | Cloud Orchestration | DevOpsHybrid work modelMid-level Full TimeAnywhere within commuting distance of Brno, … R4d ago
-
Senior Platform Engineer (m/f/d) EUR 65K-81KAlerting | Algorithms | Autoscaling | Big O | Big O NotationSenior-level Full TimeBerlin, DE, 10557 R4d ago
-
Senior-level Full TimeLondon, England, United Kingdom - Remote R4d ago
-
AI Engineer EUR 80K-105KAgent Orchestration | Async Programming | Embeddings | LLM | MCPComprehensive health benefits | Equity participation program | Family leave plus | Language training | Leadership programsMid-level Full TimeMünchen, Bayern, Germany (Hybrid) R4d ago
-
AI Engineer EUR 80K-105KAsync Programming | Embeddings | JavaScript | LLM | MCPAnnual learning budget | Comprehensive health benefits | Equity participation program | Family leave plus | Language trainingMid-level Full TimeBerlin, Germany (Hybrid) R4d ago
-
LLM Engineer (m/f/d) EUR 53K-66KAgent systems | Anthropic API | Authentication | Best practices | Event DrivenErgonomic workstations | Flexible working hours | Health subsidies | Training and development | Work from homeMid-level Full TimeRemote, Germany R4d ago
-
Python Developer GenAI & Agentic AI (m/f/d) EUR 40K-60KAPI Integration | AWS | Agent Orchestration | Autogen | AzureErgonomic workstations | Flexible working hours | Health services subsidy | Option to work from home | Training and personal developmentMid-level Full TimeRemote, Germany R4d ago
-
Junior Data Analyst / Analytics Engineer (m/w/d) EUR 39K-45KAd Manager | BigQuery | ELT | ETL | Git30 days vacation | Free coffee | Half day off on Christmas Eve | Half day off on New Years Eve | Obst and snacksEntry-level Full TimeAugsburg, Berlin, Hamburg, remote, München, Nürnberg, … R4d ago
-
Senior Staff Engineer (LLM) GBP 90K-120KAgent systems | Anthropic API | Authentication | Event Driven | Event-driven architectureSenior-level Full TimeRemote, United Kingdom R4d ago
-
Principal AI Engineer, Edinburgh GBP 90K-105KAPI Design | Agent systems | Artificial Intelligence | Backend Development | CachingGym membership | Health insurance | Hybrid work | Life insurance | Mental health supportSenior-level Full TimeEdinburgh R5d ago
-
AWS DMS | Amazon Bedrock | Amazon Kinesis | Amazon QuickSight | Amazon SageMakerCommunity forums | Documentation and transparency culture | Flexible shifts | Mentorship | No weekend workSenior-level Contract Full TimeLondon R5d ago
-
Algorithms | Computer Science | Data Analysis | Data Mining | Data VisualizationFlexible working hours | Hybrid work model | MentorshipEntry-level Full Time Part TimeSchwalbach / Frankfurt, Germany R5d ago
-
Sr Data Science Engineer GBP 81K-115KARIMA | Databricks | Decision Trees | K-Means | K-means ClusteringSenior-level Full TimeLondon, United Kingdom R5d ago
-
Senior Machine Learning Manager, Borrowing GBP 138K-176KAI | AI Platform | AWS | BigQuery | Cloud platformFlexible working hours | Learning budget | Relocation support | Remote work | Visa sponsorshipSenior-level Full TimeCardiff, London or Remote (UK) R5d ago
-
Staff MLOps Engineer (AI/ML Platform) CZK 1384K-1715KAWS | AWS EKS | Apache Spark | Batch Processing | Data DriftSenior-level Full TimeRemote, Remote, Czechia R5d ago
-
Staff MLOps Engineer (AI/ML Platform) GBP 90K-115KAWS | AWS EKS | Apache Spark | Batch Processing | DatabricksRemote work optionsSenior-level Full TimeRemote, Remote, United Kingdom R5d ago