Site Reliability Engineer (AI)
Tasks
- Build and maintain the monitoring and alerting layer for AI applications and pipelines
- Collaborate with engineering teams to improve release quality and system stability
- Define and implement SLIs, alerts, and operational dashboards
- Diagnose production issues and implement fixes
- Manage incidents, including triage, coordination, root cause analysis, and prevention
- Optimize CI/CD pipelines and implement quality gates
- Standardize telemetry across systems
Perks/Benefits
- Comprehensive healthcare
- Fully remote
- International projects
- Long-term B2B contract
- Multinational environment
Skills/Tech-stack
Alerting | Azure | Azure DevOps | CI/CD | Datadog | Grafana | Incident Management | Kubernetes | Monitoring | Operational dashboards | Root Cause Analysis | SLI | Telemetry
Education
N/A