Engineering Manager, Inference Cloud
USD 180K-250K (estimate) Mid-level Full Time
Tasks
- Build distributed service discovery
- Collaborate with ML Product Infrastructure and Platform teams
- Define SLIs and SLOs
- Design service mesh and request routing
- Drive observability monitoring logging and alerting
- Implement graceful degradation under load
- Implement load balancing and caching
- Lead incident response and postmortems
- Lead multi region traffic management
- Operate active active fault tolerant systems
- Optimize latency throughput and cost
- Recruit and mentor engineering team
- Run chaos engineering for reliability
- Scale inference cloud platform
Perks/Benefits
- N/A
Skills/Tech-stack
AWS EKS | Active/Active | Admission control | Alerting | Backpressure | Batching | C++ | Caching | Chaos Engineering | Circuit breaking | Cloud Architecture | Distributed Systems | Distributed tracing | Fault Tolerance | Go | Grafana | Incident Management | Kubernetes | Latency optimization | Load Balancing | Load shedding | Logging | Metrics | Monitoring | Observability | Prometheus | Python | Quota Management | Rate Limiting | Request Routing | SLA | SLI | SLO | Service Discovery | Service Mesh | Throughput Optimization | Traffic prioritization
Education
N/A
Roles
Regions
Countries
States
Related jobs
-
Fraud Risk Analytics Manager USD 105K-130KClassification | Entity Resolution | Feature Engineering | Financial crime | Financial crime analyticsEducation reimbursement | Flexible work arrangements | Maternity & paternity leave | Medical, dental & vision coverage | Paid time offMid-level Full TimeUnited States3h ago
-
Software Engineer Manager, GKE and AI Infrastructure USD 207K-300KAI workflows | Container Orchestration | Containerization | Distributed Systems | GKESenior-level Full TimeSunnyvale, CA, USA5h ago
-
Affiliate Marketing | Customer Acquisition | Customer Retention | Data Modeling | Data Warehousing401k match | Life insurance | Medical, dental, and vision coverage | Mental health and wellness resources | Paid time offSenior-level Full TimeNew York R1d ago
-
GFS Reporting and Analytics Manager USD 107K-197KAzure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage | DAXDiscretionary annual incentive programMid-level Full TimeHermitage, Tennessee, United States; Nashville, Tennessee, …2d ago
-
Senior Manager, Finance Data Analytics Lead CAD 135K-155KArtificial Intelligence | Azure | Data Architecture | Data Governance | Data LakesCareer growth opportunities | Community involvement opportunities | Health and wellbeing resources | Hybrid work environment | Paid sick daysSenior-level Full TimeToronto, ON, M2N 5M9, CA2d ago
-
AVP, AML Analytics USD 75K-130KAWS | Anti-Money Laundering | Automation | BSA | Beneficial ownershipPaid travel for business as required | Work from home flexibilityExecutive-level Full TimeStamford Site, United States2d ago
-
Senior-level Full TimeRemote (United States) R2d ago
-
Senior Manager, Payments Optimization USD 152K-270KCheckout Conversion Optimization | Consumer authentication | Conversion Optimization | Data Analysis | Data Visualization401k | Dental insurance | FSA/HSA | Health insurance | Life insuranceSenior-level Full TimeSan Francisco, CA, United States2d ago
-
Learning Product Manager Lead USD 137K-212KAI-powered learning | Adaptive Systems | Adaptive learning | Analytics | AssessmentsSenior-level Full TimeUnited States2d ago
-
Sr. Manager, Security Analytics USD 135K-198K800-53 | Application Architecture | Audit management | Awareness Training | Cloud HostingSenior-level Full TimeRaleigh, NC2d ago
-
Sr. Manager, Security Analytics USD 135K-198K800-53 | Audit management | Awareness Training | Device Management | FedRAMPSenior-level Full TimeSalt Lake City, UT2d ago
-
Director, Quantitative Research USD 185K-225KData Governance | Data Integrity | Data Science | Data Visualization | Excel401k match | Company-wide events | Dependent care FSA | Educational stipend | Employee Referral Bonus ProgramExecutive-level Full TimeSeattle, Washington, United States2d ago
-
Data Science Manager USD 155K-225KEvaluation Frameworks | Experiment design | GenAI | Logging | Machine LearningMid-level Full TimeRemote, United States R2d ago
-
Analytics | B2B SaaS | Cash Flow | Cash Flow Optimization | Compliance401k retirement plan | Dental insurance | Disability insurance | Employee assistance program | Employee recognition programsSenior-level Full TimeDraper, Utah, United States; San Jose, …2d ago
-
Manager, Data Engineering USD 130K-166KAWS | Access Controls | Apache Airflow | Audit Logging | AzureCollaborative team culture | Remote work | Work-life balanceSenior-level Full TimeRemote, United States R3d ago
-
AI-enabled | AI-enabled solutions | AWS | Application development | Artificial IntelligenceEquipment and office stipend | Fully remote work | Health insurance hybrid plan | Laptop and tools | Learning and development stipendExecutive-level Full TimeCanada R3d ago
-
Strategy & Execution Manager, GTM USD 130K-223KAI | Agent Orchestration | Apache Spark | Cloud Data | Cloud data warehousingMid-level Full TimeUnited States3d ago
-
SOX Data Analytics & AI Manager USD 128K-148KBusiness Intelligence | Dashboarding | Data Integrity | Data Lineage | Data WarehousingMid-level Full TimeCottonwood Heights, Utah, Remote R3d ago
-
IT Technical Specialist - Databricks USD 146K-210KAWS | AWS Lambda | Agile | Amazon EC2 | Amazon RDSDisability benefits | Educational support | Medical, dental & vision coverage | Mental health and wellness support | Parental leaveSenior-level Full TimeCharlotte, NC, United States3d ago
-
Head of Data Engineering USD 200K-260KAWS | Airbyte | Airflow | Apache Iceberg | Avo401k plan with employer match | Dog-friendly office | Flexible working hours | Full health dental and vision coverage | Gympass subscriptionExecutive-level Full TimeSan Francisco, CA, Brooklyn, NY, Cambridge, … R3d ago
-
VP, AI Engineering USD 200K-250KData Governance | Deep learning | Distributed Systems | Experimentation | Language Models401k match | AD D Insurance | DCFSA | FSA | Flexible time offExecutive-level Full TimeAtlanta, GA3d ago
-
Software Engineering Manager - Platform Team USD 180K-350KC plus plus | Cassandra | Flink | Hibernate | JPACareer growth | High-impact projectsSenior-level Full TimeMountain View, California, United States3d ago
-
Robotics Program Manager, Product Data Operations USD 134K-202KBash | Data Curation | Data Quality | Data Quality Management | Data pipelineOnsite workMid-level Full TimeBurlingame, CA3d ago
-
Benchmarking | Code review | Data Pipelines | Distributed Systems | Evaluation FrameworksMid-level Full TimeMenlo Park, CA3d ago
-
Partner Engineering GenAI - US USD 133K-203KAPIs | Artificial Intelligence | C plus plus | Claude | Cloud ComputingSenior-level Full TimeMenlo Park, CA | Seattle, WA …3d ago