Senior MLOps Platform Engineer
Tasks
- Architect data pipelines into the data lake
- Build automated testing, canary releases, and rollbacks
- Build self-service model and endpoint tooling
- Create observability and monitoring stacks
- Design a unified MLOps platform
- Develop CI/CD pipelines for model packaging
- Ensure reproducibility, security, and compliance
- Implement and operate Kubernetes and AWS services
- Implement security: IAM, network policies, and secret management
- Manage inference latency, throughput, and resource utilization metrics
- Mentor junior engineers and maintain internal knowledge bases
- Optimize inference performance with scaling and quantization
- Plan disaster recovery for hybrid infrastructure
Perks/Benefits
- Flexible time off
- Healthcare
- Learning resources
- Retirement benefits
- Robust learning and development opportunities
- Wellness
Skills/Tech-stack
AWS EKS | Airflow | Amazon S3 | Amazon SageMaker | Argo CD | CI/CD | CloudWatch | Direct Connect | Distributed Systems | Docker | ELK | Flink | Flux | GPU scaling | GitHub Actions | GitLab CI | GitOps | Go | Grafana | Helm | IAM | Java | Kafka | Kubernetes | Kubernetes Operators | MLOps | Model Quantization | OpenTelemetry | Prometheus | Python | Rust | Spark | Step Functions | VPN
Education
Bachelor of Arts | Bachelor of Engineering | Bachelor of Science