Staff Software Engineer - AI/ML Systems and Reliability
Tasks
- Architect AI/ML platform infrastructure
- Build MLOps model deployment pipelines
- Build inference infrastructure
- Build scalable platform services and APIs
- Create infrastructure as code tooling
- Design highly available systems
- Develop CI/CD pipelines and deployment automation
- Develop feature stores and model registries
- Evaluate and adopt emerging AI and ML infrastructure technologies
- Implement monitoring, alerting, logging, and tracing
- Improve reliability, scalability, observability, and operational efficiency
- Lead technical design and architecture discussions
- Participate in design, development, testing, code reviews, deployment, and support
- Troubleshoot production issues
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | Airflow | Azure | CI/CD | Cloud Native | Deployment Pipelines | Distributed Systems | Docker | Elasticsearch | Feature Stores | Inference infrastructure | Infrastructure as Code | Java | Kafka | Kubernetes | MLOps | Machine Learning | Microservices | Model Deployment | Model deployment pipelines | Model registries | MySQL | NoSQL Databases | Observability | PostgreSQL | Python | REST APIs | Ray | Redis | Relational databases | Snowflake | Spark