Inference Engineer
Tasks
- Design and build low latency scalable model inference and serving stack
- Design and build robust inference infrastructure and monitoring
- Implement inference pipelines for machine learning generative models
- Serve foundation model products with research and product teams
Perks/Benefits
- 401k
- Commuter allowance
- Dental insurance
- Flexible PTO
- Health insurance
- Meals and snacks
- Visa sponsorship support
- Vision insurance
Skills/Tech-stack
CUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning | Model Inference | Model Serving | Monitoring | Observability | Performance Engineering | Reliability Engineering | SGLang | State Space Models | State-Space | Transformers | Triton | VLLM
Education
Roles
Regions
Countries
States
Related jobs
-
Senior GenAI Software Engineer (North America) USD 165K-230KA/B | A/B Testing | B testing | Debugging | EvaluationEquity | Health, dental, and vision benefits | In person team gatherings quarterly | Remote-first work | Wellness stipendsSenior-level Full TimeUnited States R14h ago
-
Senior Software Engineer, AI Developer Experience USD 202K-230KAPI Integration | Agentic Workflows | Artificial Intelligence | Code review | Command LineCareer coaching and support | In-office culinary options | Inclusive family building benefits | Long term savings or retirement plans | Mental health wellness and fitness benefitsSenior-level Full TimeNew York City R14h ago
-
Machine Learning Scientist, BioML USD 200K-330KAWS | Azure | Bioinformatics | Cloud Computing | Computational Biology401k employer match | Equity participation | Health, dental, vision insurance | Paid time off | Professional developmentMid-level Full TimeEmeryville, California, United States; Hybrid (2-3 … R14h ago
-
Machine Learning Platform Engineer USD 135K-160KAmazon SageMaker | Apache Flink | C++ | CI/CD | Cloud PubSub401k match | Annual bonus | Company equipment provided | Company medical dental vision plans | Disability benefitsMid-level Full TimeAtlanta, GA preferred, Remote R16h ago
-
Machine Learning Engineer, Customer Support Engineering USD 162K-186KAgent Orchestration | Agent systems | Artificial Intelligence | Autonomous Reasoning | Fine TuningSenior-level Full TimeRemote-USA R16h ago
-
Senior Developer Advocate - Modern App Development USD 194K-237KAPI Integrations | AWS | Cloud platform | Code Quality | Google CloudCommunity groups | Employee stock purchase plan | Inclusion talks | Mental health benefits | Mentor/Buddy programSenior-level Full TimeCalifornia, USA, Remote; Nevada, USA, Remote; … R16h ago
-
Staff Software Engineer, AI Developer Tools USD 180K-245KAPI Design | Agent systems | CI/CD | Compliance | Data PrivacySenior-level Full TimeDenver, CO;San Francisco, CA;New York, NY;Seattle, … R16h ago
-
Staff Software Engineer, Big Data Storage USD 177K-364KApache Flink | Apache Hive | Apache Iceberg | Apache Spark | Column BackfillSenior-level Full TimePalo Alto, CA, US; Remote, US R17h ago
-
Senior Embedded Software Engineer - Future Forward USD 153K-201KAuthentication | Board Bring-up | Bring-up | C# | C++Senior-level Full TimeSunnyvale, CA, United States R17h ago
-
Lead AI Engineer, Business Operations (Hybrid or Remote USD 150K-220KAPI Design | Backend Development | Cloud Platforms | Evaluation Frameworks | Fine Tuning401k company match | Career advancement opportunities | Dental insurance | Flexible time off policy | Life insuranceSenior-level Full TimeDallas, Texas, United States; United States R18h ago
-
AWS | Airflow | Apache Spark | Azure Synapse | Azure Synapse Analytics401k matching | Disability insurance | Employee assistance program | Life insurance | Medical/Dental/Vision insuranceMid-level Full TimeRemote, USA ; Remote, Canada R19h ago
-
Principal Data Engineer/ Technical Lead USD 219K-298KAWS | Access Layer | Aggregation pipelines | Apache Kafka | Apache Spark401k match | Employer paid medical/dental/vision | Flexible spending account | Paid parental leave | Remote first work from homeSenior-level Full TimeUnited States (Remote) R20h ago
-
Senior Software Engineer II - (AI Core Platform) USD 100K-177KAPI Development | API Gateway | AWS | Agile | AlertingMid-level Full TimeRemote, United States R20h ago
-
Senior Software Engineer I - AI/ML USD 145K-190KAPI Development | Agile | Alerting | CI/CD | Data ModelingSenior-level Full TimeRemote, United States R20h ago
-
AI Expert USD 148K-175KAWS | Agile | Batch Processing | Data Mapping | Data ModelingHybrid work | Public Trust Clearance | Remote workSenior-level Full TimeMemphis, TN, United States R20h ago
-
People Analytics AI Engineer USD 146K-221KAPI Integration | AWS | Amazon Redshift | Automation | Data ModelingFlexible working | Health benefits | Parental leave plans | Professional development stipend | Remote ModelSenior-level Full TimeRemote - Seattle R21h ago
-
Senior AI Integration Engineer USD 190K-190KAWS | AgenticAI | AmazonS3 | Bash | BedrockPart-time remote workSenior-level Full TimeNew York, New York, United States R22h ago
-
Sr . SAP Datasphere + Databricks Engineer - Hybrid USD 180K-248KAPI | Access Control | CI/CD | Data Classification | Data GovernanceHybrid workSenior-level ContractDurham or Philadelphia, United States R1d ago
-
Assistant Director, Data Science (STP) USD 139K-226KData Visualization | GLM | Insurance pricing | MLOps | Machine LearningDomestic travel | TelecommutingExecutive-level Full TimeBoston, MA, United States R1d ago
-
Principal Machine Learning Engineer USD 205K-230KAWS Lambda | BigQuery | C# | CI/CD | Cloud Functions401k | Dental insurance | Health insurance | Life insurance | Paid HolidaysSenior-level Full TimeUnited States of America - Remote … R1d ago
-
Generative AI Engineering Intern (Graduate) USD 70K-70KAWS | Agile | Azure OpenAI | Azure OpenAI Service | CI/CDDedicated mentorship | Flexible scheduling | Networking opportunities | Potential full-time employment | Remote friendly schedulingEntry-level Full Time InternshipUnited States R1d ago
-
Freelance Machine Learning Engineer USD 180KLangchain | MLOps | Machine Learning | NumPy | PandasFlexible part-time hours | Project-based assignments | Remote workMid-level FreelanceTexas, United States - Remote R1d ago
-
Edge AI Engineer USD 100K-150KC plus plus | Core ML | Deep learning | Edge Computing | Embedded SystemsCareer growth | No third party employment | Remote work | W2 employmentSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer (Applied AI) USD 100K-150KAblation Studies | Accelerator hardware | Data Quality | Data Validation | Data labelingMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Infrastructure Engineer USD 100K-150KApache Beam | Apache Spark | CI/CD | Caching | Code reviewCareer growth | Health benefits | Remote workMid-level Full TimeUnited States - Remote R1d ago