Inference Engineer
Tasks
- Design and build low latency scalable model inference and serving stack
- Design and build robust inference infrastructure and monitoring
- Implement inference pipelines for machine learning generative models
- Serve foundation model products with research and product teams
Perks/Benefits
- 401k
- Commuter allowance
- Dental insurance
- Flexible PTO
- Health insurance
- Meals and snacks
- Visa sponsorship support
- Vision insurance
Skills/Tech-stack
CUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning | Model Inference | Model Serving | Monitoring | Observability | Performance Engineering | Reliability Engineering | SGLang | State Space Models | State-Space | Transformers | Triton | VLLM
Education
Roles
Regions
Countries
States
Related jobs
-
Featured Feat. Associate Director, Data Labs USD 167K-167KAWS | Cloud Computing | Compute Infrastructure | Data Analysis | LLM GovernanceConference speaking opportunities | Hybrid work schedule | Media appearancesSenior-level Full TimeWashington, District of Columbia, 20004, United … R3d ago
-
Principal Engineer - Data Platform USD 221K-387KAWS | Airflow | Apache Hive | Apache Iceberg | Apache ImpalaRemote workSenior-level Full TimeSanta Clara, California, United States R10h ago
-
Senior-level Full TimeUnited States - Remote R1d ago
-
Edge AI Engineer USD 100K-150KC++ | Core ML | Cross Platform Inference | Cross-platform | DSPCareer growth potential | Full-time remote work | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Data Quality | Data labeling | Data quality monitoring100 percent remote | Career growth | Full-time employment | W2 employmentMid-level Full TimeUnited States - Remote R1d ago
-
AI Research Engineer USD 100K-150KAblation Studies | Accelerator hardware | Agentic Systems | Computer Vision | Data QualityMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Engineer USD 100K-150KActive Learning | Apache Beam | CI/CD | Caching | Code review100 percent remote | Career growth | Full-time employment | H1B transfer support | W2 employmentMid-level Full TimeUnited States - Remote R1d ago
-
AI Data Engineer USD 100K-150KApache Beam | CI/CD | Code review | Data Lineage | Data Modeling100 percent remote | Career growthMid-level Full TimeUnited States - Remote R1d ago
-
LLM Engineer USD 100K-150KDPO | Deep learning | Distributed Training | Efficient Attention | Efficient Fine TuningRemote workMid-level Full TimeUnited States - Remote R1d ago
-
Principal Applied AI Engineer, Finance USD 193K-340KAPI Development | AWS | Bias Mitigation | CI/CD | Churn modeling401k matching | Adoption Assistance | Development and career growth opportunities | Fertility treatments | Flexible work schedulesSenior-level Full TimeVirtual Office (Massachusetts), United States R1d ago
-
Senior-level Full TimeRemote US, United States R1d ago
-
AWS | Airflow | Apache Spark | Azure Event | Azure Event Hubs401k contribution | Career development opportunities | Comprehensive benefits package | Equity stock purchase | Incentive and recognition programsSenior-level Full TimePrimary location: Eden Prairie, MN R1d ago
-
Data Engineer USD 72K-130KAI/ML | Analytics engineering | Azure DevOps | Bronze Silver Gold | CI/CD401k contribution | Career development opportunities | Comprehensive benefits package | Equity stock purchase | Incentive and recognition programsMid-level Full TimePrimary location: Eden Prairie, MN R1d ago
-
NLP Engineer USD 72K-130KArtifact Repositories | Artifactory | C# | CI/CD | Containerization401k contribution | Career development opportunities | Comprehensive benefits package | Equity stock purchase | Incentive and recognition programsMid-level Full TimePrimary location: San Diego, CA R1d ago
-
Principal, Data Engineer USD 107K-216KAI/ML | Azure | Azure Data | Azure Data Factory | Azure DevOpsSenior-level Full Time11 Keewaydin Dr, Salem NH, United … R1d ago
-
Applied AI Engineer USD 120K-158KA/B | A/B Testing | API Integration | Anthropic API | B testingCareer growth | Fully remote | Global Engineering Organization | High ownership culture | Learning and development budgetMid-level Full TimeUnited States R1d ago
-
Lead AI Engineer (AI Systems & Automation) USD 130K-260KAlerting | Anthropic API | Automation | Distributed Systems | DockerFully remote | Global Engineering Organization | High ownership culture | Learning and development budget | Modern engineering practicesSenior-level Full TimeUnited States R1d ago
-
AWS | Application Security | Artificial Intelligence | Azure | Cloud SecurityConference speaking opportunities | Flexible schedule | Health Premium Plan Option | Mentorship | Paid trainingSenior-level Full TimeLos Angeles, California, United States R1d ago
-
Machine Learning Engineer V USD 231K-382KAWS | Agent Orchestration | Automated testing | Azure | CI/CDBonus eligibility | Disability insurance | Life insurance | Paid parental leave | Paid time offSenior-level Full TimeRemote, United States R2d ago
-
Senior AI Engineer USD 145K-181KAWS | Alerting | Azure | Docker | Embeddings401k match | Commuter benefits | Dental | Healthcare | Remote friendly workplaceSenior-level Full Time3750 Market Street, Philadelphia, PA, United … R3d ago
-
Senior Machine Learning Engineer USD 180K-250KComputer Vision | Data Pipelines | Data labeling | Deep learning | Embedding Models100 percent remote | 13 paid holidays | 401k plan | Dental insurance | Medical insuranceSenior-level Full TimeRemote USA R3d ago
-
Senior AI Engineer USD 250K-300KAPI Development | Artificial Intelligence | Cost Optimization | GitHub | Inference Optimization401k match | Co working sessions | Flexible PTO | Health and wellness allowance | Health insuranceSenior-level Full TimeSan Francisco (Hybrid) R3d ago
-
AWS | AWS CDK | Access Control | Airflow | Athena401k plan | Health insurance | Paid Holidays | Paid time off | Phone stipendSenior-level Full TimeSan Carlos - Hybrid R3d ago
-
Sr AI Engineer - Agentic Systems USD 166K-205KAI Safety | API Integration | Agent Orchestration | Artificial Intelligence | Distributed SystemsSenior-level Full TimeAnywhere, US R3d ago
-
Applied AI Specialist, Commercial Customer Success USD 105K-142KAPI Integration | Accuracy Monitoring | Automated testing | CRM | Evaluation FrameworksRemote workSenior-level Full TimeRemote - US R3d ago