Senior Inference Engineer - AI
United States of America, Eagan, Minnesota
R
USD 100K-204K Senior-level Full Time
Tasks
- Build containerized inference pipelines
- Collaborate with cloud engineers on automation and guardrails
- Deploy inference workloads on GPUs
- Ensure deployment monitoring governance and drift detection
- Implement observability and health monitoring
- Implement routing and failover strategies
- Integrate models into production APIs
- Optimize and scale model inference
- Optimize quantization pruning distillation and precision
- Productionize AI and LLM workloads
- Profile inference performance and optimize utilization
- Reduce inference latency
Perks/Benefits
- Career development
- Flexible work hours
- Hybrid work model
- Mental health days
- Retirement savings
- Tuition reimbursement
- Wellbeing programs
- Work from anywhere up to 8 weeks per year
Skills/Tech-stack
API Design | API Integration | AWS | Azure | C++ | CI/CD | CUDA | GCP | GPU Computing | Knowledge Distillation | Kubernetes | Microservices | OCI | ONNX Runtime | OpenSearch | Pruning | PyTorch | Python | Quantization | Retrieval-Augmented Generation | Snowflake | TensorFlow | TensorRT | Vector Search
Education
N/A
Related jobs
-
Applied AI Engineer, Agentic Systems USD 115K-192K.NET | APIs | Anthropic | CrewAI | Evaluation FrameworksAI and productivity tools access | Remote work accessSenior-level Full TimeRemote - United States R10h ago
-
Senior Industrial Engineer, Process Optimization USD 100K-120K5S | AutoCAD | Cause analysis | Cost modeling | Excel401k | Dental insurance | Disability insurance | Flexible spending account | Health savings accountSenior-level Full TimeBethlehem, PA, United States R14h ago
-
AWS | Cloud Data | Cloud data warehousing | Data Modeling | Data WarehousingSenior-level Contract Full TimeRemote, OR, United States R1d ago
-
Deployment DevOps Engineer USD 135K-155KAKS | ArgoCD | Containers | DNS | DevSecOps401k matching | Dental insurance | Health insurance | Mental health support | Unlimited PTOEntry-level Full TimeNew York Office R1d ago
-
Jaeger Analytics / Development Engineer USD 200K-285KCI/CD | Distributed tracing | Grafana | Jaeger | MicroservicesRemote workMid-level ContractUnited States (Remote) R2d ago
-
Software Engineer, Machine Learning USD 213K-293KAI ethics | API Design | Agent Orchestration | Artificial Intelligence | Bias MitigationSenior-level Full TimeSunnyvale, CA | Remote, US | … R2d ago
-
Sr AI Engineer USD 124K-171KAPIs | Cause analysis | Code review | JavaScript | JiraCompany year end break | Flexible time off | Learning and development stipend | Medical/Dental/Vision insurance | Mental wellbeing resourcesSenior-level Full TimeRemote - United States R2d ago
-
AI Research Engineer (Applied AI) USD 150K-222KAccelerator hardware | Agentic Systems | Computer Vision | Data labeling | Deep learningRemote workMid-level Full TimeUnited States - Remote R2d ago
-
LLM Fine-Tuning Engineer USD 150K-270KAdapter-Tuning | DPO | Dataset curation | Efficient Attention | EvaluationHealth insurance | Paid time off | Remote workMid-level Full TimeUnited States - Remote R2d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess Optimization | Attention Optimization | Benchmarking | C++ | Compiler optimizationMid-level Full TimeUnited States - Remote R2d ago
-
Prompt Engineering Architect USD 119K-228KAgent Frameworks | Chunking | Embeddings | Evaluation | Fine TuningSenior-level Full TimeUnited States - Remote R2d ago
-
Robotics Software Engineer USD 125K-169KBehavior Trees | C++ | Concurrent Systems | Control | Embedded SystemsMid-level Full TimeUnited States - Remote R2d ago
-
API Design | AWS | AWS Lambda | Agentic AI | Amazon EC2Senior-level Full TimeOffice Location or Remote - USA R2d ago
-
API Design | AWS | Agentic AI | Cypher | Data ArchitectureSenior-level Full TimeOffice Location or Remote - USA R2d ago
-
Senior AI Engineer - Contract USD 136K-172KBehavior Trees | C# | C++ | CPU Optimization | Game AICareer improvement plan | Company events | Flexible work arrangements | Generous time-off policy | Medical, dental & vision coverageSenior-level Full TimeIrvine, CA R2d ago
-
Principal Engineer - GenAI Applications & MLOps USD 175K-242KAWS | Bigtable | Data integration | Distributed Systems | Event ProcessingRemote US basedSenior-level Full TimeUS Remote R2d ago
-
Founding Sr. Data Engineer USD 158K-198KAirbyte | Amazon Redshift | BigQuery | DBT | Data IngestionSenior-level Full TimeAnywhere in the US R2d ago
-
Staff AI Engineer USD 200K-300KAccuracy Monitoring | Agent systems | Artificial Intelligence | Authentication | Authorization401k eligibility | Hybrid work | Paid time off | Parental leave | Remote workSenior-level Full TimeUnited States (Remote) R2d ago
-
AI Analyst USD 80K-120KAWS | Azure | Computer Vision | Data Analysis | Deep learning401k employer match | AD&D insurance | Dental insurance | Health insurance | Life insuranceMid-level Full TimeRemote, United States R2d ago
-
Principal AI Software Engineer USD 224K-308KAWS | Cloud Computing | Data Processing | Docker | Endpoint Security401k match | Adoption and surrogacy reimbursement | Cancer Care Program | Dependent care FSA | Employee assistance programSenior-level Full TimeUnited States - Remote R2d ago
-
Senior Analytics Engineer USD 140K-170KAirbyte | DBT | Data Governance | Data Modeling | GitHubFlexible work schedule | Paid time off | Remote-friendly work environment | Team inclusionSenior-level Full TimeRemote - US R2d ago
-
Senior AI Developer USD 106K-133KAI SDK | AVA | Agentic AI | Agile | Cloud Foundry401k matching | Bereavement | Employee assistance program | Health, dental, and vision care | HolidaysSenior-level Full TimeRemote - Nationwide, United States R2d ago
-
Sr. Agentic AI Software Engineer USD 139K-258KAgent Orchestration | Architecture | Claude Code | Context engineering | DebuggingSenior-level Full TimeFarmington Hills or Remote (US only) R2d ago
-
Sr. Engineer, Machine Learning USD 127K-228KAWS | Azure | Bias Mitigation | CI/CD | Data EngineeringSenior-level Full TimeUnited States R3d ago
-
Sr. Engineer, Machine Learning USD 127K-228KAWS | Azure | CI/CD | Deep learning | Delta LakeRemote work | Time off | Volunteer days | Wellness initiativesSenior-level Full TimeUnited States R3d ago