AI Inference Engineer
Burlingame, California, United States
R
USD 110K-270K Senior-level Full Time
Tasks
- Benchmark model performance and accuracy
- Convert models for deployment
- Develop tools to scale deployment
- Improve SDK and runtime
- Optimize inference deployment for latency
- Port AI models to platform
- Profile model performance
- Quantize and prune models
- Support customers with technical documentation
Perks/Benefits
- 401k retirement plan
- Commuting support
- Company Provided Lunches
- Flexible paid time off
- Medical, dental, vision plans
Skills/Tech-stack
Benchmarking | C# | C++ | Hugging Face | Hugging Face Transformers | Inference Optimization | Llama.cpp | Machine Learning | Model Accuracy | Model Pruning | Model Quantization | Model profiling | Neural Compressor | ONNX Runtime | PyTorch | Python | VLLM
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Senior Machine Learning Engineer USD 198K-287KData Engineering | Fine Tuning | Foundation Models | GenAI | Incident ResponseOn-call rotationSenior-level Full TimeRemote - US R15h ago
-
Senior-level Full TimeRemote, US R16h ago
-
Sr. Staff Machine Learning Engineer, Content Ecosystem USD 227K-469KCausal Inference | Data Quality | Experimentation | Game theory | Language ModelsSenior-level Full TimeSan Francisco, CA, US; Remote, US R16h ago
-
Senior Data Platform Engineer USD 133K-197KAWS | Amazon IAM | Amazon Redshift | Ansible | Apache IcebergDental benefits | Free 1Password account | Generous paid time off | Health benefits | Maternity and Parental Leave Top-UpSenior-level Full TimeRemote (United States | Canada) R17h ago
-
AI Engineer I - Hybrid USD 125K-135KAI Services | API Development | Agentic Workflows | Azure | Azure AIHealth insurance | Hybrid work | Paid time off | Remote work options | Retirement planSenior-level Full TimeWindsor, Colorado, United States R17h ago
-
Senior Machine Learning Engineer, Vector Bidding Science USD 148K-229KA/B | A/B Testing | B testing | BigQuery | Control TheoryCommute subsidy | Disability insurance | Employee assistance program | Employee resource groups | Employee stock ownershipSenior-level Full TimeRemote, Washington, USA R17h ago
-
Staff Machine Learning Engineer, Vector Bidding Science USD 172K-266KA/B | A/B Testing | B testing | BigQuery | Control TheoryCommute subsidy | Employee resource groups | Employee stock ownership | Generous vacation | Global employee assistance programSenior-level Full TimeRemote, Washington, USA R17h ago
-
Senior ML Engineer USD 152K-228KAmazon SageMaker | Baseten | Batching | CI/CD | Google Vertex401k matching | Dental insurance | Fertility assistance | Flexible time off | Health insuranceSenior-level Full TimeRemote - US & Canada (see … R17h ago
-
Data Engineer - Assistant Vice President USD 100K-135KAWS | Apache Airflow | Apache Flink | Apache Spark | AzureEmployer-Matched Retirement Plan | Parental leave | Remote work Friday flexibility | Subsidized healthcare | Unlimited paid time offExecutive-level Full TimeSalt Lake City, Utah, United States R17h ago
-
Sr AI Engineer USD 124K-171KAPIs | Agentic Workflows | Event Driven | Event-driven architecture | JavaScriptFlexible time off | Learning and development stipend | Medical, dental, and vision insurance | Mental wellbeing resources | Paid HolidaysSenior-level Full TimeRemote - United States R18h ago
-
Staff Machine Learning Engineer, Ads Conversion USD 222K-389KA/B | A/B Testing | B testing | Computer Vision | Data MiningSenior-level Full TimeSan Francisco, CA, US; Palo Alto, … R18h ago
-
Senior Solution Engineer Optimization USD 109K-150KAI machine learning | APIs | Advanced Analytics | Algorithms | Analytical problem-solvingHybrid option | Remote work | Travel for customer engagementsSenior-level Full TimeRemote (United States) R18h ago
-
Senior Analytics Engineer USD 135KAmazon Redshift | Anomaly Detection | DBT | Data Modeling | LLMFinancial benefits | Full remote options | Hybrid schedule | Medical benefits | Stock optionsSenior-level Full TimeNew York, New York, United States R19h ago
-
Full-Stack Engineer, AI Data Platform USD 130K-200KAWS DynamoDB | Cassandra | Cloud platform | GCS | Google CloudHybrid workMid-level Full TimeSan Francisco Bay Area R20h ago
-
Azure | Azure AI | Azure Data | Azure Data Factory | Azure DevOpsEU remote work days | Employee assistance program | Employer pension plan | Flexible working hours | JobradSenior-level Full TimeHomeoffice R21h ago
-
Senior Engineer - Ingestion & Streaming Frameworks USD 150K-190KAWS DMS | Airbyte | Amazon Web Services | Apache Airflow | Apache SparkSenior-level Full TimeRemote - United States R21h ago
-
Data Engineer II USD 110K-125KAPI | AWS Glue | AWS Lambda | Alerting | Amazon Redshift4 week paid sabbatical | 401k match | Flexible PTO | Fully remote | Health benefitsMid-level Full TimeRemote, US R21h ago
-
AI-ML Engineer - Finance-Hybrid USD 155K-230KAgentic AI | CI/CD | Cloud Computing | Computer Vision | Data EngineeringContinuing education | Dental insurance | FSA | HSA | Health insuranceSenior-level Full TimeRochester, MN, United States R1d ago
-
Engineer IV, Embedded Software Developer USD 111K-133KC# | Change Management | Circuit Schematics | Digital circuit | Digital circuit schematicsTelecommutingEntry-level Full TimePeoria, IL, US, 61639 R1d ago
-
DMLSS Business Data Analyst / Data Engineer USD 55K-65KConfluence | Data Governance | Data QA | Data QA/QC | Data QualityFederal clearance support | Remote workMid-level Full TimeRemote, United States R1d ago
-
Data Engineer USD 89K-141KAWS Glue | AWS Lambda | Access Control | Amazon Kinesis | Amazon QuickSightFully remote | Secret clearanceMid-level Full TimeUnited States R1d ago
-
Junior AI Engineer (Open to remote) USD 110K-135KAPI Development | Language Model | Language Model Evaluation | Language Models | Language Processing401k | Dental insurance | Health savings account | Medical insurance | Paid time offEntry-level Full TimeNew York, NY, US, NY 10019 R1d ago
-
Senior Healthcare Data Engineer USD 85K-125KAPI Integration | Access Control | Agile | Amazon Web Services | AzureCareer development | Employee benefits | Flexible work across time zones | Remote work option | Wellbeing supportSenior-level Full TimeRemote, United States R1d ago
-
Senior Data Platform Engineer, Remote USD 135K-180KAWS | AWS Lambda | Access Control | Amazon Aurora | Amazon CloudWatchSenior-level Full TimeUnited States, UNITED STATES, United States R1d ago
-
AI Software Engineer USD 181K-270KAWS | CI/CD | Docker | Edge Functions | GitHub CopilotComprehensive benefits | Equity | Learning stipend | Remote-first cultureSenior-level Full TimeUnited States or Canada R1d ago