Inference Optimization Engineer
Tasks
- Build inference optimization platform
- Collaborate with engineering and product teams
- Create automated optimization loop
- Deliver LLM performance improvements to production
- Develop reusable optimization tools and libraries
- Optimize LLM inference performance
- Profile inference workloads end to end
- Publish technical content on LLM inference optimization
- Translate customer insights into optimization strategy
Perks/Benefits
- 401k matching
- Flexible paid time off
- Health insurance
- Remote work
- Team meetups
- Team onsites
- Travel opportunities
Skills/Tech-stack
ASIC | Cloud Computing | Distributed Systems | GPU Programming | Kubernetes | Language Models | Large Language Models | Machine Learning | Optimization | Performance Engineering | Profiling
Education
N/A
Related jobs
-
Senior-level Full TimeAnnapolis Junction, MD7h ago
-
AI Safety | API Development | AWS | Agentic Systems | Amazon Web ServicesSenior-level Full TimeRemote, Canada R8h ago
-
Fullstack Engineer, AI Integrations USD 50K-70KAWS | Agile | Alerting | C++ | CSSAgile team environment | Hybrid work | MentorshipEntry-level Full TimeMountain View, CA / San Francisco, … R9h ago
-
Entry-level Full TimeMountain View, CA / San Francisco, … R9h ago
-
API Integration | ARM | Angular | Appian | Azure DevOpsFlexible extensions contract | Hybrid work schedule | Knowledge transfer coaching | Onsite work with mission teamsSenior-level ContractAustin, United States10h ago
-
Algorithms | COT | Data Structures | Deep learning | Dense Vector DatabasesSenior-level Full TimeToronto11h ago
-
Data Science Team Leader USD 165K-165KCI/CD | Cloud platform | Docker | Google BigQuery | Google CloudSenior-level Full TimeDenver, Colorado, United States11h ago
-
Software Engineer, Systems ML USD 141K-208KC plus plus | CUDA | Co-design | Compiler optimization | Deep learningSenior-level Full TimeBellevue, WA | Menlo Park, CA …12h ago
-
Network Engineer, Foundation & Support USD 120K-184KAI Assisted Development | Automation | C# | C++ | Distributed SystemsGlobal team collaboration | Mentorship | On-the-job trainingEntry-level Full TimeDenver, CO | Reston, VA | …12h ago
-
RTL Design Engineer, Machine Learning Accelerators USD 138K-198KASIC design | Code review | Machine Learning | Machine Learning Accelerators | Memory hierarchyMid-level Full TimeSunnyvale, CA, USA12h ago
-
Agentic Workflows | Automated testing | Computer Vision | Data Processing | Function CallingSenior-level Full TimeMountain View, CA, USA12h ago
-
Technical Lead, AI/ML Infrastructure USD 207K-301KC# | C++ | Compute architecture | Cryptography | Distributed SystemsSenior-level Full TimeSunnyvale, CA, USA12h ago
-
Research Software Engineer USD 207K-301KData Structures | Data structures algorithms | Distributed Computing | Information Retrieval | Language ModelsBonus | Career development | Equity | Health insurance | Paid time offSenior-level Full TimeMountain View, CA, USA12h ago
-
Data Platform DevOps Engineer - Senior Consultant CAD 80K-138KAWS | Access Controls | Alerting | Azure | Azure DevOpsDeloitte Days | Development and Innovation Days | Flexible benefit spending account | Flexible work arrangements | Hybrid work arrangementSenior-level Full TimeToronto, ON, CA, M5H 0A917h ago
-
AI Research Engineer CAD 72K-138KAccelerate | Autogen | Benchmarking | DeepSpeed | DevOpsDeloitte Days | Flexible benefit spending account | Flexible work arrangements | Hybrid work structure | Learning daysMid-level Full TimeToronto, ON, CA, M5H 0A917h ago
-
GenAI & Python Specialist - Operate CAD 72K-125KAPI Development | Agent systems | Claude | Clustering | EmbeddingsFlexible work arrangements | Hybrid work arrangement | Learning days | Paid vacation daysMid-level Full TimeToronto, ON, CA, M5H 0A917h ago
-
Data Engineer (MS Fabric / Databricks) - Consultant CAD 58K-102KADLS | Azure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake StorageFlexible work arrangements | Hybrid work structure | Learning development and innovation days | Mental health support benefits | MentoringMid-level Full TimeToronto, ON, CA, M5H 0A917h ago
-
Principal AI Platform Engineer USD 104K-166KAPIs | Access Control | Audit trails | Data Engineering | Data GovernanceSenior-level Full TimeSan Francisco, CA21h ago
-
Artificial Intelligence Developer (AI) USD 114K-218KAmazon Web Services | C++ | Conda | Data Modeling | ETL401k matching | Employer Covered Dental Insurance | Employer Covered Disability Insurance | Employer Covered Vision Insurance | Employer-covered health insuranceMid-level Full TimeChantilly, VA22h ago
-
Sr. Embedded Software Engineer - Radar & DSP USD 165K-220KAgile | Anomaly Detection | C# | C++ | ClassificationHealth insurance | Onsite work | Professional development | Retirement plansSenior-level Full TimeHuntington Beach, CA22h ago
-
Distinguished Machine Learning Engineer - Safety USD 399K-457KComputer Vision | Data Architecture | Data Processing | Distributed Systems | Language ModelsEquity compensation | Onsite work schedule | Workplace inclusion cultureSenior-level Full TimeSan Mateo, CA, United States R22h ago
-
Senior Machine Learning Engineer, Personalization CAD 156K-222KA/B | A/B Testing | B testing | CI/CD | DatabricksSenior-level Full TimeRemote - Canada R23h ago
-
Data Engineer Senior Principal (Hybrid) USD 144K-195KAmazon S3 | Amazon Web Services | Amazon Web Services (AWS) | Apache Airflow | Apache Flink401k match | Health insurance | Hybrid work | Paid time offSenior-level Full TimeUSA NC Fort Bragg - 2929 … R23h ago
-
Gen AI Engineer USD 112K-168KAKS | AWS | Agile | Agile frameworks | Apache Spark401k match | Dental insurance | Financial education resources | Health insurance | Life insuranceMid-level Full TimeGA-ATLANTA, 740 W PEACHTREE ST NW, …23h ago
-
Lead Cloud Data and AI/ML Engineer, AVP USD 90K-157KAPI | AWS | AWS Lambda | Agentic AI | AirflowDental insurance | Employee assistance program | Family care benefits | Health insurance | Incentive compensationSenior-level Full TimeQuincy, Massachusetts, United States23h ago