Senior AI Infra Engineer - Large Model Inference Systems (Multimodal/LLM/VLM)
San Jose, California, United States
USD 198K-368K (estimate) | Senior-level | Full Time
Tasks
- Build deployment pipelines for inference services
- Build inference systems for online traffic
- Develop high-performance inference kernels
- Enable AI-driven performance tuning and validation
- Ensure production reliability for inference at scale
- Implement global scheduling across heterogeneous compute
- Improve throughput and latency in production
- Optimize distributed inference for large models
- Support multimodal fusion and attention mechanisms
Perks/Benefits
- N/A
Skills/Tech-stack
Attention Mechanisms | Batching | CUDA | Data parallelism | Distributed Systems | Language Models | Large Language Models | Latency optimization | Load Balancing | Machine Learning | Mixture of Experts | Model Parallelism | Multimodal AI | Pipeline parallelism | Tensor Parallelism | Throughput Optimization | Triton
Education
N/A