Find jobs in AI/ML, Data Science and Big Data
89 results
for LLM Inference
(Skill/Tech stack)
-
AWS | Amazon Bedrock | Code review | Conversational State | Distributed SystemsFlexible paid time off | Health spending account | Long-term disability insurance | Medical, dental & vision coverage | RRSP employer matchingSenior-level Full TimeCanada1d ago
-
Senior Principal Machine Learning Engineer, vLLM USD 206K-351KCPU architecture | Code review | Computer Vision | Deep learning | GPU Architecture401k employer match | Employee stock purchase plan | Flexible spending account | Health savings account | Paid parental leaveSenior-level Full TimeBoston, United States R2d ago
-
AWS Bedrock | Agentic AI | Azure | Bias Variance | Bias-Variance TradeoffSenior-level Full TimeMcLean, VA, United States2d ago
-
Entry-level Full Time北京、上海2d ago
-
Senior Lead AI Engineer (Gen AI Platform Services) USD 229K-286KAWS | Azure | C# | C++ | Cost OptimizationSenior-level Full TimeSan Jose, CA, United States3d ago
-
Director, AI Engineering (Agentic AI Platform) USD 293K-335KAI orchestration | AWS | Artificial Intelligence | Azure | C#Executive-level Full TimeSan Jose, CA, United States3d ago
-
Computer Vision | Data Processing | Data Storage | Debugging | Deep learningSenior-level Full TimeSunnyvale, CA, USA5d ago
-
Research Engineer, ML Systems (All Industry Levels) USD 225K-400KCUDA | CUDA kernels | Cloud | Cutlass | DeepSpeedMid-level Full TimeRedwood City, CA5d ago
-
Senior ML Engineer- Poland PLN 267K-402KAPI Integration | Accelerator systems | Caching | Decode | GPU ComputingSenior-level Full TimeKraków, Małopolskie, PL6d ago
-
Senior-level Full Time北京6d ago
-
Senior Product Manager (LLM API Platform) SGD 180K-191KAPI Design | API Gateway | API Management | Cost of ownership | Developer experienceSenior-level Full TimeCrimson House Singapore7d ago
-
AWS | Artificial Intelligence | C# | C++ | Cloud ComputingSenior-level Full TimeSan Jose, CA, United States7d ago
-
AI Governance | AWS | Azure | C# | C++Senior-level Full TimeSan Jose, CA, United States7d ago
-
Deep learning | GPU clusters | HPC | High Performance | High ThroughputSenior-level Full TimeIsrael, Tel Aviv8d ago
-
Senior Lead AI Engineer (LLM Gateway, FM Hosting) USD 229K-286KAWS | Azure | C# | C++ | ExperimentationSenior-level Full TimeMcLean, VA, United States8d ago
-
Senior-level Full TimeSanta Clara, California, United States8d ago
-
ASR | Agile | Bash | C++ | CMakeCareer growth | International collaboration | Multicultural environment | On site training environmentEntry-level Full TimeCAIRO - CAI1, Egypt9d ago
-
Lead AI Engineer (FM Hosting, LLM Inference) USD 197K-245KAI Governance | AWS | C# | C++ | Cloud ComputingSenior-level Full TimeNew York, NY, United States9d ago
-
Deep learning | Fine Tuning | GPU Computing | LLM Inference | Language ModelsEntry-level InternshipSan Jose, California, United States9d ago
-
Principal Machine Learning USD 120K-220KAI Observability | AWS | AWS Bedrock | Agentic AI | Amazon SageMakerSenior-level Full TimeLivonia, MI, United States R9d ago
-
Senior-level Full TimeSan Jose, CA, United States10d ago
-
Senior Applied AI Engineer USD 129K-185KAPIs | Apache Airflow | Apache Beam | Artificial Intelligence | BigQuery401k matching | Healthcare benefits | Online learning platform | Paid time offSenior-level Full TimeUSA - Georgia - Alpharetta - …10d ago
-
Solution Architect (AI/LLM Inference) USD 165K-330KArtificial Intelligence | Benchmarking | Embeddings | GPU Selection | Image Generation401k company match | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco10d ago
-
Engineering Manager - Forward Deployed Engineering (LLM) USD 260K-380KDistributed inference | Docker | GPU infrastructure | Hugging Face | LLM InferenceCompany 401K | Fertility and family building stipend | Flexible PTO | Medical, dental, and vision insurance | Paid parental leaveMid-level Full TimeSan Francisco12d ago
-
AWS | Azure | Batching | CI/CD | Container OrchestrationFlexible working hours | Generous vacation | Parental leave | Visa sponsorship optionalSenior-level Full TimeSan Francisco, CA | Seattle, WA12d ago
-
AI Governance | AI Observability | AWS | Artificial Intelligence | AzureSenior-level Full TimeSan Jose, CA, United States13d ago
-
Senior-level Full TimeIraklio, Greece13d ago
-
Senior Developer Relations Manager CNY 348K-480KAPIs | Accelerated computing | Agentic AI | CUDA | Data MigrationSenior-level Full TimeChina, Beijing14d ago
-
AI Governance | AWS | Azure | C# | C++Financial benefits | Health benefits | Incentive compensation | Inclusion supportSenior-level Full TimeCambridge, MA, United States14d ago
-
Senior-level Full TimeMcLean, VA, United States R14d ago
-
AI Engineer (m/w/d) EUR 47K-47KArgoCD | Automated testing | Clean Code | Code review | DPOCompany pension | Corporate benefits | Professional developmentSenior-level Full TimeBerlin, Berlin, DE15d ago
-
【26届校招】大语言模型后训练算法工程师(Foundation Model) CNY 240K-480KAxolotl | Cloud Computing | Data loading | Distributed Training | DockerEntry-level Full Time上海、深圳15d ago
-
Principal Model Optimization Engineer USD 295K-345KCUDA | Continuous batching | GPU | LLM Inference | Machine LearningSenior-level Full TimeSan Mateo, CA, United States R15d ago
-
AI Engineer (Remote, International) USD 78K-110KArtificial Intelligence | Embeddings | Java | Kubernetes | LLM InferenceDirect stakeholder access | Flat hierarchy | International team | Remote workEntry-level Full TimeUnited States R15d ago
-
Senior-level Full TimeTaichung - AATT, Taiwan16d ago
-
Senior Deep Learning Performance Architect USD 184K-356KASIC architecture | C++ | Deep learning | GPU Architecture | InterconnectBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Distinguished AI Engineer USD 244K-335KAI Governance | AWS | C# | C++ | ExperimentationHealth benefits | Incentive compensationSenior-level Full TimeMcLean, VA, United States16d ago
-
Agent architecture | Browser Automation | Data Processing | Deep learning | EvaluationDirect founder exposure | High ownership | On-site collaboration | Research to production ownership | Visa sponsorshipMid-level Full TimeSan Francisco, CA; Onsite R16d ago
-
Senior GenAI Engagement Lead, Partner Platforms USD 184K-356KAgent systems | Agentic AI | Autogen | Automl | Cloud NativeSenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Senior GenAI Technical Lead, Partner Platforms USD 184K-356KAWS | Agent systems | Agentic AI | Autogen | AzureSenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Knowledge Base AI Engineer USD 150K-186KCI/CD | Clojure | Data Ingestion | DeepEval | DockerBirthday vouchers | Corporate events | Fit pass | Flexible schedule | Modern officeMid-level Full TimeBelgrade, Belgrade, Serbia16d ago
-
AWS | C# | C++ | Cloud Computing | Deep learningSenior-level Full TimeCambridge, MA, United States20d ago
-
AI Engineer - Model Performance USD 165K-250KAttention Backend | Audio Processing | Batching | CUDA | CUDA graphAsync communication | Innovation-focused culture | Remote work | Startup environment | Supportive teamMid-level Full TimeSF Hybrid R20d ago
-
Research Engineer, Frontier Safety Mitigations, DeepMind GBP 225K-300KAI Coding Agents | AI coding | Adversarial Machine Learning | Anomaly Detection | Coding AgentsMid-level Full TimeLondon, UK20d ago
-
AWS | Asynchronous Architecture | Azure | Batching | CI/CDOnsite workSenior-level Full TimeLahore, Pakistan20d ago
-
Lead AI Engineer (Gen AI Platform Services) USD 215K-245KAWS | AWS Ultraclusters | Azure | C# | C++Senior-level Full TimeSan Jose, CA, United States21d ago
-
Research Scientist - AI Compute & DPU - Global Frontier Tech Recruitment Program - 2027 Start (PhD) USD 212K-387KAI Agent | AI agent workflows | Agent workflows | CPU Scheduling | Cause analysisNone Full TimeSan Jose, California, United States21d ago
-
Research Scientist - AI Compute & DPU - Global Tech Research Program - 2027 Start (PhD) USD 202K-368KAI Agent | AI agent workflows | Agent workflows | CPU Scheduling | CachingNone Full TimeSeattle, Washington, United States21d ago
-
Classification | Contrastive Learning | Cross-lingual NLP | Distributed Training | Embedding ModelsEarly-stage startup | High ownership role | Visa sponsorshipMid-level Full TimeRemote R22d ago
-
Mid-level Full Time深圳、上海23d ago