Find jobs in AI/ML, Data Science and Big Data
270 results for Inference Optimization (Skill/Tech stack)
-
Lead Researcher, Large Language Models/LLM, TikTok USD 224K-410K
Data Processing | Deep learning | Inference Optimization | Language Models | Large Language Models
Senior-level Full Time · San Jose, California, United States · 6h ago
-
Intern, AI Engineering USD 64K-106K
CUDA | CUDA kernel | CUDA kernel development | Hugging Face | Inference Optimization
Entry-level Internship · San Francisco, California · 17h ago
-
AI Safety | API Management | Agile | App Service | Automated testing
Annual leave | Medical benefits | Retirement benefits scheme
Senior-level Full Time · Hong Kong · 17h ago
-
3D Rendering | Cloud Computing | Computer Vision | Containerization | DeepStream
Senior-level Full Time · France, Courbevoie · 18h ago
-
Field Engineering Intern - Summer 2026 USD 110K-170K
Benchmarking | Fine Tuning | Inference Optimization | LLM Inference | LLM Inference Optimization
401(k) plan match | Commuter stipend | Flexible paid time off | Health, dental, vision coverage | Wellness stipend
Entry-level Internship · San Francisco Office (Second St) · 1d ago
-
AWS | ArangoDB | Asynchronous programming | Context engineering | Distributed Systems
401k | Health, dental, vision coverage | Relocation assistance | Unlimited learning stipend | Visa sponsorship
Senior-level Full Time · San Francisco, California, United States · 1d ago
-
AI infrastructure | Benchmarking | Error rate | Error rate analysis | Inference Optimization
Autonomy | Continuous learning | Flexible location | Fully remote | International team
Mid-level Full Time · Australia (Remote) · 1d ago
-
Benchmarking | Bottleneck analysis | Cloud Computing | Edge Computing | Inference Optimization
Collaborative international team | Flexible location options | Fully remote | High technical ownership | Professional development opportunities
Mid-level Full Time · South Africa (Remote) · 1d ago
-
Benchmarking | CUDA | Docker | Inference Optimization | Kubernetes
Autonomy and technical ownership | Continuous learning and professional development | Flexible location options | Fully remote work | International team collaboration
Mid-level Full Time · Romania (Remote) · 1d ago
-
Benchmarking | Cloud Computing | Edge Computing | Inference Optimization | Latency optimization
Collaborative international team | Continuous learning and professional development | Flexible location options | Fully remote work | High impact technical ownership
Mid-level Full Time · Italy (Remote) · 1d ago
-
AI Systems Performance | AI systems | Benchmarking | Bottleneck analysis | Cloud Computing
Autonomy and creativity | Continuous learning and professional development | Flexible location options | Fully remote work | Innovation-driven culture
Mid-level Full Time · Portugal (Remote) · 1d ago
-
AI Systems Performance | AI systems | Benchmarking | Cloud Computing | Edge Computing
Continuous learning | Flexible location | Fully remote | International collaboration | Professional development
Mid-level Full Time · Netherlands (Remote) · 1d ago
-
AI Model Deployment | AI model | Artificial Intelligence | Benchmarking | Bottleneck analysis
Autonomy and creativity | Collaborative international team | Continuous learning and professional development | Flexible location options | Fully remote work
Mid-level Full Time · Ireland (Remote) · 1d ago
-
AI infrastructure | Artificial Intelligence | Benchmarking | Bottleneck analysis | Cloud Computing
Continuous learning and professional development | Flexible location options | Fully remote | High technical ownership
Mid-level Full Time · Switzerland (Remote) · 1d ago
-
Artificial Intelligence | Benchmarking | Cloud Computing | Data Pipelines | Edge Computing
Autonomy and creativity | Collaborative international team | Continuous learning and professional development | Flexible location options | Fully remote work environment
Mid-level Full Time · France (Remote) · 1d ago
-
AI infrastructure | Benchmarking | Bottleneck analysis | Cloud Computing | Deep learning
Continuous learning | Flexible location | Fully remote | Professional development | Technical autonomy
Mid-level Full Time · Germany (Remote) · 1d ago
-
Benchmarking | C++ | CUDA | Docker | Inference Optimization
Autonomy | Continuous learning | Flexible location | Fully remote | Professional development
Mid-level Full Time · Spain (Remote) · 1d ago
-
Benchmarking | CUDA | Docker | Inference Optimization | Kubernetes
Autonomy | Collaborative international team | Continuous learning | Flexible location options | Fully remote
Mid-level Full Time · Brazil (Remote) · 1d ago
-
AI infrastructure | Benchmarking | Bottleneck analysis | Cloud Computing | Edge Computing
Autonomy | Continuous learning | Flexible location options | Fully remote work | Professional development
Mid-level Full Time · Canada (Remote) · 1d ago
-
Artificial Intelligence | Benchmarking | Bottleneck analysis | Inference Optimization | Inference scalability
Autonomy | Continuous learning | Flexible location options | Fully remote | Professional development
Mid-level Full Time · India (Remote) · 1d ago
-
Sr. Staff AI Research TLM - AI Systems USD 270K-340K
Compute efficiency | Distributed Training | Energy Efficiency | Generative AI | Inference Optimization
Senior-level Full Time · Mountain View, California; San Francisco, California · 1d ago
-
Senior Software Engineer, Perception (Robotics) SGD 120K-135K
3D Perception | C++ | Calibration | Cause analysis | Ceres
Birthday leave | Employee assistance programme | FlexWork | GrabFlex Benefits Package | Medical insurance
Senior-level Full Time · Singapore, Singapore · 1d ago
-
Staff AI engineer USD 155K-225K
AI Evaluation | AWS | Agent Orchestration | Caching | Conversational Interfaces
Flexible working hours | Hybrid work culture | Unlimited time off
Senior-level Full Time · San Francisco · 1d ago
-
Machine Learning Engineer 4 INR 2475K-4500K
Attention Mechanism | Automated retraining | CI/CD | Checkpointing | DDP
Senior-level Full Time · Noida, India (Remote) · 1d ago
-
Staff Machine Learning Engineer, Voice AI USD 220K-280K
Audio codecs | Audio signal processing | Batching | CUDA | Deep learning
Health insurance | Startup equity
Senior-level Full Time · San Francisco · 2d ago
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism
100 percent remote
Senior-level Full Time · Remote job · 2d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332K
Computer Vision | Diffusion Models | Edge Computing | Expert parallelism | Flash Attention
Remote work
Senior-level Full Time · Remote job · 2d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332K
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism
English communication support | Remote work
Senior-level Full Time · Remote job · 2d ago
-
[Job - 29399] AI Solutions Architect, Brazil BRL 230K-270K
.NET | Amazon Bedrock | Amazon SageMaker | Apache Spark | Azure OpenAI
Childcare assistance | Continuous learning platform | Dental insurance | Discount club | Extended paternity leave
Senior-level Full Time · Brazil · 2d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332K
Diffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelism
Remote work
Senior-level Full Time · Remote job · 2d ago
-
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention
100 percent remote | Worldwide remote
Senior-level Full Time · Remote job · 2d ago
-
Data labeling | Data pipeline | Debugging | Inference | Inference Optimization
Equity compensation | Flexible leave options | Inclusive parental leave | Remote work within Australia | Wellbeing allowance
Senior-level Full Time · Sydney, Australia (Remote) · 2d ago
-
Forward Deployed Machine Learning Engineer USD 180K-300K
API Design | Cloud Computing | Deep learning | Diffusion Models | Fine Tuning
In-person collaboration days | Remote work flexibility | Travel cost coverage
Senior-level Full Time · San Francisco (USA) (Remote) · 2d ago
-
Senior-level Full Time · Shanghai, China · 2d ago
-
Senior Principal Machine Learning Engineer, vLLM USD 206K-351K
CPU architecture | Code review | Computer Vision | Deep learning | GPU Architecture
401k employer match | Employee stock purchase plan | Flexible spending account | Health savings account | Paid parental leave
Senior-level Full Time · Boston, United States (Remote) · 2d ago
-
Senior Machine Learning Engineer USD 174K-287K
Computer Vision | Deep learning | Gradient optimization | Graph theory | Inference Optimization
Paid parental leave | Paid time off
Senior-level Full Time · Boston, United States (Remote) · 2d ago
-
Staff Applied Scientist USD 164K-313K
Contrastive Learning | Data Curation | Data Quality | Data quality control | Diffusion Models
Senior-level Full Time · San Jose, United States (Remote) · 2d ago
-
Specialist Solutions Architect - AI/ML USD 180K-247K
AI guardrails | Amazon Web Services | Apache Spark | Artificial Intelligence | Cloud Computing
Mentorship | Remote work | Technical training | Travel up to 30 percent
Senior-level Full Time · United States · 2d ago
-
Embodied World Model Inference Infra Engineer - XiaomiRobotics CNY 240K-480K
CFG Parallelization | Diffusion Models | Expert parallelism | FP8 Quantization | Inference Optimization
Senior-level Full Time · Beijing · 2d ago
-
Entry-level Full Time · Beijing, Shanghai · 3d ago
-
AGI Senior Backend Engineer - Talkie & 星野 CNY 180K-300K
Data Engineering | Dify | Distributed Systems | Go | Inference Optimization
Mid-level Full Time · Beijing, Shanghai · 3d ago
-
AWS | Account architecture | Agent Orchestration | Batch Processing | CI/CD
Career development | Continuous learning opportunities | International travel | Mentorship
Senior-level Full Time · Ha Noi, Viet Nam · 3d ago
-
Computer Vision | Deep learning | Foundation Models | Inference Optimization | Language Models
Entry-level Full Time · Singapore, Singapore · 3d ago
-
Staff Software Engineer, Machine Learning, Google Chat USD 207K-300K
Agentic Workflows | Caching | Cloud Spanner | Continuous Delivery | Continuous integration
Senior-level Full Time · Sunnyvale, CA, USA · 3d ago
-
AWS | Agents | Amazon S3 | Artificial Intelligence | Cloud Storage
Onsite work schedule | Visa sponsorship
Mid-level Full Time · Massachusetts, Massachusetts, United States · 3d ago
-
AWS | Asynchronous programming | Context engineering | Distributed Systems | Embeddings
401k | Dental insurance | Health insurance | Learning stipend | Onsite work
Senior-level Full Time · New York, New York, United States · 3d ago
-
Senior Staff AI Scientist INR 2500K-4500K
AWS Bedrock | AWS SageMaker | Agent systems | Agentic AI | Autonomous Agents
Senior-level Full Time · IND19-01-Bengaluru-EPIP 122 (Phase II), India · 3d ago
-
AI Enablement - Full Stack Developer INR 2000K-3000K
AI Services | Angular | App Service | Async API | Authentication
Mid-level Full Time · Gurugram, DLF Downtown, India · 3d ago
-
Forward Deployed Engineer (Generative AI) USD 153K-222K
AWS Bedrock | Amazon SageMaker | Autogen | Azure OpenAI | Chroma
Career development | High individual responsibility | Travel to client sites
Senior-level Full Time · United States - Remote · 3d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332K
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention
English support | Remote work
Senior-level Full Time · Remote job · 3d ago