Find jobs in AI/ML, Data Science and Big Data
34 results
for Distributed inference
(Skill/Tech stack)
-
Compute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelism100 percent remoteSenior-level Full TimeRemote job R1d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KCompute Shaders | Diffusion Models | Distributed inference | Edge Computing | Expert parallelismEnglish communication support | Remote workSenior-level Full TimeRemote job R1d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 201K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelismRemote workSenior-level Full TimeRemote job R1d ago
-
Diffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash Attention100 percent remote | Worldwide remoteSenior-level Full TimeRemote job R1d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Expert parallelism | Flash AttentionEnglish support | Remote workSenior-level Full TimeRemote job R3d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KCompute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge ComputingRemote workSenior-level Full TimeRemote job R3d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KComputer Vision | Deep learning | Diffusion Models | Distributed inference | Edge ComputingRemote workSenior-level Full TimeRemote job R3d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KCompute Shaders | Diffusion Models | Distributed Inference Systems | Distributed inference | Edge ComputingRemote workSenior-level Full TimeRemote job R3d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KDiffusion Models | Distributed Inference Systems | Distributed inference | Edge Computing | Expert parallelismCareer growth | Collaborative research environment | English communication support | Remote work opportunitySenior-level Full TimeRemote job R3d ago
-
AI Research Engineer (Kernel & Inference Optimization) USD 200K-332KCompute Shaders | Custom Compute Shaders | Data Pipelines | Diffusion Models | Distributed Inference SystemsRemote workSenior-level Full TimeRemote job R3d ago
-
Senior Machine Learning Engineer USD 152K-250KAutomation | Distributed Training | Distributed inference | GPU | Go401k | Employee assistance program | Flexible PTO | Flexible spending account | Health savings account contributionsSenior-level Full TimeLas Vegas, Nevada3d ago
-
AI Performance Optimization Engineer USD 136K-258KAccess patterns | Benchmarking | C++ | Cache optimization | Compiler optimizationFull-time W2 employment | Health benefits | Remote workMid-level Full TimeUnited States - Remote R3d ago
-
AI Performance Optimization Engineer USD 136K-258KBenchmarking | C++ | Compiler optimization | Continuous batching | DebuggingMid-level Full TimeUnited States - Remote R3d ago
-
AI Researcher, LLMs USD 200K-300KDataset curation | Distributed Training | Distributed inference | Fine Tuning | GPU ComputingEntry-level Full TimeLondon, United Kingdom; New York, NY, …5d ago
-
Senior AI Architect, APAC SGD 120K-135KAI Platform | Artificial Intelligence | Containerization | Data Pipelines | Deep learningSenior-level Full TimeSingapore R6d ago
-
Large Model Training Acceleration Engineer USD 187K-387KBenchmarking | Data parallelism | Deep learning | Distributed Training | Distributed inferenceMid-level Full TimeSan Jose, California, United States7d ago
-
Senior Solutions Architect, AI Hyperscalers USD 184K-356KCUDA | Cloud Computing | Containerization | Data Science | Distributed TrainingSenior-level Full TimeUS, CA, Santa Clara, United States7d ago
-
Engineering Manager - Forward Deployed Engineering (LLM) USD 260K-380KDistributed inference | Docker | GPU infrastructure | Hugging Face | LLM InferenceCompany 401K | Fertility and family building stipend | Flexible PTO | Medical, dental, and vision insurance | Paid parental leaveMid-level Full TimeSan Francisco11d ago
-
Audio Inference Engineer, Model Efficiency USD 165K-300KC++ | Deep learning | Distributed inference | GPU Programming | Low-level systemCo-working stipend | Health and dental benefits | Inclusive culture | Mental health budget | Parental leave top-upMid-level Full TimeNew York14d ago
-
Ansible | CI/CD | Distributed Training | Distributed inference | DockerExtra paid vacation and sick leave | Flexible working hours | Language courses | Private medical insurance | Remote workSenior-level Full TimeRemote but only within Poland R20d ago
-
Senior Software Development Engineer (AI/ML and GenAI Engineer) INR 2000K-4500KAWS | Agentic AI | Autogen | Azure | CrewAIAgile work environment | Cross-functional team collaboration | Mentorship opportunitiesSenior-level Full TimeMumbai, Maharashtra, India24d ago
-
Machine Learning Manager, Notifications Relevance USD 230K-322KArtificial Intelligence | Deep learning | Distributed Training | Distributed inference | Machine Learning401k employer match | Caregiving support | Coaching benefits | Family planning support | Flexible vacationMid-level Full TimeRemote - United States R27d ago
-
Machine Learning Engineer - Enterprise CAD 143K-200KAgentic Systems | Cloud Platforms | Data Security | Distributed Training | Distributed inferenceSenior-level Full TimeToronto29d ago
-
Senior AI Architect, APAC THB 2040K-2520KAgile | Containerization | Data Pipelines | Deep learning | DevOpsCareer mentoring | Open source technology work | Travel up to 40 percentSenior-level Full TimeBangkok - MSO - Gaysorn, Thailand R30d ago
-
Autoregressive models | CPU/GPU acceleration | Deep learning | Diffusion Models | Distributed TrainingMedical plan | Paid Holidays | Paid sick leaveEntry-level Full Time InternshipUS-California-Palo Alto, United States1mo ago
-
Senior Machine Learning Engineer USD 149K-294KAgent workflows | Automated Validation | CI/CD | Deep learning | Distributed TrainingContinuous learning opportunities | Flexible working models | Great benefits | Health and well-being | Hybrid work modelSenior-level Full TimeRiyadh, SA, 114351mo ago
-
Senior Machine Learning Engineer AED 312K-386KAgent workflows | CI/CD | Cloud Native | Deep learning | Distributed TrainingFlexible working models | Health and well-being benefits | Learning opportunities | Skill growthSenior-level Full TimeDubai, Dubai, AE, 1183531mo ago
-
Senior Solutions Architect, Generative AI Specialist USD 184K-356KAI Observability | Agent systems | Agentic AI | CUDA | Container OrchestrationComprehensive benefitsSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Senior Research Scientist - NLP USD 128K-266KAdapter methods | Data Pipelines | Decoder Only | Deep learning | Distributed Training401k | Backup childcare | Education stipends | Healthcare | Hybrid work optionsSenior-level Full TimeUS - United States of America1mo ago
-
C++ | CUDA | CUDNN | Cutlass | Distributed inferenceSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Senior Machine Learning Engineer (Large Systems) PLN 292K-507KC++ | CUDA | Cloud Computing | Deep learning | Distributed TrainingAnnual leave | Dental insurance | Employee Pension Matching | Gym membership | Medical insuranceSenior-level Full TimeGdańsk, Pomeranian Voivodeship, Poland1mo ago
-
Machine Learning Research Engineer USD 150K-275KCUDA | Deep learning | Distributed Training | Distributed inference | Inference OptimizationDaily meals | Housing subsidy | Medical, dental & vision coverage | Relocation supportSenior-level Full TimeCupertino, CA1mo ago
-
Applied Research - RL & Agents USD 295K-440KAccelerate | Agent Frameworks | Distributed Training | Distributed inference | DockerConference attendance | Flexible work | Professional development budget | Relocation support | Team offsitesMid-level Full TimeSan Francisco1mo ago
-
C++ | CUDA | Distributed inference | GPU Architecture | Graph CompilerEquity | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago