Find jobs in AI/ML, Data Science and Big Data
27 results for Continuous batching (Skill/Tech stack)
-
ML Platform Engineer
USD 100K-150K
API Gateway | Abuse detection | Automated rollback | Autoscaling | C++
100% remote | Full-time W2 employment | H1B transfer support
Senior-level Full Time
United States - Remote R
1d ago
-
Senior Forward Deployed Engineer II (AI/ML)
INR 1800K-3500K
Agents SDK | CUDA | Cache optimization | Continuous batching | CrewAI
Mid-level Full Time
Bengaluru
2d ago
-
Senior Forward Deployed Engineer I (AI/ML)
INR 3000K-4800K
Agents SDK | CUDA | Continuous batching | CrewAI | Data Compression
Hybrid work | Travel up to 30%
Senior-level Full Time
Bengaluru
2d ago
-
AI Software Engineer
USD 151K-332K
C++ | CUDA | CUDA kernels | CUDA profiling | Cache Management
Community involvement | Health benefits | Hybrid work options | In-person work options | Remote work options
Mid-level Full Time
Seattle (WA), United States
5d ago
-
ML Platform Engineer
USD 100K-150K
API Gateway | Abuse detection | Automated rollback | Autoscaling | Batching
Remote work
Senior-level Full Time
United States - Remote R
5d ago
-
Senior-level Full Time
Shanghai, Beijing
7d ago
-
AI Performance Optimization Engineer
USD 100K-150K
Benchmarking | C++ | CUDA | Continuous batching | Cutlass
Remote work
Mid-level Full Time
United States - Remote R
8d ago
-
AI Performance Optimization Engineer
USD 100K-150K
Access Optimization | Attention Mechanisms | Benchmarking | C++ | CPU
Mid-level Full Time
United States - Remote R
8d ago
-
Continuous batching | Data parallelism | Deep learning | Distributed Training | Dynamic Memory
Computational resources access | Full sponsorship | Hired by Rakuten Asia after completion | Research exchanges
Mid-level Full Time
Crimson House Singapore
11d ago
-
Application Software Engineer, Inference
USD 135K-185K
Agent Orchestration | Agent SDK | Auto Scaling | Batch scheduling | C++
401k plan | Employee stock purchase plan | Long-term incentives | Medical, dental & vision coverage | Onsite Palo Alto
Entry-level Full Time
Palo Alto, CA
11d ago
-
Benchmarking | C++ | Continuous batching | Deep learning | Disaggregated serving
Senior-level Full Time
Sunnyvale, CA, USA
12d ago
-
AI Engineer
EUR 60K-80K
AWQ | AWS | Agent SDK | CI/CD | CUDA
Career growth opportunities | Permanent employment | Remote work option
Mid-level Full Time
Remote - Paris, France R
13d ago
-
Senior Machine Learning Engineer
USD 188K-282K
Adversarial Training | Calibration monitoring | Continuous batching | DPO | Deep learning
Senior-level Full Time
Palo Alto, CA
14d ago
-
AI Engineer
USD 100K-135K
AWQ | AWS | AWS EC2 | Agent Frameworks | CI/CD
401k match | Health insurance | Learning and development stipend | Paid parental leave | Paid time off
Mid-level Full Time
Remote USA - In Tandem R
19d ago
-
Inference Engineer
USD 180K-250K
CUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning
401k | Commuter allowance | Dental insurance | Flexible PTO | Health insurance
Mid-level Full Time
*HQ - San Francisco, CA R
24d ago
-
Senior Machine Learning Engineer (Inference Platform)
USD 175K-225K
AWS | Alerting | CI/CD | Continuous batching | Data Processing
Senior-level Full Time
Remote - USA R
26d ago
-
Product Manager - AI Inference & Model Serving
USD 165K-275K
AI Inference | Artificial Intelligence | Autoscaling | Cache Management | Continuous batching
Conference attendance | Professional development | Stock options | Training | Workstation provided
Mid-level Full Time
Austin, TX, United States
1mo ago
-
Senior MLOps Engineer - LLMs
EUR 56K-76K
A/B | A/B Testing | Argo | Async API | Authentication
Autonomy | Hybrid work model | Professional growth and learning
Senior-level Full Time
Netherlands - Amsterdam
1mo ago
-
Senior-level Full Time
Milpitas, CA, United States
1mo ago
-
AI/ML ASIC Architect
USD 163K-249K
ARM | ASIC architecture | AXI interconnect | Area Optimization | Attention Mechanisms
Senior-level Full Time
Milpitas, CA, United States
1mo ago
-
Product Manager - AI Inference & Model Serving
USD 160K-275K
AI Inference | Autoscaling | Cache Management | Cold Start | Cold Start Optimization
Conference attendance | Professional development and training | Stock options | Workstation provided
Mid-level Full Time
Austin, TX, United States
1mo ago
-
AI Platform Engineer
INR 1500K-2500K
Automated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batching
Mid-level Full Time
Bangalore, India
1mo ago
-
AI Platform Engineer
INR 1500K-2500K
Alerting | CUDA | Cause analysis | Continuous batching | GPU Profiling
Mid-level Full Time
Bangalore, India
1mo ago
-
Staff Forward Deployed Engineer
USD 195K-239K
Artificial Intelligence | Benchmarking | CUDA | CUDA Interconnect | Continuous batching
Employee assistance program | Flexible time off | Hybrid work | LinkedIn Learning | Local Employee Meetups
Senior-level Full Time
Seattle
1mo ago
-
Staff Forward Deployed Engineer
USD 195K-239K
Artificial Intelligence | Benchmarking | CUDA | Continuous batching | CrewAI
Conference reimbursement | Employee assistance program | Employee stock purchase program | Flexible time off | LinkedIn Learning
Senior-level Full Time
San Francisco R
1mo ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill
401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefits
Mid-level Full Time
San Francisco, CA
1mo ago
-
Principal Model Optimization Engineer
USD 295K-345K
CUDA | Continuous batching | GPU | LLM Inference | Machine Learning
Senior-level Full Time
San Mateo, CA, United States R
1mo ago