Find jobs in AI/ML, Data Science and Big Data
80 results for SGLang (Skill/Tech stack)
-
Intern Engineer – RL Post-Training for LLMs; CAD 58K-104K; Data Generation | Deep learning | DeepSpeed | Distributed Training | GRPO; Internship; Entry-level Internship; Vancouver, British Columbia, Canada; 1d ago
-
Mid-level Full Time; Seattle (WA), United States; 2d ago
-
Inference Intern; USD 60K-142K; C++ | Collective communication | Compilers | Consensus Protocols | Consistency models; Daily meals | Direct mentorship | Housing support | Paid internship; Entry-level Internship; San Jose; 3d ago
-
LLM Post-Training Algorithm Engineer; CNY 240K-480K; Distributed Training | Docker | Fine Tuning | Human Feedback | Kubernetes; Mid-level Full Time; Shenzhen, Shanghai; 4d ago
-
AWQ | AWS | Batching | CPU architecture | CUDA; Senior-level Full Time; Guangzhou, Guangdong, China; 5d ago
-
Inference Engineer; USD 180K-250K; CUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning; 401k | Commuter allowance | Dental insurance | Flexible PTO | Health insurance; Mid-level Full Time; *HQ - San Francisco, CA; R; 5d ago
-
AI Framework Software Engineer; CNY 300K-420K; Asynchronous Communication | C++ | Computational graphs | Data parallelism | Deep learning; On-site work environment; Entry-level Full Time; CHN - Minhang, China; 5d ago
-
Artificial Intelligence | Attention Mechanisms | Benchmarking | C++ | GEMM; Entry-level Full Time Internship; China, Beijing; 6d ago
-
Senior Machine Learning Engineer (Inference Platform); USD 175K-225K; AWS | Alerting | CI/CD | Continuous batching | Data Processing; Senior-level Full Time; Remote - USA; R; 6d ago
-
Senior-level Full Time; US, CA, Remote, United States; R; 8d ago
-
Mid-level Full Time; Beijing; 8d ago
-
Senior Solutions Architect - Generative AI; INR 2475K-4500K; Argo | CI/CD | CUDA | Evaluation | FedRAMP; Senior-level Full Time; India, Pune; 13d ago
-
Engineering Manager, Inference Benchmarking - AI Perf; USD 224K-356K; DCGM | Distributed Systems | GPU Telemetry | GPU observability | Helm; Senior-level Full Time; US, CA, Santa Clara, United States; 13d ago
-
Product Manager - AI Inference & Model Serving; USD 165K-275K; AI Inference | Artificial Intelligence | Autoscaling | Cache Management | Continuous batching; Conference attendance | Professional development | Stock options | Training | Workstation provided; Mid-level Full Time; Austin, TX, United States; 13d ago
-
AI Software Engineer; PLN 179K-229K; AWS | Cloud platform | Containers | Google Cloud | Google Cloud Platform; Access to training courses | Employee pension plan | Employee stock programs | Flexible working time | Hybrid work model; Mid-level Full Time; POL - Gdansk, Poland; 16d ago
-
Senior NLP/LLM Engineer; PLN 237K-326K; BERT | DPO | Deep learning | Entity recognition | Fine Tuning; English lessons discount | Health benefits | Professional training reimbursement | Remote work | Vacation; Senior-level Full Time; Worldwide; R; 17d ago
-
Software Engineer (Python, Kubernetes, AI/ML); USD 153K-258K; AI Inference | Autoscaling | Container Orchestration | Docker | GPU scheduling; Extra Paid Sick Leave | Extra paid vacation | Flexible working hours | Language courses | Modern office amenities; Senior-level Full Time; Poland, Serbia, Cyprus, Georgia; R; 18d ago
-
Machine Learning Ops Lead - VP; SGD 165K-191K; AWS | AWS Lambda | AWS SageMaker | Amazon CloudWatch | Amazon ECR; Senior-level Full Time; SGP-Head Office, Singapore; 19d ago
-
Intern, AI Engineering; USD 64K-106K; CUDA | CUDA kernel | CUDA kernel development | Hugging Face | Inference Optimization; Entry-level Internship; San Francisco, California; 20d ago
-
Machine Learning Engineer, Distributed vLLM; USD 136K-287K; API Gateway | Cilium | Distributed Systems | Envoy | GPU Profiling; Paid parental leave | Paid time off | Retirement 401k match | Tuition reimbursement; Mid-level Full Time; Boston, United States; R; 20d ago
-
Senior, ML Engineer - Auto Tagger; USD 177K-212K; AWS | Apache Arrow | Apache Beam | Apache Spark | Cloud platform; 401k match | Company holiday office closures | Company-paid medical, dental & vision | Disability insurance | Flexible schedule; Senior-level Full Time; Ann Arbor, MI, Remote - US; R; 20d ago
-
Product Manager - AI Inference & Model Serving; USD 160K-275K; AI Inference | Autoscaling | Cache Management | Cold Start | Cold Start Optimization; Conference attendance | Professional development and training | Stock options | Workstation provided; Mid-level Full Time; Austin, TX, United States; 20d ago
-
AWQ | AWS | Accelerate | Azure | Batching; Mid-level Full Time; Shenzhen, Guangdong, China; R; 21d ago
-
Staff Machine Learning Engineer, Voice AI; USD 220K-280K; Audio codecs | Audio signal processing | Batching | CUDA | Deep learning; Health insurance | Startup equity; Senior-level Full Time; San Francisco; 21d ago
-
Staff Engineer; USD 191K-239K; AMD GPU | Apache Yunikorn | Autoscaling | Bin packing | CRIU; Conference reimbursement | Education reimbursement | Employee assistance program | Employee stock purchase program | Equity compensation; Senior-level Full Time; Seattle; 22d ago
-
Large Model Infra R&D Intern (Agentic RL Track); CNY 25K-37K; Asynchronous programming | Concurrency | Distributed Systems | Docker | Git; Entry-level Internship; Shenzhen; 22d ago
-
Large Model Infra R&D Intern (Agentic RL Track); CNY 25K-37K; Alerting | Asynchronous programming | Concurrency | Data Retrieval | Data Storage; Entry-level Internship; Shenzhen; 22d ago
-
Large Model Infra R&D Intern (Agentic RL Track); CNY 25K-37K; Alerting | Asynchronous programming | Concurrency | Data pipeline | Distributed Systems; Entry-level Internship; Shenzhen; 22d ago
-
Large Model Infra R&D Intern (Agentic RL Track); CNY 25K-37K; Asynchronous programming | Concurrency | Distributed Systems | Docker | Git; Flexible work schedule | Internship opportunity | Mentorship; Entry-level Internship; Shenzhen; 22d ago
-
Inference Engineer - Acceleration; CHF 110K-160K; Admission control | CUDA | Cutlass | FlashAttention | KV cache; Commuting subsidy | Learning and development budget | Offsites and team events | Pension plan | Vacation days; Mid-level Full Time; Zürich, Switzerland; 22d ago
-
Staff Software Engineer, AI/ML Heterogeneous Systems; TWD 1500K-1900K; C++ | CUDA | KVCaching | LLVM | MLIR; Senior-level Full Time; Hsinchu, Taiwan; 26d ago
-
Senior ML Engineer - Poland; PLN 267K-402K; API Integration | Accelerator systems | Caching | Decode | GPU Computing; Senior-level Full Time; Kraków, Małopolskie, PL; 26d ago
-
Senior-level Full Time; Beijing; 26d ago
-
Senior Product Manager (LLM API Platform); SGD 180K-191K; API Design | API Gateway | API Management | Cost of ownership | Developer experience; Senior-level Full Time; Crimson House Singapore; 27d ago
-
Senior Deep Learning Frameworks CUDA Software Engineer; USD 184K-356K; AI compilers | C++ | CUDA | Distributed machine learning | HPC communication; Senior-level Full Time; US, CA, Santa Clara, United States; 27d ago
-
AI Platform Engineer; INR 1500K-2500K; Automated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batching; Mid-level Full Time; Bangalore, India; 28d ago
-
Software Engineering Manager, LLM Training; USD 170K-277K; CUDA | Containerization | Context Parallelism | Data I/O | Data parallelism; Entry-level Full Time; Mountain View, CA, United States; 28d ago
-
AWS | AWS Lambda | Amazon SageMaker | Cost estimation | Data Science; Flexible work schedule | Foreign business trips | Free English classes | Health insurance | Modern office facilities; Senior-level Full Time; Lviv, Lviv Oblast, Ukraine; 28d ago
-
AI Engineer - Tieto Banktech (m/f/d); NOK 792K-1075K; AWS | Anthropic | Azure | CI/CD | Docker; Autonomy | Collaborative culture | Hybrid working; Mid-level Full Time; Trondheim, Trøndelag, Norway; 29d ago
-
AI Engineer - Tieto Banktech (m/f/d); NOK 792K-1075K; AWS | Anthropic | Azure | CI/CD | Cloud platform; Hybrid working; Mid-level Full Time; Trondheim, Trøndelag, Norway; 29d ago
-
AI Engineer - Tieto Banktech (m/f/d); NOK 792K-1075K; AWS | Agentic Workflows | Anthropic | Azure | CI/CD; Flexible hybrid working; Mid-level Full Time; Fornebu, Akershus, Norway; 29d ago
-
AI Engineer - Tieto Banktech (m/f/d); NOK 792K-1075K; AWS | Agent Frameworks | Azure | CI/CD | Cloud platform; Autonomy | Flexible hybrid working | Knowledge sharing; Mid-level Full Time; Bergen, Vestland, Norway; 29d ago
-
AI Platform Engineer; INR 1500K-2500K; Alerting | CUDA | Cause analysis | Continuous batching | GPU Profiling; Mid-level Full Time; Bangalore, India; 29d ago
-
[Class of 2026 Campus Recruitment] LLM Post-Training Algorithm Engineer (Foundation Model); CNY 240K-480K; Data loading | Distributed Training | Docker | Fine Tuning | Inference Optimization; Entry-level Full Time; Shanghai, Shenzhen; 29d ago
-
AI/ML Engineering Manager; CAD 152K-234K; AWS Bedrock | AWS CDK | AWS CloudFormation | AWS Lambda | AWS SageMaker; Equipment and office stipend | Flexible PTO | Fully remote | Learning and development stipend | Medical insurance; Mid-level Full Time; CANADA; R; 29d ago
-
Solution Architect (AI/LLM Inference); USD 165K-330K; Artificial Intelligence | Benchmarking | Embeddings | GPU Selection | Image Generation; 401k company match | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leave; Senior-level Full Time; San Francisco; 30d ago
-
Staff Forward Deployed Engineer; USD 195K-239K; Artificial Intelligence | Benchmarking | CUDA | CUDA Interconnect | Continuous batching; Employee assistance program | Flexible time off | Hybrid work | LinkedIn Learning | Local Employee Meetups; Senior-level Full Time; Seattle; 1mo ago
-
Staff Forward Deployed Engineer; USD 195K-239K; Artificial Intelligence | Benchmarking | CUDA | Continuous batching | CrewAI; Conference reimbursement | Employee assistance program | Employee stock purchase program | Flexible time off | LinkedIn Learning; Senior-level Full Time; San Francisco; R; 1mo ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill; 401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefits; Mid-level Full Time; San Francisco, CA; 1mo ago
-
Automatic Speech Recognition | DeepSpeed | Distributed Training | FSDP | GPU Memory Optimization; 401k matching | Healthcare Dental Vision | Hybrid work | New parent leave | Office Stocked; Mid-level Full Time; San Francisco, CA; 1mo ago