Find jobs in AI/ML, Data Science and Big Data
83 results
for SGLang
(Skill/Tech stack)
-
Entry-level Full Time北京、广州、上海17h ago
-
AI Solution Architect, AI 解決方案架構師 (內湖瑞光) TWD 310K-480KAI Agent | AI Foundry | AI Search | API Gateway | AWS BedrockSenior-level Full TimeTaipei Neihu, Taiwan R1d ago
-
Software Dev Engineer II, Stores Foundational AI -SFAI USD 165K-223KAsync Rollouts | Batching | C++ | CUDA | Cluster computing401k matching | Adoption reimbursement | Dental insurance | Employee assistance program | Flexible spending accountsMid-level Full TimePalo Alto, California, USA1d ago
-
Software Dev Engineer II, Stores Foundational AI -SFAI USD 143K-194KCUDA | Data Pipelines | Distributed Training | Dynamo | Experiment tracking401k matching | Employee assistance program | Health insurance | Paid time off | Parental leaveMid-level Full TimeSeattle, Washington, USA1d ago
-
Senior-level Full TimeAbu Dhabi2d ago
-
Senior Forward Deployed Engineer II (AI/ML) INR 1800K-3500KAgents SDK | CUDA | Cache optimization | Continuous batching | CrewAIMid-level Full TimeBengaluru3d ago
-
Senior Forward Deployed Engineer I (AI/ML) INR 3000K-4800KAgents SDK | CUDA | Continuous batching | CrewAI | Data CompressionHybrid work | Travel up to 30%Senior-level Full TimeBengaluru3d ago
-
Mid-level Full TimeSingapore4d ago
-
R&D Engineer CNY 28K-50KAI Agents | Control Systems | Deep learning | Industrial Control Systems | Industrial controlEntry-level Full TimeShanghai, Shanghai, China4d ago
-
Machine Learning Engineer USD 128K-260KArtificial Intelligence | Cache optimization | Inference | KV cache | KV cache optimizationSenior-level Full TimeSanta Clara, California, United States4d ago
-
AI Software Engineer USD 151K-332KC++ | CUDA | CUDA kernels | CUDA profiling | Cache ManagementCommunity involvement | Health benefits | Hybrid work options | In-person work options | Remote work optionsMid-level Full TimeSeattle (WA), United States5d ago
-
Senior-level Full TimeBangkok, Bangkok, Thailand6d ago
-
MLOps Engineer (LLM/GenAI) GBP 27K-27KAWS | Accelerate | Azure | Batching | CUDAContributory pension scheme | Enhanced maternity and adoption pay | Private healthcare | Tailored professional development opportunities | Work accommodations supportEntry-level Full TimeSheffield, United Kingdom R6d ago
-
AI Engineer (AI Products) SGD 12K-60KAPI Development | CI/CD | Checkpointing | Cloud Platforms | Compute OptimizationMid-level Full TimeNTU Main Campus, Singapore7d ago
-
Engineering Manager, LLM Performance USD 224K-431KAPI Development | C++ | CUDA | Distributed Systems | GPU ArchitectureEquity | Health benefits | Hybrid workMid-level Full TimeUS, CA, Santa Clara, United States7d ago
-
Engineering Manager, LLM Performance USD 224K-431KAPI Development | C++ | CUDA | GPU Architecture | LLM InferenceMid-level Full TimeUS, CA, Santa Clara7d ago
-
Senior-level Full Time上海、北京7d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GRPOEntry-level Internship深圳7d ago
-
Senior Deep Learning Framework Communications Engineer USD 152K-287KC++ | CUDA | CUDA kernels | CuTe | Distributed TrainingBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara R10d ago
-
AI运维工程师(大模型推理 / AI Infra) CNY 180K-300KAlerting | Automation | Docker | GPU Acceleration | High AvailabilityEntry-level Full Time深圳13d ago
-
AI Engineer EUR 60K-80KAWQ | AWS | Agent SDK | CI/CD | CUDACareer growth opportunities | Permanent employment | Remote work optionMid-level Full TimeRemote - Paris, France R13d ago
-
Senior SW Engineer – AI Infrastructure & Optimization PLN 280K-383KCUDA | Cloud Platforms | Distributed Systems | GPU Performance | GPU Performance OptimizationSenior-level Full TimeKraków, Małopolskie, PL14d ago
-
Director, Engineering - Inference Serving Engine INR 1500K-6000KAuto Scaling | Benchmarking | CRIU | CUDA | Checkpoint RestoreEmployee assistance program | Flexible time off | LinkedIn Learning access | Local Employee Meetups | Training and education reimbursementExecutive-level Full TimeBengaluru14d ago
-
Director, Engineering - Forward Deployed Engineering INR 1500K-5199KAI infrastructure | AI orchestration | Agentic Systems | Agents SDK | AutomationConference reimbursement | Employee assistance program | Employee stock purchase program | Flexible time off | LinkedIn Learning accessExecutive-level Full TimeBengaluru14d ago
-
Senior Machine Learning Engineer USD 188K-282KAdversarial Training | Calibration monitoring | Continuous batching | DPO | Deep learningSenior-level Full TimePalo Alto, CA14d ago
-
Senior-level Full TimeTel Aviv-Yafo, Tel Aviv, ISR15d ago
-
Senior Inference Engineer, AIConfigurator for Dynamo USD 184K-356KBatching | Distributed Systems | Expert parallelism | GPU Computing | High PerformanceEquity | Health benefits | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States18d ago
-
Senior SW Engineer – AI Infrastructure & Optimization USD 184K-300KCUDA | Cloud Platforms | GPU Performance | GPU Performance Optimization | Gateway APISenior-level Full TimeIsrael, center, IL18d ago
-
Applied Scientist II, Sponsored Products USD 172K-223KAWS Neuron | Algorithms | C plus plus | Data Mining | Data Structures401k matching | Dental insurance | EAP | Health insurance | Mental health supportMid-level Full TimeNew York, New York, USA19d ago
-
AI Engineer USD 100K-135KAWQ | AWS | AWS EC2 | Agent Frameworks | CI/CD401k match | Health insurance | Learning and development stipend | Paid parental leave | Paid time offMid-level Full TimeRemote USA - In Tandem R19d ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States19d ago
-
C# | C++ | Computer Vision | Debugging | Deep learningSenior-level Full TimeChina, Shanghai20d ago
-
Intern Engineer – RL Post-Training for LLMs CAD 58K-104KData Generation | Deep learning | DeepSpeed | Distributed Training | GRPOInternshipEntry-level InternshipVancouver, British Columbia, Canada21d ago
-
Inference Intern USD 60K-142KC++ | Collective communication | Compilers | Consensus Protocols | Consistency modelsDaily meals | Direct mentorship | Housing support | Paid internshipEntry-level InternshipSan Jose23d ago
-
AWQ | AWS | Batching | CPU architecture | CUDASenior-level Full TimeGuangzhou, Guangdong, China25d ago
-
Inference Engineer USD 180K-250KCUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning401k | Commuter allowance | Dental insurance | Flexible PTO | Health insuranceMid-level Full Time*HQ - San Francisco, CA R25d ago
-
Artificial Intelligence | Attention Mechanisms | Benchmarking | C++ | GEMMEntry-level Full Time InternshipChina, Beijing26d ago
-
Senior Machine Learning Engineer (Inference Platform) USD 175K-225KAWS | Alerting | CI/CD | Continuous batching | Data ProcessingSenior-level Full TimeRemote - USA R26d ago
-
Senior-level Full TimeUS, CA, Remote, United States R28d ago
-
Mid-level Full Time北京28d ago
-
Senior Solutions Architect - Generative AI INR 2475K-4500KArgo | CI/CD | CUDA | Evaluation | FedRAMPSenior-level Full TimeIndia, Pune1mo ago
-
Engineering Manager, Inference Benchmarking — AI Perf USD 224K-356KDCGM | Distributed Systems | GPU Telemetry | GPU observability | HelmSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Product Manager - AI Inference & Model Serving USD 165K-275KAI Inference | Artificial Intelligence | Autoscaling | Cache Management | Continuous batchingConference attendance | Professional development | Stock options | Training | Workstation providedMid-level Full TimeAustin, TX, United States1mo ago
-
Senior NLP/LLM Engineer PLN 237K-326KBERT | DPO | Deep learning | Entity recognition | Fine TuningEnglish lessons discount | Health benefits | Professional training reimbursement | Remote work | VacationSenior-level Full TimeWorldwide R1mo ago
-
Software Engineer (Python, Kubernetes, AI/ML) USD 153K-258KAI Inference | Autoscaling | Container Orchestration | Docker | GPU schedulingExtra Paid Sick Leave | Extra paid vacation | Flexible working hours | Language courses | Modern office amenitiesSenior-level Full TimePoland, Serbia, Cyprus, Georgia R1mo ago
-
Machine Learning Ops Lead - VP SGD 165K-191KAWS | AWS Lambda | AWS SageMaker | Amazon CloudWatch | Amazon ECRSenior-level Full TimeSGP-Head Office, Singapore1mo ago
-
Intern, AI Engineering USD 64K-106KCUDA | CUDA kernel | CUDA kernel development | Hugging Face | Inference OptimizationEntry-level InternshipSan Francisco, California1mo ago
-
Senior, ML Engineer - Auto Tagger USD 177K-212KAWS | Apache Arrow | Apache Beam | Apache Spark | Cloud platform401k match | Company holiday office closures | Company-paid medical, dental & vision | Disability insurance | Flexible scheduleSenior-level Full TimeAnn Arbor, MI, Remote - US R1mo ago
-
Product Manager - AI Inference & Model Serving USD 160K-275KAI Inference | Autoscaling | Cache Management | Cold Start | Cold Start OptimizationConference attendance | Professional development and training | Stock options | Workstation providedMid-level Full TimeAustin, TX, United States1mo ago
-
AWQ | AWS | Accelerate | Azure | BatchingMid-level Full TimeShenzhen, Guangdong, China R1mo ago