Find jobs in AI/ML, Data Science and Big Data
55 results
for SGLang
(Skill/Tech stack)
-
AI Engineer - Tieto Banktech (m/f/d) SEK 396K-480KAWS | Anthropic | Azure | CI/CD | DockerAutonomy | Collaborative culture | Hybrid workingMid-level Full TimeSolna, Sweden15h ago
-
AI Software Engineer Intern CNY 28K-50KAWQ | Cache optimization | DINOv2 | DeepSpeed | Diffusion ModelsEntry-level Full Time InternshipCHN - Minhang, China1d ago
-
AI Software Engineer USD 151K-332KC++ | CUDA | CUDA kernels | Continuous batching | FP8Hybrid work | In-person work | Remote work | Work-life balanceMid-level Full TimeSeattle (WA), United States2d ago
-
Mid-level Full Time深圳、上海3d ago
-
Manager, Machine Learning Operations (MLOps) USD 170K-230KAWS | ArgoCD | CI/CD | Cost Optimization | DBT401k match | FSA options | Flexible PTO | Flexible hybrid work schedule | Life insuranceMid-level Full TimeMarina del Rey, CA3d ago
-
Senior Machine Learning Engineer USD 160K-250KCaching | Cloud Platforms | GPU Computing | Kubernetes | LLM InferenceSenior-level Full TimeIsrael, center, IL3d ago
-
AI Engineer INR 2500K-4500KA/B | A/B Testing | AWS | Adapters | AirflowHealth insurance | Learning and development resourcesSenior-level Full TimeNoida, Uttar Pradesh3d ago
-
Applied AI Frameworks Engineer INR 3125K-5000KAttention Mechanisms | C++ | CUDA | Co-design | Computer ArchitectureCross geo collaboration | On-site work | Open source collaborationSenior-level Full TimeIND - Bangalore, India3d ago
-
Applied AI Frameworks Engineer INR 3125K-5000KC++ | CUDA | Compiler optimization | Computer Architecture | ConvolutionOn-site workSenior-level Full TimeIND - Bangalore, India3d ago
-
Applied Research - Evals & Data USD 150K-300KAccelerate | Data Pipelines | Data Versioning | Distributed Systems | Distributed tracingConference attendance | Professional development budget | Relocation support | Remote work | Team offsitesSenior-level Full TimeSan Francisco4d ago
-
Senior-level Full TimeKarachi/Lahore6d ago
-
Senior Deep Learning Algorithms Engineer - BioNeMo USD 180K-333KApplied Machine Learning | C++ | CUDA | CUDA kernels | Deep learningSenior-level Full TimeVietnam, Ho Chi Minh City8d ago
-
CPU performance | Compiler optimization | Data Visualization | Databases | Deep learningSenior-level Full TimeUS, CA, Santa Clara, United States8d ago
-
Data Scientist Lead - LLM (Chatbot) TWD 516K-612KAgent systems | Autogen | Bias detection | CrewAI | Direct Preference OptimizationSenior-level Full TimeTaiwan, Taipei10d ago
-
LLM Algorithm Engineer CNY 38K-50KAPI Development | Agent systems | Attention Mechanisms | Autogen | CUDAEnglish courses | Meal allowance | Online learning access | Onsite gym | Onsite massagesMid-level Full TimeChang Sha Shi, China11d ago
-
AI Computing Development Engineer, TensorRT-LLM CNY 337K-490KArtificial Intelligence | C# | C++ | Debugging | Deep learningSenior-level Full TimeChina, Shanghai13d ago
-
Agent planning | Architecture Design | Deep learning | Graph Neural Networks | Language ModelsEntry-level Full TimeSingapore, Singapore14d ago
-
Graph Neural Networks | Language Models | Language Processing | Large Language Models | Machine LearningEntry-level Full TimeSingapore, Singapore14d ago
-
Forward Deployment Engineer (Inference & RL POC) USD 150K-230KDeepSpeed | Distributed Systems | Fine Tuning | GPU Performance | GPU UtilizationFrequent customer interaction | Hybrid workMid-level Full TimeMountain View, California, United States15d ago
-
Member of technical staff (Inference) - Paris EUR 80K-120KC++ | CUDA | Caching | Continuous batching | Distributed ComputingCareer development | Continuous learning | Hybrid work | Professional growthSenior-level Full TimeParis16d ago
-
Principal Deep Learning Communication Architect USD 272K-431K3D Parallelism | CUDA | Context Parallelism | Data parallelism | DeepSpeedSenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Senior-level Full TimeChina, Shanghai16d ago
-
Senior Software Engineer - AI Inference USD 152K-287KBatching | C++ | CUDA | Concurrency | Distributed SystemsBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
AI Infrastructure Engineer USD 170K-210KC++ | CI/CD | CUDA | Container Orchestration | Datadog401k match | Dental insurance | Health insurance | Paid time off | Remote workMid-level Full TimeUnited States - Remote R16d ago
-
Senior Performance Analyst, Inference USD 175K-260KAttention Mechanism | CUDA | Flash Attention | GPU kernel optimization | KV cacheSenior-level Full TimeSunnyvale, CA17d ago
-
Member of technical staff (Inference) - London GBP 230K-325KC++ | CUDA | CUDA kernel | CUDA kernel programming | CachingContinuous learning | Hybrid work | Professional developmentSenior-level Full TimeLondon20d ago
-
Intern, AI Engineering USD 80K-124KCUDA | Hugging Face | Hugging Face Transformers | Inference Optimization | Language ModelsEmployee benefits | Flexible work environment | Remote work optionsEntry-level InternshipSan Francisco, California20d ago
-
Senior Software Engineer, Machine Learning Inference USD 152K-287KC plus plus | CUDA | Compilers | Deep learning | GPU ProgrammingBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States20d ago
-
DataOps Engineer (AI Platform Engineer) EUR 40K-60KAMD ROCm | CI/CD | Distributed Systems | GPU MIG | GoCompany car | Education allowance | Employee share scheme | Gym access | Health insuranceMid-level Full TimeCyprus21d ago
-
DataOps Engineer (AI Platform Engineer) EUR 40K-66KAMD | CI/CD | Distributed Systems | Fine Tuning | GPUAnnual leave | Career development | Education allowance | Employee share scheme | English lessonsMid-level Full TimeCyprus21d ago
-
Senior Technical VP - AI & Efficient Deep Learning CAD 121K-220KAI chip | AI chip architecture | Artificial Intelligence | Chip architecture | Co-designSenior-level Full TimeMarkham, Ontario, Canada21d ago
-
AI Platform Engineer INR 1500K-2000KAlerting | CUDA | Capacity Planning | Continuous batching | Distributed tracingMid-level Full TimeBangalore, India21d ago
-
ML Research Engineer (Inference) INR 120K-180KC++ | Deep learning | Generative AI | Hugging Face | Hugging Face TransformersEntry-level Full TimeBengaluru, Karnataka, India22d ago
-
Machine Learning Systems Engineer USD 160K-253KAWQ | C# | C++ | CUDA | Distributed TrainingDental insurance | Free meals and snacks | Health insurance | Professional development | Unlimited PTOSenior-level Full TimeMenlo Park, CA22d ago
-
Senior NLP / LLM Engineer USD 150K-200KDeep learning | Experimentation | Fine Tuning | Hugging Face | Hugging Face TransformersBonuses | Conference support | English lessons discount | Health benefits | Professional training supportSenior-level Full TimeBelgrade, RS - Remote R22d ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R23d ago
-
Principal AI/ML Architect BRL 305K-396KAI Governance | AWS | AWS CDK | Agentic Systems | AirflowEquipment and office stipend | Flexible time off | Learning and development stipend | Paid Certification Exams | Paid tools and equipmentSenior-level Full TimeBRAZIL R23d ago
-
Senior Solutions Architect - KV Cache and AI Storage CNY 460K-600KBluefield | CMX | Caching | Cassandra | CephSenior-level Full TimeChina, Beijing23d ago
-
Senior Member of Technical Staff: ML Systems and Infrastructure INR 2500K-4000KArgo Workflows | ArgoCD | CI/CD | CUDA | GitHub ActionsSenior-level Full TimeBangalore, India28d ago
-
Senior Deep Learning Engineer PLN 221K-383KCUDA | Diffusion Models | Docker | Inference Server | PyTorchHybrid workSenior-level Full TimeUK, Remote, United Kingdom R28d ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KBenchmarking | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Local Employee MeetupsSenior-level Full TimeSeattle R29d ago
-
Senior Forward Deployed Engineer (AI/ML) USD 140K-174KArtificial Intelligence | CUDA | Continuous batching | CrewAI | DatabasesConference reimbursement | Employee assistance program | Flexible time off | LinkedIn Learning access | Remote workSenior-level Full TimeSan Francisco R29d ago
-
Container Orchestration | Distributed Systems | GPU Acceleration | Kubernetes | LLM InferenceCareer growth opportunities | Collaborative engineering environment | Global datacenter exposure | Hyper scale environment | Open source contribution opportunitiesEntry-level Full TimeSeattle, Washington, United States30d ago
-
Deep Learning Algorithms Engineer - ACOT USD 152K-287KC++ | CUDA | Diffusion Models | Distributed Training | GPU ArchitectureEntry-level Full TimeVietnam, Ho Chi Minh City30d ago
-
Senior Machine Learning Engineer, Voice AI USD 200K-260KAudio codecs | Audio signal processing | Automatic Speech Recognition | Batching | CUDAHealth insurance | Startup equitySenior-level Full TimeSan Francisco1mo ago
-
AI framework Engineering CNY 240K-480KAlgorithms | Bayesian analysis | C++ | CPU Optimization | Code optimizationSenior-level Full TimeCHN - Minhang, China1mo ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | CUDA profiling | Containerization | Context Parallelism | Data I/OHealth and wellness programs | Hybrid work | Time away from workEntry-level Full TimeMountain View, CA, United States1mo ago
-
AWQ | C++ | CUDA | CUDA kernels | Continuous batchingSenior-level Full TimeDonostia, Spain1mo ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | Containerization | Data parallelism | Distributed Systems | DockerFlexible-hybrid work | Health and wellness programs | Time offEntry-level Full TimeMountain View, CA, United States1mo ago
-
Senior-level Full TimeMontreal1mo ago