Find jobs in AI/ML, Data Science and Big Data
321 results
for VLLM
(Skill/Tech stack)
-
Intern, AI Engineering USD 64K-106KCUDA | CUDA kernel | CUDA kernel development | Hugging Face | Inference OptimizationEntry-level InternshipSan Francisco, California18h ago
-
Senior, ML Engineer - Auto Tagger USD 177K-212KAWS | Apache Arrow | Apache Beam | Apache Spark | Cloud platform401k match | Company holiday office closures | Company-paid medical, dental & vision | Disability insurance | Flexible scheduleSenior-level Full TimeAnn Arbor, MI, Remote - US R22h ago
-
Product Manager - AI Inference & Model Serving USD 160K-275KAI Inference | Autoscaling | Cache Management | Cold Start | Cold Start OptimizationConference attendance | Professional development and training | Stock options | Workstation providedMid-level Full TimeAustin, TX, United States23h ago
-
Engineering Manager, Model Inference USD 220K-270KAPIs | Attention Mechanism | Batching | Distributed Systems | Docker401k matching | Commuter benefits | Flexible PTO | Flexible spending accounts | Generous time offMid-level Full TimeSF Office23h ago
-
Senior AI Engineer USD 160K-250KAPI Design | Agent Orchestration | Agent systems | Audit Logging | Authentication401k eligibility | Flexible work environment | Hybrid work option | Paid time off | Parental leave eligibilitySenior-level Full TimeUnited States (Remote) R1d ago
-
AI Safety | Agents | Cloud Native | Containerization | Distributed SystemsFlexible workSenior-level Full TimePoland R1d ago
-
Consultant Technique Confirmé AI (H/F) EUR 45K-60KAWS | Airflow | Azure | Azure DevOps | Azure OpenAIMid-level Full TimeSAINT OUEN, France1d ago
-
API Integration | CI/CD | Computer Vision | Dataset Preparation | Deep learningHybrid work | IT code required | Long-term engagement | NDA requiredSenior-level Full TimeAntwerp, Belgium1d ago
-
IN_Senior Associate_ AI/ML Engineer_D&A_Advisory_Bangalore INR 2500K-4000KA/B | A/B Testing | AWS | Azure | B testingFlexibility programs | Hybrid work | Inclusive benefits | Mentorship | Wellbeing supportSenior-level Full TimeBengaluru Millenia, India1d ago
-
Senior, Data Scientist (Machine Learning Engineer) USD 110K-220KAccessibility guidelines | Airflow | CI/CD | Computer Vision | Container OrchestrationSenior-level Full Time(USA) Crossman Respect Building CA SUNNYVALE …1d ago
-
Senior AI Software Engineer – Agentic AI System USD 170K-275KAI Agents | Ansible | C++ | CI/CD | Chart.jsSenior-level Full TimeUSA - CA - Santa Clara, …1d ago
-
Sr GenAI Infra Specialist SA, AWS WWSO Startup USD 153K-228KAWS | Amazon EC2 | Amazon EKS | Amazon S3 | Cache optimizationInclusive team culture | Mentorship and career growth | Work-life balanceSenior-level Full TimeNew York, New York, USA1d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | CPU Profiling | Continuous batching | Cutlass | Deep Learning ProfilingBenefits | Career growth potential | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
Inference Intern - Spring 2027 USD 60K-142KC++ | Compilers | Consensus Protocols | Consistency models | Distributed SystemsDaily Dinner | Daily lunch | Direct mentorship | Housing support | Paid internshipEntry-level InternshipSan Jose1d ago
-
Staff Machine Learning Engineer, Voice AI USD 220K-280KAudio codecs | Audio signal processing | Batching | CUDA | Deep learningHealth insurance | Startup equitySenior-level Full TimeSan Francisco2d ago
-
Mid-level Full TimeSAINT OUEN, France2d ago
-
[Job - 29399] AI Solutions Architect, Brazil BRL 230K-270K.NET | Amazon Bedrock | Amazon SageMaker | Apache Spark | Azure OpenAIChildcare assistance | Continuous learning platform | Dental insurance | Discount club | Extended paternity leaveSenior-level Full TimeBrazil2d ago
-
Multimodal AI Engineer, Document Understanding USD 180K-250KBenchmarking | Computer Vision | Data Pipelines | Distillation | Distributed SystemsAccess to compute resources and research tools | Catered lunch and snacks | Conference budget | Hybrid work options | Medical, dental, and vision coverageSenior-level Full TimeSan Francisco2d ago
-
Forward Deployment Engineer - Gen AI USD 162K-224KAWS Bedrock | AWS SageMaker | Autogen | Azure OpenAI | ChromaCareer development opportunities | Individual responsibility | Travel to client sitesMid-level Full TimeNew York, New York, United States2d ago
-
A/B | A/B Testing | AWS | Artificial Intelligence | AzureFlexibility programmes | Hybrid work environment | MentorshipSenior-level Full TimeMumbai Goregaon, India2d ago
-
A/B | A/B Testing | AWS | Apache Airflow | Artificial IntelligenceFlexibility programmes | Hybrid work environment | Inclusive benefits | Mentorship | Wellbeing supportSenior-level Full TimeMumbai Goregaon, India2d ago
-
Staff AI Platform Engineer - Abu Dhabi USD 139K-300KAlerting | Azure | CI/CD | Distributed tracing | DockerSenior-level Full TimeAmman, Amman Governorate, Jordan2d ago
-
Apache TVM | C++ | CUDA | CuTile | FlashAttentionEmployee benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States2d ago
-
Staff Engineer USD 191K-239KAMD GPU | Apache Yunikorn | Autoscaling | Bin packing | CRIUConference reimbursement | Education reimbursement | Employee assistance program | Employee stock purchase program | Equity compensationSenior-level Full TimeSeattle3d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitEntry-level Internship深圳3d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data Retrieval | Data StorageEntry-level Internship深圳3d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAlerting | Asynchronous programming | Concurrency | Data pipeline | Distributed SystemsEntry-level Internship深圳3d ago
-
大模型 Infra 研发实习生(Agentic RL 方向) CNY 25K-37KAsynchronous programming | Concurrency | Distributed Systems | Docker | GitFlexible work schedule | Internship opportunity | MentorshipEntry-level Internship深圳3d ago
-
Entry-level Full Time北京、上海3d ago
-
AGI 服务端资深工程师-Talkie&星野 CNY 180K-300KData Engineering | Dify | Distributed Systems | Go | Inference OptimizationMid-level Full Time北京、上海3d ago
-
AI Platform Engineer, Training and Inference USD 150K-225KANN indexing | BF16 | DDP | Embeddings | FP8Career growth | Learning opportunitiesSenior-level Full TimeSan Francisco3d ago
-
Senior-level Full TimeLondon3d ago
-
Inference Engineer - Acceleration CHF 110K-160KAdmission control | CUDA | Cutlass | FlashAttention | KV cacheCommuting subsidy | Learning and development budget | Offsites and team events | Pension plan | Vacation daysMid-level Full TimeZürich, Switzerland3d ago
-
2026 - Senior LLM Researcher - Contractor EUR 62K-93KAgent systems | Deep learning | Hugging Face | Language Models | Language ProcessingAnnual leave | Training and developmentSenior-level ContractDublin, Ireland3d ago
-
Senior-level Full TimeSingapore3d ago
-
AI SW Stack Deployment Architect INR 2500K-4500KAPI Design | Cloud Computing | Distributed Systems | Edge Computing | Inference ServerSenior-level Full TimeBengaluru, KA, India3d ago
-
Full Stack AI Engineer (Contract) SGD 140K-180KAngular | Audit Logging | Authentication | Authorization | CachingContract employment | Medical declaration assessmentSenior-level Contract Full TimeMAS: MAS Building, Singapore3d ago
-
Entry-level Full TimeMilano (Bassi), Italy3d ago
-
Forward Deployed Engineer (Generative AI) USD 153K-222KAWS Bedrock | Amazon SageMaker | Autogen | Azure OpenAI | ChromaCareer development | High individual responsibility | Travel to client sitesSenior-level Full TimeUnited States - Remote R3d ago
-
Applied AI Engineer USD 99K-225KAWS | AgentOps | Azure | ChromaDB | Continued Pretraining401k retirement plan | Bike storage | Commuter benefits | Dependent care FSA | Desk setup stipendMid-level Full TimeWashington DC4d ago
-
Senior-level Full TimeWuxi, Jiangsu, China4d ago
-
Senior-level Full TimeTel Aviv-Yafo, Tel Aviv District, IL4d ago
-
Software Engineer, Inference - Multi Modal USD 295K-555KDistributed Systems | GPU | High Throughput | Inference | Language ModelsEntry-level Full TimeSan Francisco4d ago
-
AI Performance Optimization Engineer USD 159K-264KC++ | Continuous batching | Cutlass | Deep learning | DeepSpeedRemote workMid-level Full TimeUnited States - Remote R4d ago
-
Research Engineer, ML Systems (All Industry Levels) USD 225K-400KCUDA | CUDA kernels | Cloud | Cutlass | DeepSpeedMid-level Full TimeRedwood City, CA6d ago
-
Machine Learning Engineer Intern USD 60K-110KChroma | Data Transformation | Data extraction | Data loading | ETL401k plan | Free lunch | Unlimited snacks and beveragesEntry-level InternshipSanta Clara, CA6d ago
-
Senior-level Full TimeChina Shanghai6d ago
-
AI Engineer INR 2000K-4500KAI SDK | AWS | Accelerate | Autogen | AzureDisconnect week | Flexible working hours | Health insurance | Learning budget | MacBookSenior-level Full TimeIndia - Remote R6d ago
-
Staff Software Engineer, AI/ML Heterogeneous Systems TWD 1500K-1900KC++ | CUDA | KVCaching | LLVM | MLIRSenior-level Full TimeHsinchu, Taiwan6d ago
-
IN_Manager_GEN AI_Data and Analytics_Advisory_Bangalore INR 1500K-2500KAI Studio | AWS Bedrock | Amazon SageMaker | Aurora | Azure AIFlexible work programs | Inclusive benefits | Mentorship | Wellbeing supportMid-level Full TimeBengaluru Millenia, India6d ago