Find jobs in AI/ML, Data Science and Big Data
90 results
for LLM Inference
(Skill/Tech stack)
-
Ansible | Autogen | Bash | CI/CD | CloudWatchChild education allowance | Employee stock purchase scheme | Health reimbursement | Life insurance | Medical insuranceSenior-level Full TimeRiyadh, Saudi Arabia1d ago
-
Sr AI Engineer - Agentic Systems USD 166K-205KAI Safety | API Integration | Agent Orchestration | Artificial Intelligence | Distributed SystemsSenior-level Full TimeAnywhere, US R3d ago
-
Senior Product Manager - Agent Integrations USD 192K-240KAI Agents | Backlog Management | BentoML | Customer discovery | Data AnalysisContinuous professional development | Inclusive community culture | Mental health benefits | Mentor/Buddy program | Stock equitySenior-level Full TimeNew York, New York, USA3d ago
-
Principal Engineer, Python fullstack (React+Genai) INR 3000K-5000KAPI Integration | AWS | Agent Orchestration | Azure | ComplianceSenior-level Full TimeRemote, India R3d ago
-
Senior-level Full TimeMcLean, VA, United States4d ago
-
AWS ECS | AWS Lambda | AWS RDS | AWS S3 | AWS SQSE3 visa sponsorship | Healthcare stipend | Meal stipend | Onsite five day collaboration | Paid on call compensationSenior-level Full TimeSan Francisco, CA; Onsite4d ago
-
Senior-level Full TimeUnited States - Remote R4d ago
-
AWS | AWS Bedrock | Amazon RDS | Amazon SageMaker | AuroraFlexibility programs | Inclusive benefits | Mentorship | Wellbeing supportSenior-level Full TimeBengaluru Millenia, India4d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | BatchingRemote workSenior-level Full TimeUnited States - Remote R5d ago
-
Lead Machine Learning Engineer (Manager IC) USD 179K-225KAgentic AI | Apache Spark | Dask | Data Pipelines | Distributed ComputingHealth insurance | Paid time off | Performance bonuses | Retirement benefitsSenior-level Full TimeMcLean, VA, United States5d ago
-
AI Computing Software Development Engineer, TensorRT-LLM TWD 1500K-2000KC++ | CUDA | Debugging | Huggingface | LLM InferenceSenior-level Full TimeTaiwan, Taipei6d ago
-
C++ | CUDA | Cluster scheduling | Compute scheduling | Deep learningSenior-level Full TimeIsrael, Tel Aviv6d ago
-
Principal Agentic AI Engineering Lead, Managing Director INR 3584K-5500KAIOps | AWS Cloud | AWS Cloud Development Kit | AWS Lambda | AWS Step FunctionsEmployee networks | Flexible work/life support | Inclusive development opportunities | Paid volunteer daysSenior-level Full TimeBangalore, India6d ago
-
A/B | A/B Testing | AI Pipelines | Agent Orchestration | Agent planningFlexible work programs | Inclusive benefits | Mentorship | Wellbeing supportSenior-level Full TimeBengaluru Millenia, India6d ago
-
A/B | A/B Testing | AI Pipelines | Agent systems | AirflowFlexibility programmes | Inclusive benefits | Mentorship | Wellbeing supportSenior-level Full TimeBengaluru Millenia, India6d ago
-
Senior. Distinguished AI Engineer (Agentic AI Platform) USD 314K-392KAWS | Amazon SageMaker | Audit APIs | Autogen | AzureSenior-level Full TimeSan Francisco, CA, United States6d ago
-
Distinguished AI Engineer (Agentic AI Platform) USD 269K-335KAPIs | AWS | Autogen | Azure | Azure Machine LearningSenior-level Full TimeSan Francisco, CA, United States7d ago
-
Engineering Manager, LLM Performance USD 224K-431KAPI Development | C++ | CUDA | GPU Architecture | LLM InferenceMid-level Full TimeUS, CA, Santa Clara7d ago
-
Mid-level Full TimeMcLean, VA, United States8d ago
-
AWS | Azure | C# | C++ | ExperimentationSenior-level Full TimeNew York, NY, United States8d ago
-
Mid/Senior Solution Architect GBP 80K-110KAmazon Web Services | Azure Machine Learning | CI/CD | Cloud platform | DockerDiscounted lunch | Educational budget | Flexible working hours | Hybrid work | Language classesSenior-level Full TimeLondon, United Kingdom10d ago
-
CUDA | CUDNN | Cutlass | Deep learning | GPU ArchitectureMid-level Full TimeUS-WA-Bellevue13d ago
-
Director, Engineering - Inference Serving Engine INR 1500K-6000KAuto Scaling | Benchmarking | CRIU | CUDA | Checkpoint RestoreEmployee assistance program | Flexible time off | LinkedIn Learning access | Local Employee Meetups | Training and education reimbursementExecutive-level Full TimeBengaluru13d ago
-
Principal Engineer - Agentic AI Architect CNY 240K-375KAI Deployment | API Design | ASIC | Agent systems | Agentic AISenior-level Full TimeShanghai, China14d ago
-
Attention Mechanism | Cursor | DSP | Diffusion Models | Edge inferenceSenior-level Full TimeSan Diego, California, United States of …15d ago
-
Staff Software Engineer, Inference GBP 325K-390KAWS | Batching | Caching | Distributed Systems | GCPFlexible work environment | Flexible working hours | Generous vacation | Parental leaveSenior-level Full TimeLondon, UK17d ago
-
Sr. Software Engineer, Inference GBP 225K-325KAWS | Batching | Caching | Distributed Systems | GCPFlexible working hours | Generous vacation | Parental leave | Visa sponsorshipSenior-level Full TimeLondon, UK17d ago
-
Senior Lead AI Engineer (GenAI Platform Services) USD 229K-286KAI Governance | AWS | Azure | Experimentation | GoSenior-level Full TimeSan Jose, CA, United States18d ago
-
AI Algorithms | APIs | Accelerated computing | Agentic AI | BenchmarksContinuous learning culture | Cross-functional collaboration | Team collaboration | Travel opportunitiesSenior-level Full TimeChina, Beijing18d ago
-
Senior Inference Engineer, AIConfigurator for Dynamo USD 184K-356KBatching | Distributed Systems | Expert parallelism | GPU Computing | High PerformanceEquity | Health benefits | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States18d ago
-
AI Governance | AWS | AWS Ultraclusters | Artificial Intelligence | C#Health and wellness benefits | Incentive compensationSenior-level Full TimeSan Jose, CA, United States19d ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States19d ago
-
Senior-level Full TimeNew York, NY, United States20d ago
-
Mid-level Full TimeBucharest, Romania R20d ago
-
Lead AI Engineer (Vision model customization, VLM) USD 197K-245KAWS | Azure | C# | C++ | ExperimentationDrug-free workplace | Health benefits | Inclusive work environment | Long-term incentives | Performance bonusesSenior-level Full TimeNew York, NY, United States21d ago
-
AI Infrastructure Engineer CHF 128K-192KAgentgateway | Ansible | Apache Kafka | C++ | Cloud Native24x7 on-call rotationSenior-level Full TimeGland, VD, Switzerland21d ago
-
Senior Deep Learning Solution Architect CNY 240K-480KAccelerated computing | Computer Systems | Data Structures | Deep learning | Distributed TrainingSenior-level Full TimeChina, Beijing22d ago
-
AWS | Artificial Intelligence | Cloud Computing | Experimentation | GoSenior-level Full TimeNew York, NY, United States22d ago
-
AWS | Azure | C# | C++ | ExperimentationSenior-level Full TimeNew York, NY, United States22d ago
-
Agent systems | Artificial Intelligence | Deep learning | Inference Optimization | KV cacheEntry-level Full TimeSeoul26d ago
-
Research Machine Learning Scientist CAD 140K-250KAWS | Azure | Cloud Computing | Deep learning | GPU ComputingCareer development | Health and well-being benefits | Mentoring programs | Online learning platform | Paid time offEntry-level Full Time661 University Avenue, Toronto, Ontario, Canada26d ago
-
Account CTO USD 271K-478KAI machine learning | Cloud infrastructure | Digital Transformation | Distributed File System | Enterprise Architecture401k plan | Commuter stipend | Dental insurance | Flexible paid time off | Health insuranceSenior-level Full TimeRemote, USA R26d ago
-
Director - AI Engineering - Agentic AI USD 144K-256KAWS | Airflow | Async Processing | Distributed Systems | EmbeddingsExecutive-level Full TimeNew York, NY, United States26d ago
-
Senior-level Full TimeUS, CA, Remote, United States R28d ago
-
AWS Bedrock | Agentic AI | Application code | Azure | Bias VarianceSenior-level Full TimeMcLean, VA, United States28d ago
-
AI Engineer - Tieto Banktech (m/f/d) SEK 480K-600KAgentic Workflows | Amazon Web Services | Artificial Intelligence | Azure | CI/CDHybrid workingMid-level Full TimeSolna, Sweden28d ago
-
Senior-level Full TimeNew York, NY, United States29d ago
-
Senior-level Full TimeNew York, NY, United States29d ago
-
Senior Data Scientist, Growth CAD 140K-175KA/B | A/B Testing | AWS | Azure | B testingCareer growth and development | Discounted fitness membership | Employee resource groups | Generous vacation and PTO | Health & dental benefitsSenior-level Full TimeToronto, Ontario, Canada1mo ago
-
Agent Frameworks | Deep learning | Distributed Systems | Fine Tuning | LLM InferenceEquity package | High-impact work | Hybrid schedule | Remote work optionSenior-level Full TimeRemote; New York, New York; Onsite R1mo ago