Find jobs in AI/ML, Data Science and Big Data
21 results for LLM serving (Skill/Tech stack)
-
Benchmarking | C++ | Docker | FP16 | INT8
Fully remote work | Indefinite contract | Long-term engagement | Occasional on-site visits
Senior-level | Full Time | Brazil (Remote) | 3d ago
-
AI Platform Engineer | EUR 42K-66K
AI Enterprise | Ansible | ClearML | Container Orchestration | FluxCD
Access to AI Lab | Certification support | Claude subscription | Hybrid work model | Onboarding program
Mid-level | Full Time | Beverwijk, Noord-Holland, Netherlands | 3d ago
-
Manager, Machine Learning Operations (MLOps) | USD 170K-230K
AWS | ArgoCD | CI/CD | Cost Optimization | DBT
401k match | FSA options | Flexible PTO | Flexible hybrid work schedule | Life insurance
Mid-level | Full Time | Marina del Rey, CA | 3d ago
-
AI Platform Engineer #AIDA | SGD 60K-60K
API Gateway | Active Directory | Agent Orchestration | Backup Systems | CI/CD
Mid-level | Full Time | Singapore, Singapore | 4d ago
-
Software Engineer II - Machine Learning (B3617) | CAD 81K-115K
AIOps | AWS | Azure | Azure Data | Azure Data Factory
Career development | Skill development | Training and onboarding
Mid-level | Full Time | 661 University Avenue, Toronto, Ontario, Canada | 4d ago
-
Director of AI Engineering | USD 198K-250K
A/B | A/B Testing | API Versioning | AWS | Agent Orchestration
401k match | Annual company retreats | Medical, dental, vision benefits | Paid time off | Promote from within
Executive-level | Full Time | United States - Remote | 4d ago
-
IN_Senior Associate_GEN AI_Data and Analytics_Advisory_Noida | INR 2520K-4000K
AWS Bedrock | Amazon Aurora | Amazon DynamoDB | Amazon RDS | Amazon SageMaker
Senior-level | Full Time | Noida, India | 7d ago
-
Agent systems | Backtesting | Data pipeline | Fine Tuning | Inference Optimization
Senior-level | Full Time | Los Angeles, California, United States | 8d ago
-
AWS Bedrock | Agile | Amazon Aurora | Amazon RDS | Amazon SageMaker
Flexibility programmes | Inclusive benefits | Mentorship | Wellbeing support
Senior-level | Full Time | Bengaluru Millenia, India | 10d ago
-
Principal AI Platform Architect | USD 185K-220K
API Development | Access Management | Agent Orchestration | Batch Processing | Cloud Architecture
Senior-level | Full Time | Foster City, United States | 11d ago
-
Applied AI ML Director - AGENT BUILDER PLATFORM | USD 140K-195K
A/B | A/B Testing | API Design | AWS Bedrock | AWS SageMaker
Backup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centers
Executive-level | Full Time | Palo Alto, CA, United States | 13d ago
-
Staff Engineer - AI Development | INR 2400K-3880K
Automation | Benchmarking | C plus plus | CPU GPU data movement | Cache Management
Senior-level | Full Time | Pune, MH, India | 16d ago
-
Senior Software Engineer, AI Inference | CAD 135K-220K
C++ | Chunked prefill | Continuous batching | Cutlass | Docker
Senior-level | Full Time | Canada, Toronto | 21d ago
-
Staff AI Engineer | USD 175K-250K
Agent systems | Backtesting | Data Pipelines | Distributed Systems | Fine Tuning
Bonus eligibility | Equity | Performance incentives | Remote work | Token participation
Senior-level | Full Time | New York, United States - Remote | 24d ago
-
Causal Inference | Data Analysis | Deep learning | Distributed Training | Experiment design
Entry-level | Full Time | Seoul, South Korea | 25d ago
-
Senior GenAI Engineer | GBP 84K-110K
A/B | A/B Testing | B testing | CI/CD | Cloud infrastructure
Growth opportunities | Hybrid work | Knowledge sharing | Learning and development
Senior-level | Full Time | Manchester, United Kingdom | 28d ago
-
Senior Machine Learning Engineer, Voice AI | USD 200K-260K
Audio codecs | Audio signal processing | Automatic Speech Recognition | Batching | CUDA
Health insurance | Startup equity
Senior-level | Full Time | San Francisco | 1mo ago
-
Engineering Manager, Inference ML Runtime | USD 180K-250K
C++ | Cloud infrastructure | Deep learning | Distributed Systems | High Performance
Mid-level | Full Time | Sunnyvale CA or Toronto Canada | 1mo ago
-
Sr. Staff Software Engineer, AI Infra | USD 198K-326K
C++ | CUDA | DeepSpeed | Distributed Training | GNN
Senior-level | Full Time | Mountain View, CA, United States | 1mo ago
-
Senior Data Scientist (GenAI) | EUR 60K-78K
Agentic Architectures | Apache Spark | Azure | Data Analysis | Data collection
Commuting expenses | Continuous training | Family support allowance | Health and life insurance for family | Health insurance
Senior-level | Full Time | Lisbon, Portugal | 1mo ago
-
Software Engineer, Inference Platform | USD 200K-250K
CUDA | Distributed Systems | Expert parallelism | GPU Compute | GPU Optimization
Dental insurance | Equity | Health insurance | PTO policy | Retirement plan
Mid-level | Full Time | San Francisco, CA | 1mo ago