Find jobs in AI/ML, Data Science and Big Data
82 results
for TensorRT-LLM
(Skill/Tech stack)
-
Staff Machine Learning Engineer, Voice AI USD 220K-280KAudio codecs | Audio signal processing | Batching | CUDA | Deep learningHealth insurance | Startup equitySenior-level Full TimeSan Francisco1d ago
-
Staff AI Platform Engineer - Abu Dhabi USD 139K-300KAlerting | Azure | CI/CD | Distributed tracing | DockerSenior-level Full TimeAmman, Amman Governorate, Jordan2d ago
-
Inference Engineer - Acceleration CHF 110K-160KAdmission control | CUDA | Cutlass | FlashAttention | KV cacheCommuting subsidy | Learning and development budget | Offsites and team events | Pension plan | Vacation daysMid-level Full TimeZürich, Switzerland2d ago
-
Software Engineer, Inference - Multi Modal USD 295K-555KDistributed Systems | GPU | High Throughput | Inference | Language ModelsEntry-level Full TimeSan Francisco3d ago
-
AI Performance Optimization Engineer USD 136K-258KC++ | Cache optimization | Continuous batching | Cutlass | Deep learningMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 159K-264KC++ | Continuous batching | Cutlass | Deep learning | DeepSpeedRemote workMid-level Full TimeUnited States - Remote R4d ago
-
Mid-level Full TimeSeattle (WA), United States6d ago
-
AWS | AWS Bedrock | AWS CDK | AWS CodeBuild | AWS CodePipeline401k | Healthcare coverage | PTO | Phone stipend | Wellness stipendSenior-level Full TimeSan Carlos - Hybrid R6d ago
-
Senior-level Full Time北京6d ago
-
Machine Learning Engineer, Distributed vLLM USD 136K-225KAPI Gateway | Cilium | Envoy | GPU Profiling | GRPCFlexible spending account | Health savings account | Paid parental leave plans | Paid time off and holidays | Retirement 401k with employer matchMid-level Full TimeBoston, United States R7d ago
-
Member of Technical Staff, AI Engineering USD 162K-297KAutogen | BF16 | C++ | CI/CD | CUDAIncome Protection for Illness or Injury | Medical, dental, vision plans | Paid Holidays | Paid family leave | Paid time offSenior-level Full TimeBoise, ID - Main Site, United …7d ago
-
CUDA | Deep learning | Distributed Training | Docker | GPU PerformanceCareer development | Continuous learning | Flexible remote work | International work environment | Technical ownershipMid-level Full TimeGermany7d ago
-
Deep learning | Evaluation Pipelines | GPU Cluster | High Performance | High-Performance ComputingSenior-level Full TimeIsrael, Tel Aviv8d ago
-
Solutions Architect, Pre-training and Post-training KRW 65000K-90000KArtificial Intelligence | Debugging | Deep learning | Fine Tuning | GPU ArchitectureSenior-level Full TimeKorea, Seoul, Korea, Republic of8d ago
-
AI Platform Engineer INR 1500K-2500KAutomated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batchingMid-level Full TimeBangalore, India8d ago
-
Senior-level Full TimeIsrael8d ago
-
AI Engineer - Tieto Banktech (m/f/d) NOK 792K-1075KAWS | Anthropic | Azure | CI/CD | DockerAutonomy | Collaborative culture | Hybrid workingMid-level Full TimeTrondheim, Trøndelag, Norway8d ago
-
AI Engineer - Tieto Banktech (m/f/d) NOK 792K-1075KAWS | Anthropic | Azure | CI/CD | Cloud platformHybrid workingMid-level Full TimeTrondheim, Trøndelag, Norway8d ago
-
AI Engineer - Tieto Banktech (m/f/d) NOK 792K-1075KAWS | Agentic Workflows | Anthropic | Azure | CI/CDFlexible hybrid workingMid-level Full TimeFornebu, Akershus, Norway8d ago
-
AI Engineer - Tieto Banktech (m/f/d) NOK 792K-1075KAWS | Agent Frameworks | Azure | CI/CD | Cloud platformAutonomy | Flexible hybrid working | Knowledge sharingMid-level Full TimeBergen, Vestland, Norway8d ago
-
AI Engineer - Tieto Banktech (m/f/d) NOK 792K-1075KAWS | Agent Frameworks | Anthropic API | Azure | CI/CDAutonomy | Collaborative culture | Flexible hybrid workingMid-level Full TimeBergen, Vestland, Norway8d ago
-
【26届校招】Software Engineer (All Levels) – 大模型与智能机器人系统 CNY 240K-480KC++ | CUDA | DDS | GPU memory | GPU memory managementNone Full Time广州、深圳9d ago
-
Solution Architect (AI/LLM Inference) USD 165K-330KArtificial Intelligence | Benchmarking | Embeddings | GPU Selection | Image Generation401k company match | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco10d ago
-
AWQ | Audio codecs | Audio streaming | Autoscaling | Chunked prefill401k matching | Annual offsites | Dental coverage | Employer-paid training | Healthcare benefitsMid-level Full TimeSan Francisco, CA12d ago
-
Automatic Speech Recognition | DeepSpeed | Distributed Training | FSDP | GPU Memory Optimization401k matching | Healthcare Dental Vision | Hybrid work | New parent leave | Office StockedMid-level Full TimeSan Francisco, CA12d ago
-
Forward Deployed Engineer (Inference & Post-Training) USD 270K-300KDPO | GRPO | KV cache | LoRA | Pipeline parallelismEquity | Health insurance | Remote work flexibilitySenior-level Full TimeSan Francisco13d ago
-
Staff Software Engineer, Inference USD 188K-275KBF16 | C++ | CUDA | Distributed Systems | FP8401k employer match | Dental insurance | Employee stock purchase program | Flexible PTO | Flexible spending accountSenior-level Full TimeSunnyvale, CA / Bellevue, WA13d ago
-
Senior-level Full TimeIraklio, Greece13d ago
-
Audio Inference Engineer, Model Efficiency USD 165K-300KC++ | Deep learning | Distributed inference | GPU Programming | Low-level systemCo-working stipend | Health and dental benefits | Inclusive culture | Mental health budget | Parental leave top-upMid-level Full TimeNew York14d ago
-
None Full Time广州、深圳15d ago
-
Senior-level Full TimeTaichung - AATT, Taiwan16d ago
-
Senior GenAI Engagement Lead, Partner Platforms USD 184K-356KAgent systems | Agentic AI | Autogen | Automl | Cloud NativeSenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Senior GenAI Technical Lead, Partner Platforms USD 184K-356KAWS | Agent systems | Agentic AI | Autogen | AzureSenior-level Full TimeUS, CA, Santa Clara, United States16d ago
-
Machine Learning Platform Engineer CAD 103K-155KBash | DevOps | Docker | GPU Monitoring | GitHub ActionsSenior-level Full TimeRBC WATERPARK PLACE, 88 QUEENS QUAY …17d ago
-
AI Engineer - Model Performance USD 165K-250KAttention Backend | Audio Processing | Batching | CUDA | CUDA graphAsync communication | Innovation-focused culture | Remote work | Startup environment | Supportive teamMid-level Full TimeSF Hybrid R20d ago
-
AI Engineer - Tieto Banktech (m/f/d) SEK 396K-480KAWS | Anthropic | Azure | CI/CD | DockerAutonomy | Collaborative culture | Hybrid workingMid-level Full TimeSolna, Sweden20d ago
-
Software Engineer, Inference AI/ML USD 92K-135KAlgorithms | C++ | CI/CD | CUDA | Caching401k match | Company paid life insurance | Flexible PTO | Health savings account | Medical, dental, and vision insuranceEntry-level Full TimeSunnyvale, CA / Bellevue, WA21d ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Distributed Systems | FP8 | FasterTransformer | Flash AttentionOn-site workEntry-level Full Time InternshipCHN - Minhang, China22d ago
-
AI Software Engineer Intern CNY 38K-50KCUDA | Compiler optimization | Continuous batching | Distributed Systems | Dynamic batchingOn-site workEntry-level Full Time InternshipCHN - Minhang, China22d ago
-
Asynchronous programming | Asyncio | Deep learning | DeepSpeed | Distributed TrainingSenior-level Full TimeChina, Shanghai27d ago
-
AI/ML Applied Data Scientist - Generative AI USD 121K-208KContext Management | Deep learning | Deep reinforcement learning | Efficient Fine Tuning | Fine TuningMid-level Full TimeNewport Beach, CA, US, 9266027d ago
-
Senior Software Engineer, RL Post-Training Frameworks USD 184K-356KActor Based Programming | C# | C++ | Consistency models | DPOComprehensive benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States28d ago
-
Senior Deep Learning Algorithms Engineer - BioNeMo USD 180K-333KApplied Machine Learning | C++ | CUDA | CUDA kernels | Deep learningSenior-level Full TimeVietnam, Ho Chi Minh City29d ago
-
API | Agent systems | Audit Logging | Authentication | AuthorizationSenior-level Full TimeUS, CA, Santa Clara, United States29d ago
-
Engineering Manager, Agentic Systems - Moveworks USD 113K-192KC++ | Deep learning | DeepSpeed | Distributed Training | GPU infrastructureMid-level Full TimeMountain View, CALIFORNIA, United States29d ago
-
Senior-level Full TimeToronto, Canada30d ago
-
Pioneer Talent Program - Applied Data Scientist (FTE) PHP 384K-480KAgent systems | Agentic AI | Artificial Intelligence | Chain-of-Thought | Data integrationCareer growth | Continuous learning | Work from homeMid-level Full TimeAsia30d ago
-
Senior AI Engineer Specialist INR 2500K-3500KAgentic AI | Apache Spark | Direct Preference Optimization | Distributed Computing | Embedding architecturesSenior-level Full TimeIND - Bengaluru - Esko-Graphics India …1mo ago
-
Agentic AI | Autogen | BF16 | Big Data | CI/CDSenior-level Full TimeFab 10A, Singapore1mo ago
-
LLM Algorithm Engineer CNY 38K-50KAPI Development | Agent systems | Attention Mechanisms | Autogen | CUDAEnglish courses | Meal allowance | Online learning access | Onsite gym | Onsite massagesMid-level Full TimeChang Sha Shi, China1mo ago