Find jobs in AI/ML, Data Science and Big Data
35 results for TensorRT-LLM (Skill/Tech stack)
-
Intern, AI Engineering
USD 80K-124K
CUDA | Hugging Face | Hugging Face Transformers | Inference Optimization | Language Models
Employee benefits | Flexible work environment | Remote work options
Entry-level Internship
San Francisco, California
10h ago
-
AWS EKS | BigQuery | CI/CD | ClickHouse | Distributed Systems
Gym membership | Healthcare | Home-office equipment | Life insurance | Lunch card
Senior-level Full Time
Remote
1d ago
-
Lead AI Platform
USD 184K-339K
Batching | CUDA | CUDNN | Cause analysis | Concept drift
Career advancing opportunities | Employment referral program | English language development courses | Interest-free loans | Location premium
Senior-level Full Time
Cairo, Cairo Governorate, Egypt
2d ago
-
Senior Solutions Architect - KV Cache and AI Storage
CNY 460K-600K
Bluefield | CMX | Caching | Cassandra | Ceph
Senior-level Full Time
China, Beijing
3d ago
-
Solutions Architect, LLM Model Builder
USD 152K-241K
Benchmarking | CUDA | Compression | Distillation | Evaluation
Senior-level Full Time
US, CA, Santa Clara, United States
3d ago
-
Solutions Architect, LLM Model Builder
USD 152K-241K
Benchmarking | CUDA | Compression | Distillation | Evaluation
Senior-level Full Time
US, CA, Santa Clara, United States
3d ago
-
Machine Learning Engineer, Responsible AI
USD 177K-387K
A I | A I Safety | A/B | A/B Testing | Automated testing
Community involvement | Health benefits | Hybrid work | In person options | Mental health support
Mid-level Full Time
Seattle (WA), United States
3d ago
-
Principal Engineer - Generative AI Infra Capabilities
INR 2000K-4600K
Apigee | Arize | Avi | CI/CD | CUDA
Senior-level Full Time
110380-IND-BENGALURU-INTL BLR Twr-1&2 CARNATION, India
6d ago
-
Senior Member of Technical Staff: ML Systems and Infrastructure
INR 2500K-4000K
Argo Workflows | ArgoCD | CI/CD | CUDA | GitHub Actions
Senior-level Full Time
Bangalore, India
8d ago
-
Senior Deep Learning Engineer
PLN 221K-383K
CUDA | Diffusion Models | Docker | Inference Server | PyTorch
Hybrid work
Senior-level Full Time
UK, Remote, United Kingdom
8d ago
-
Software Engineer (All Levels) – Large Models and Intelligent Robotics Systems
CNY 240K-480K
C++ | CUDA | DDS | GPU memory | GPU memory management
Entry-level Full Time
Guangzhou, Shenzhen
8d ago
-
Staff Machine Learning Engineer, GenAI Platform
USD 253K-354K
CUDA | DeepSpeed | Distributed Systems | Docker | FSDP
401k employer match | Family planning support | Flexible vacation | Gender-affirming care | Healthcare benefits
Senior-level Full Time
Remote - United States
8d ago
-
Member of Technical Staff, AI Engineering
USD 162K-328K
BF16 | C++ | CI/CD | CUDA | CUDA kernels
Dental insurance | Income Protection for Illness or Injury | Medical insurance | Paid Holidays | Paid family leave
Senior-level Full Time
Boise, ID - Main Site, United …
9d ago
-
Agentic AI | Autogen | BF16 | Big Data | CI/CD
Senior-level Full Time
Fab 10A, Singapore
9d ago
-
AI Engineer - EU
EUR 61K-81K
A/B | A/B Testing | Amazon S3 | Audit Logs | B testing
Education budget | Executive coaching | Health checkup budget | PTO | Remote-first
Senior-level Full Time
Italy - Remote
9d ago
-
Container Orchestration | Distributed Systems | GPU Acceleration | Kubernetes | LLM Inference
Career growth opportunities | Collaborative engineering environment | Global datacenter exposure | Hyper scale environment | Open source contribution opportunities
Entry-level Full Time
Seattle, Washington, United States
10d ago
-
Senior Applied Scientist - Sovereign AI
INR 2500K-4600K
Ablation Studies | Benchmarking | Knowledge Distillation | Machine Learning | Megatron-LM
Senior-level Full Time
India, Bengaluru
10d ago
-
Deep Learning Algorithms Engineer - ACOT
USD 152K-287K
C++ | CUDA | Diffusion Models | Distributed Training | GPU Architecture
Entry-level Full Time
Vietnam, Ho Chi Minh City
10d ago
-
Principal Engineer, Machine Learning, SMAI
SGD 96K-155K
Agentic AI | Auto RL | Autogen | BF16 | Big Data
Senior-level Full Time
Fab 10A, Singapore
10d ago
-
Attention Mechanism | C++ | Deep learning | Distributed Systems | Docker
Entry-level Full Time
US, CA, Santa Clara, United States
10d ago
-
Senior Machine Learning Engineer, Voice AI
USD 200K-260K
Audio codecs | Audio signal processing | Automatic Speech Recognition | Batching | CUDA
Health insurance | Startup equity
Senior-level Full Time
San Francisco
10d ago
-
Senior-level Full Time
Fab 10A, Singapore
11d ago
-
AI Model Deployment | AI model | Artificial Intelligence | Debugging | Deep learning
Senior-level Full Time
Korea, Seoul, Korea, Republic of
11d ago
-
Entry-level Internship
Beijing
14d ago
-
AWQ | C++ | CUDA | CUDA kernels | Continuous batching
Senior-level Full Time
Donostia, Spain
15d ago
-
Senior Software Engineer II, Inference
USD 165K-242K
Autoscaling | BF16 | C++ | CI/CD | CUDA
401k match | Employee stock purchase program | Flexible PTO | Flexible spending account | Health savings account
Senior-level Full Time
Sunnyvale, CA / Bellevue, WA
16d ago
-
Engineering Manager, Agentic Systems
USD 162K-284K
C++ | Deep learning | DeepSpeed | Distributed Training | GPU Optimization
Mid-level Full Time
Mountain View, CALIFORNIA, United States
16d ago
-
Engineering Manager, Inference ML Runtime
USD 180K-250K
C++ | Cloud infrastructure | Deep learning | Distributed Systems | High Performance
Mid-level Full Time
Sunnyvale CA or Toronto Canada
16d ago
-
4-bit | C plus plus | C++14 | C++17 | CI/CD
Senior-level Full Time
Germany, Munich
21d ago
-
AWS | Agent Frameworks | Azure | CI/CD | Cost Optimization
Employee share option plan | Flexible working | Hybrid working | National Police Check | Paid parental leave
Mid-level Full Time
Sydney
22d ago
-
API Gateways | CNI | Cilium | Cloud Native | Computer Architecture
Dental | Disability | Family medical leave | Flexible spending | Health savings
Senior-level Full Time
Boston, United States (Remote)
1mo ago
-
Solutions Architect, Inference Deployments
USD 152K-241K
AI Inference | AI inference workloads | Disaggregated inference | GPU Operator | GPU Orchestration
Benefits | Equity
Senior-level Full Time
US, CA, Santa Clara, United States
1mo ago
-
Senior Solutions Architect, HPC and AI
PLN 292K-650K
C++ | CUDA | Cluster management | Dynamo | GPU Architectures
Senior-level Full Time
UK, Remote, United Kingdom
1mo ago
-
Batch ML serving | DPO | Data platform | Data platform architecture | Decision Making
Senior-level Full Time
US-CA-Menlo Park
1mo ago
-
Software Engineer - AI Inference Infrastructure
USD 129K-246K
Cloud Native | Cloud Native Systems | Container Management | Distributed Storage | GPU Acceleration
Senior-level Full Time
Seattle, Washington, United States
1mo ago