Find jobs in AI/ML, Data Science and Big Data
71 results
for TensorRT-LLM
(Skill/Tech stack)
-
Entry-level Full Time北京、广州、上海17h ago
-
AI Solution Architect, AI 解決方案架構師 (內湖瑞光) TWD 310K-480KAI Agent | AI Foundry | AI Search | API Gateway | AWS BedrockSenior-level Full TimeTaipei Neihu, Taiwan R1d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++100% remote | Full-time W2 employment | H1B transfer supportSenior-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full TimeSingapore4d ago
-
Senior-level Full TimeUnited States - Remote R4d ago
-
AI Software Engineer USD 151K-332KC++ | CUDA | CUDA kernels | CUDA profiling | Cache ManagementCommunity involvement | Health benefits | Hybrid work options | In-person work options | Remote work optionsMid-level Full TimeSeattle (WA), United States5d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | BatchingRemote workSenior-level Full TimeUnited States - Remote R5d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | C++100 percent remote work | Career growth | Full-time employmentSenior-level Full TimeUnited States - Remote R5d ago
-
ML Platform Engineer USD 100K-150KAPI Gateway | Abuse detection | Automated rollback | Autoscaling | BatchingSenior-level Full TimeUnited States - Remote R5d ago
-
C++ | Deep learning | Distributed Training | ETL | GoSenior-level Full TimeMountain View, CALIFORNIA, United States5d ago
-
C++ | ETL | Go | Hugging Face | MicroservicesFlexible work arrangements | Learning and development support | Remote work optionsSenior-level Full TimeMountain View, CALIFORNIA, United States5d ago
-
Senior Solutions Architect, Generative AI Research USD 184K-287KAI Agents | AI Feedback | Agent evaluation | Artificial Intelligence | BatchingSenior-level Full TimeUS, FL, Remote, United States R6d ago
-
MLOps Engineer (LLM/GenAI) GBP 27K-27KAWS | Accelerate | Azure | Batching | CUDAContributory pension scheme | Enhanced maternity and adoption pay | Private healthcare | Tailored professional development opportunities | Work accommodations supportEntry-level Full TimeSheffield, United Kingdom R6d ago
-
Engineering Manager, LLM Performance USD 224K-431KAPI Development | C++ | CUDA | GPU Architecture | LLM InferenceMid-level Full TimeUS, CA, Santa Clara7d ago
-
Senior-level Full Time上海、北京7d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | CUDA | Continuous batching | CutlassRemote workMid-level Full TimeUnited States - Remote R8d ago
-
Agent systems | Agentic Systems | Artificial Intelligence | Batching | CachingExecutive-level Full TimeUSA-SAN JOSE, United States8d ago
-
Sr GenAI Infra Specialist SA, AWS WWSO Startup USD 153K-228KAWS Inferentia | AWS Trainium | Amazon Web Services | Batching | CUDASenior-level Full TimeNew York, New York, USA12d ago
-
AI运维工程师(大模型推理 / AI Infra) CNY 180K-300KAlerting | Automation | Docker | GPU Acceleration | High AvailabilityEntry-level Full Time深圳13d ago
-
AI Engineer EUR 60K-80KAWQ | AWS | Agent SDK | CI/CD | CUDACareer growth opportunities | Permanent employment | Remote work optionMid-level Full TimeRemote - Paris, France R13d ago
-
Senior SW Engineer – AI Infrastructure & Optimization PLN 280K-383KCUDA | Cloud Platforms | Distributed Systems | GPU Performance | GPU Performance OptimizationSenior-level Full TimeKraków, Małopolskie, PL14d ago
-
Director, Engineering - Serverless Inference INR 1962K-6000KAPI Gateway | Capacity Planning | Cloud Native | Distributed Systems | Fault ToleranceEmployee assistance program | Employee stock purchase program | Flexible time off | LinkedIn Learning access | Local Employee MeetupsExecutive-level Full TimeBengaluru14d ago
-
Senior Machine Learning Engineer USD 188K-282KAdversarial Training | Calibration monitoring | Continuous batching | DPO | Deep learningSenior-level Full TimePalo Alto, CA14d ago
-
Senior Inference Engineer, AIConfigurator for Dynamo USD 184K-356KBatching | Distributed Systems | Expert parallelism | GPU Computing | High PerformanceEquity | Health benefits | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States18d ago
-
Senior SW Engineer – AI Infrastructure & Optimization USD 184K-300KCUDA | Cloud Platforms | GPU Performance | GPU Performance Optimization | Gateway APISenior-level Full TimeIsrael, center, IL18d ago
-
Senior Solutions Architect, GPU Cloud GenAI – Infrastructure INR 2200K-5000KAnsible | C plus plus | CI/CD | Data parallelism | Device pluginSenior-level Full TimeIndia, Mumbai19d ago
-
AI Engineer USD 100K-135KAWQ | AWS | AWS EC2 | Agent Frameworks | CI/CD401k match | Health insurance | Learning and development stipend | Paid parental leave | Paid time offMid-level Full TimeRemote USA - In Tandem R19d ago
-
C++ | Cloud Native | Container Orchestration | Deep learning | Distributed SystemsCareer growth | Open Source contribution | World Class CollaborationEntry-level Full TimeSan Jose, California, United States19d ago
-
C# | C++ | Computer Vision | Debugging | Deep learningSenior-level Full TimeChina, Shanghai20d ago
-
AWS | Apache Flink | Apache Spark | Azure | C++Senior-level Full TimeSanta Clara, CA22d ago
-
AWQ | AWS | Batching | CPU architecture | CUDASenior-level Full TimeGuangzhou, Guangdong, China25d ago
-
C++ | Deep learning | Distributed Training | ETL | GoSenior-level Full TimeMountain View, CALIFORNIA, United States25d ago
-
Senior Machine Learning Engineer (Inference Platform) USD 175K-225KAWS | Alerting | CI/CD | Continuous batching | Data ProcessingSenior-level Full TimeRemote - USA R26d ago
-
Senior-level Full TimeSeoul, Korea27d ago
-
Senior-level Full TimeUS, CA, Remote, United States R28d ago
-
Artificial Intelligence | Bottleneck analysis | CUDA | Deep learning | Diffusion ModelsBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States29d ago
-
CUDA | CUDA-X | DevRel | Dynamo | Inference ServingMid-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Senior Solutions Architect - Generative AI INR 2475K-4500KArgo | CI/CD | CUDA | Evaluation | FedRAMPSenior-level Full TimeIndia, Pune1mo ago
-
Engineering Manager, Inference Benchmarking — AI Perf USD 224K-356KDCGM | Distributed Systems | GPU Telemetry | GPU observability | HelmSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
Product Manager - AI Inference & Model Serving USD 165K-275KAI Inference | Artificial Intelligence | Autoscaling | Cache Management | Continuous batchingConference attendance | Professional development | Stock options | Training | Workstation providedMid-level Full TimeAustin, TX, United States1mo ago
-
Data Curation | Deep learning | DeepSpeed | Direct Preference Optimization | EvaluationSenior-level Full TimeSingapore, Singapore1mo ago
-
AI工程师-Agent Infra & LLMOps 方向(成都) CNY 180K-360KAccess Control | AutoGPT | CPU isolation | Docker | FirecrackerNone Full Time成都1mo ago
-
Entry-level Full Time武汉1mo ago
-
Entry-level Full Time北京1mo ago
-
Intern, AI Engineering USD 64K-106KCUDA | CUDA kernel | CUDA kernel development | Hugging Face | Inference OptimizationEntry-level InternshipSan Francisco, California1mo ago
-
Product Manager - AI Inference & Model Serving USD 160K-275KAI Inference | Autoscaling | Cache Management | Cold Start | Cold Start OptimizationConference attendance | Professional development and training | Stock options | Workstation providedMid-level Full TimeAustin, TX, United States1mo ago
-
AWQ | AWS | Accelerate | Azure | BatchingMid-level Full TimeShenzhen, Guangdong, China R1mo ago
-
Solutions Architect - AI Technology Center, Foundation Model Building KRW 65000K-90000KAI model | AI model development | CUDA | Debugging | Fine TuningSenior-level Full TimeKorea, Seoul, Korea, Republic of1mo ago
-
Staff Machine Learning Engineer, Voice AI USD 220K-280KAudio codecs | Audio signal processing | Batching | CUDA | Deep learningHealth insurance | Startup equitySenior-level Full TimeSan Francisco1mo ago
-
Inference Engineer - Acceleration CHF 110K-160KAdmission control | CUDA | Cutlass | FlashAttention | KV cacheCommuting subsidy | Learning and development budget | Offsites and team events | Pension plan | Vacation daysMid-level Full TimeZürich, Switzerland1mo ago