Find jobs in AI/ML, Data Science and Big Data
17 results
for Inference Engineer
(Role)
-
Staff AI Inference and Acceleration Engineer USD 180K-275KAI Inference | Benchmarking | C++ | CUDA | Computer ArchitectureSenior-level Full TimeSan Jose, CA3d ago
-
AI Inference Engineer QVAC PLN 288K-383KC++ | Deep learning | Diffusion Models | GPU Programming | GgmlCareer progression | English classes | Learning opportunities | Medical benefits | MeetupsSenior-level Full TimeWarsaw, Masovian Voivodeship, Poland - Remote R4d ago
-
Senior ML Inference Engineer - Platform USD 128K-261KAirflow | CUDA | Flyte | Inference Server | Inference latencyEmployee assistance program | Life insurance | Medical, dental, and vision | Paid vacation and holidays | Relocation benefitsSenior-level Full TimeGM Automation - Sunnyvale - GM …6d ago
-
LLM Inference Frameworks and Optimization Engineer USD 160K-230KC++ | CUDA | CUDA graph | Cluster scheduling | CompilerEquity | Health insuranceMid-level Full TimeSan Francisco, Singapore, Amsterdam14d ago
-
Lead ML Inference Engineer, Advertising USD 195K-352KArtificial Intelligence | Auction dynamics | Co-design | Control Optimization | Distributed SystemsSenior-level Full TimeAustin, Texas15d ago
-
Senior Inference Engineer, AIConfigurator for Dynamo USD 184K-356KBatching | Distributed Systems | Expert parallelism | GPU Computing | High PerformanceEquity | Health benefits | Hybrid workSenior-level Full TimeUS, CA, Santa Clara, United States18d ago
-
AWS | Apache Flink | Apache Spark | Azure | C++Senior-level Full TimeSanta Clara, CA22d ago
-
Inference Engineer USD 180K-250KCUDA | Continuous batching | Distributed Systems | Generative Models | Machine Learning401k | Commuter allowance | Dental insurance | Flexible PTO | Health insuranceMid-level Full Time*HQ - San Francisco, CA R24d ago
-
AI Inference Engineer USD 110K-270KBenchmarking | C# | C++ | Hugging Face | Hugging Face Transformers401k retirement plan | Commuting support | Company Provided Lunches | Flexible paid time off | Medical, dental, vision plansSenior-level Full TimeBurlingame, California, United States R1mo ago
-
Lead ML Inference Engineer, Advertising USD 246K-486KCo-design | Distributed Systems | GPU Acceleration | Hardware-Software Co-design | Hardware/softwareDisability benefits | Equity awards | Health insurance | Life insurance | Paid time offSenior-level Full TimeSan Jose, California1mo ago
-
Inference Engineer - Acceleration CHF 110K-160KAdmission control | CUDA | Cutlass | FlashAttention | KV cacheCommuting subsidy | Learning and development budget | Offsites and team events | Pension plan | Vacation daysMid-level Full TimeZürich, Switzerland1mo ago
-
Amazon Web Services | Apache Airflow | Apache Flink | Apache Kafka | Apache SparkSenior-level Full TimeMassachusetts, Massachusetts, United States1mo ago
-
AI Systems Engineer (LLM & RAG Optimization) EUR 60K-81KAPI Development | CPU Optimization | Containerization | GPU Optimization | Information RetrievalEmployee stock options | Free canteen | Health insurance | Hybrid work | On-site kindergartenSenior-level Full TimeGetafe, Spain R1mo ago
-
AI Platform Engineer INR 1500K-2500KAlerting | CUDA | Cause analysis | Continuous batching | GPU ProfilingMid-level Full TimeBangalore, India1mo ago
-
Senior-level Full TimePalo Alto1mo ago
-
Audio Inference Engineer, Model Efficiency USD 165K-300KC++ | Deep learning | Distributed inference | GPU Programming | Low-level systemCo-working stipend | Health and dental benefits | Inclusive culture | Mental health budget | Parental leave top-upMid-level Full TimeNew York1mo ago
-
Senior-level Full TimeDublin, Ireland1mo ago