Senior Software Engineer, Machine Learning Inference
US, CA, Santa Clara, United States
USD 152K-287K Senior-level Full Time
Tasks
- Collaborate with deep learning experts on hardware and software design
- Design inference software optimizations
- Develop NVIDIA TensorRT and TensorRT LLM
- Develop software in C plus plus CUDA and Python
- Implement inference deployment optimizations
- Optimize deep learning inference performance
Perks/Benefits
Skills/Tech-stack
C plus plus | CUDA | Compilers | Deep learning | GPU Programming | JAX | Machine Learning | OpenCL | Performance Analysis | PyTorch | Python | Rust | SGLang | Systems programming | TensorRT | VLLM
Education
Regions
Countries
States
Cities
Related jobs
-
Data Engineer USD 119K-200KAzure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage | Azure SynapseMid-level Full TimeCenter, Center District, IL7h ago
-
Cloud Software Engineer II USD 150K-250KAngular | Apache Lucene | Apache Solr | Aspect-Oriented Programming | Aspect-orientedSenior-level Full TimeAnnapolis Junction, MD8h ago
-
Cloud Software Engineer II USD 150K-250KAngular | Apache Hadoop | Apache Lucene | Apache Solr | Aspect-Oriented ProgrammingSenior-level Full TimeAnnapolis Junction, MD8h ago
-
Senior-level Full TimeAnnapolis Junction, MD8h ago
-
Tech Lead Manager, Large Language Models & Generative AI USD 308K-696KArtificial Intelligence | Deep learning | Information Retrieval | Intent Recognition | Language ModelsSenior-level Full TimeSan Jose, California, United States9h ago
-
Senior Software Engineer, AI Coding Tools USD 244K-588KDeep learning | Distributed Training | Fine Tuning | GPU Computing | Inference OptimizationSenior-level Full TimeSan Jose, California, United States9h ago
-
Software Engineer, AI Agent USD 199K-340KAgent systems | Agentic Workflows | Attribution Modeling | Bidding strategy | Chain-of-ThoughtSenior-level Full TimeSan Jose, California, United States9h ago
-
Adversarial Training | Computer Vision | Dataset curation | Distributed Training | GPU ComputingEntry-level Full TimeMenlo Park, CA10h ago
-
Partner Engineering GenAI - US USD 136K-203KAPIs | C++ | Claude | Cloud Computing | Data integrationSenior-level Full TimeMenlo Park, CA | Seattle, WA …10h ago
-
Computer Science Research - US - IC5 USD 166K-244KDeep learning | Image to Video Generation | Image-to-video | Information Extraction | Language ModelsKnowledge sharing | Mentoring | Open source contributions | Research collaborationMid-level Full TimeBellevue, WA | Menlo Park, CA10h ago
-
API Design | Agentic Workflows | C# | C++ | Code reviewSenior-level Full TimeRedmond, WA10h ago
-
AI Specialist - Product and Applied Research USD 180K-200KC++ | Computer Vision | Crawling | Data Mining | Data RegressionMid-level Full TimeMenlo Park, CA | New York, …10h ago
-
Software Engineer, Databases (Technical Leadership) USD 161K-297KAI Tooling | Automation | Consensus Protocols | Data Integrity | Database InternalsSenior-level Full TimeBellevue, WA | Menlo Park, CA10h ago
-
Senior-level Full TimeMenlo Park, CA10h ago
-
Staff Machine Learning Engineer, Inference Team USD 207K-300KData Processing | Debugging | Fine Tuning | Inference Serving | Language ProcessingSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA10h ago
-
Senior Software Engineer, AI/ML, Platforms and Devices USD 174K-252KC++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingBonus | Equity | Health insurance | Learning and development | Paid time offSenior-level Full TimeMountain View, CA, USA; United States11h ago
-
Staff Software Engineer, AI/ML GenAI, Google Cloud USD 207K-300KComputer Vision | Data Preparation | Data Processing | Debugging | Distributed ComputingSenior-level Full TimeSunnyvale, CA, USA; San Francisco, CA, …11h ago
-
C++ | Data Processing | Data Structures | Data Structures and Algorithms | DebuggingSenior-level Full TimeKirkland, WA, USA; New York, NY, …11h ago
-
Apache Beam | Apache Kafka | BigQuery | Cloud Storage | Cloud platformCampus facilities | Completion bonus | Consultant support | Multiple project extensions | Onsite workMid-level Full TimeJuno Beach, FL11h ago
-
Full Stack AI Software Engineer USD 216K-283KAPI contracts | AWS | Azure | Classification | Data PipelinesAdoption leave | Commuter benefits | Dental insurance | Disability insurance | Equity ESPPExecutive-level Full TimeSan Mateo, CA, United States15h ago
-
Embedded Controls Engineer USD 138K-230KARM Cortex | ARM Cortex-M | BLDC | Bode Plot | Bode plot analysisMid-level Full TimeBoulder, CO18h ago
-
Machine Learning Engineer, AI Agent Platform USD 110K-230KAPI Development | Benchmarking | Distributed Systems | Evaluation | LLM orchestrationHealth insurance | Paid time off | Parental leaveSenior-level Full TimeBay Area20h ago
-
CI/CD | Data Engineering | Data Processing | Docker | ERPFlexible hours | Hybrid work | Onsite remote splitSenior-level Full TimeHouston, TX, United States21h ago
-
Entry-Level AI / ML Software Engineer USD 60K-74KAgile | Algorithms | Code review | Data Structures | Deep learningEntry-level Full TimeHopkins, MN, United States22h ago
-
Senior, Data Scientist (Machine Learning Engineer) USD 117K-234KAirflow | Azure | Batch inference | CI/CD | CNNSenior-level Full Time(USA) Crossman Respect Building CA SUNNYVALE …22h ago