Senior Deep Learning Software Engineer, LLM Performance
US, CA, Santa Clara, United States
USD 184K-356K Senior-level Full Time
Tasks
- Analyze LLM inference latency and throughput
- Collaborate with teams on performance modeling and kernel development
- Contribute to TensorRT and Triton code
- Develop and contribute to LLM inference benchmarking frameworks
- Implement GPU accelerated deep learning inference pipelines
- Implement LLM inference serving and deployment
- Optimize LLM inference performance
- Scale LLM performance across NVIDIA accelerators
- Tune LLM VLM and GenAI models
Perks/Benefits
Skills/Tech-stack
C# | C++ | CUDA | Inference Server | JAX | LLM | OpenCL | PyTorch | Python | TensorFlow | TensorRT | Triton | Triton Inference | Triton Inference Server | VLM
Education
Regions
Countries
States
Cities
Related jobs
-
AWS | Analytics | Data Mining | Generative AI | Machine LearningMentorship | Training | Work-life balanceSenior-level Full TimeArlington, Virginia, USA7h ago
-
API Integration | AWS | Autogen | Azure | Cloud platformHybrid work environmentSenior-level Contract Full TimeChicago, Illinois, United States9h ago
-
Senior/Staff Software Engineer - Perception & Sensing USD 195K-280K3D Object Detection | 3D segmentation | A/B | A/B Testing | B testingSenior-level Full TimeFoster City, CA9h ago
-
AI Engineer USD 103K-140KAI Agents | AI Studio | Access Control | Anthropic Claude | AuthenticationBonus eligibleSenior-level Full TimeDenver, CO, United States9h ago
-
Staff Software Engineer, GenAI Platform USD 208K-250KAPI | AWS EKS | Access Control | Agent Orchestration | Audit LoggingCatered lunches | Cultural and team offsites | Employee giving match | Flexible work schedule | Generous vacation policySenior-level Full TimeSan Francisco, CA, United States10h ago
-
Analytics Engineer USD 95K-115KAirflow | DBT | Dagster | Data Governance | Data IntegrityBackground check compliant | Hybrid work | In office once every 2 weeks | Industry complianceMid-level Full TimeChicago, Illinois, United States12h ago
-
Robotic Orchestration Platform Software Engineer USD 125K-250KAWS | Agile | Azure | C# | CI/CDAgile environment | Remote supportSenior-level Full TimeSan Francisco, California12h ago
-
Senior Data Engineer USD 111K-124KAccess Control | Agile | Azure | CI/CD | Data Governance401k contributions | Education assistance | Life and disability coverage | Medical, dental, and vision coverage | Paid sabbaticalSenior-level Full TimeAtlanta, Georgia or Gainesville, FL13h ago
-
Senior-level Full TimeBoston, Massachusetts, United States13h ago
-
Data Analytics & Engineering Opportunities USD 65K-105KHive | Microstrategy | MySQL | Oracle | Python401k with firm profit share | Dental insurance | Disability insurance | Firm paid holidays | Flexible spending accountEntry-level Full TimeWashington, DC, United States14h ago
-
Mid-Level Data Engineer USD 90K-98KAPI Development | Azure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake StorageRemote workMid-level Full TimeWork from home, VA, United States R15h ago
-
Senior Data Engineer USD 165K-180KAPIs | Anomaly Detection | Azure | Azure Data | Azure Data FactorySenior-level Full TimeWork from home, VA, United States R15h ago
-
Analytics Engineer USD 110K-120KDBT | Data Modeling | Data Warehousing | Documentation | GitCareer growth | MentorshipMid-level Full TimeLehi, Utah16h ago
-
Sr. Machine Learning Engineer USD 100K-160KCI/CD | Data Fusion | Data analytics | Deep learning | DockerHybrid work environmentSenior-level Full TimeCocoa Beach, Florida, United States16h ago
-
Software Engineer II, Computational Platform USD 124K-154KAWS | Agentic AI | Data Modeling | Docker | ETL401k plan | Annual performance bonus | Commuter support | Company-provided laptop | Flexible paid time offMid-level Full TimeRemote; Watertown, Massachusetts, United States R16h ago
-
Quantitative Developer (DV Equities) USD 100K-150KC++ | Linux | Mathematics | Python | StatisticsDental insurance | Dependent care options | FSA | Flexible vacation | Group term life insuranceNone Full TimeNew York17h ago
-
Senior AI Engineer | Sage Home Loans USD 150K-220KAgent Orchestration | Automated Regression | Automated regression testing | Cost Optimization | DPO401k match | Disability insurance | Employee assistance program | Flexible paid time off | Flexible spending accountsSenior-level Full TimeCharlotte, NC R17h ago
-
Senior DevOps Engineer ID63545 USD 135K-185KAWS | Apache Airflow | ArgoCD | Azure | BigQueryFlextime | Growth roadmaps | Mentorship | Office work options | Remote work optionsSenior-level Full TimeMiami, United States17h ago
-
Junior AI Engineer USD 71K-85KAPI Development | AWS | Azure | CI/CD | Data Pipelines401k match | Commuter benefits | Dental insurance | Dependent care support | Employee discountsEntry-level Full TimeUS - Baltimore17h ago
-
Evergreen - Mathematics for Machine Learning USD 80K-300KAutodiff | JAX | Linear Algebra | Matrix Operations | NumPyAsynchronous hiring process | Flexible collaboration | Part-time hoursMid-level Full TimeBoston, US18h ago
-
Senior Director, AI / Machine Learning Software Engineer USD 136K-300KApache Flink | Apache Spark | CI/CD | Data Lineage | Data PrivacyHealth benefits | Paid leave | Paid volunteer timeSenior-level Full TimeNew York, NY, United States18h ago
-
Data Engineering | Machine Learning | Machine Learning Pipelines | Python | Recommendation SystemsSenior-level Full TimeSan Jose, California, United States19h ago
-
Data Pipelines | Full Stack | Full-Stack Development | Machine Learning | PythonSenior-level Full TimeSan Jose, California, United States19h ago
-
C++ | Data Analysis | Data Manipulation | Data Processing | Deep learningSenior-level Full TimeMountain View, CA, USA20h ago
-
Algorithms | Audio Software | C++ | Debugging | Embedded SystemsSenior-level Full TimeMountain View, CA, USA20h ago