Senior Deep Learning Software Engineer, LLM Performance
Santa Clara, CA, United States
USD 184K-356K | Senior-level | Full Time
Tasks
- Analyze and tune model latency and throughput
- Collaborate on GPU kernel development
- Contribute to open source LLM frameworks
- Develop and optimize inference benchmarking software
- Implement LLM inference serving and deployment algorithms
- Optimize LLM inference performance
- Scale LLM performance across GPU and edge architectures
- Work on performance modeling, profiling, and debugging
Skills/Tech-stack
C# | C++ | CUDA | GPU Programming | Generative AI | JAX | LLM | Latency optimization | Open Source | Performance Modeling | Performance Profiling | PyTorch | Python | TensorFlow | TensorRT | Throughput Optimization | Triton | VLM