Software Engineer, GDC LLM Serving and GPU Performance
Tasks
- Benchmark LLM model performance on GPUs
- Build performance analysis tooling
- Design disaggregated serving architecture
- Enhance LLM serving stack
- Identify performance bottlenecks
- Optimize and deploy LLMs in production
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | Data Processing | Data Storage | Deep learning | Distributed Computing | Fine Tuning | GPU Performance | Key Value Cache | Key-value | Language Models | Language Processing | Large Language Models | Machine Learning | Model Debugging | Model Deployment | Model Evaluation | Natural Language | Natural Language Processing | Networking | Performance Profiling | Reinforcement Learning | Software Architecture | System design
Education
Bachelor of Engineering | Bachelor of Science | Master of Science | PhD
Roles
Regions
Countries
States
Cities
Related jobs
-
Senior Data Engineer USD 165K-180KAPIs | Anomaly Detection | Azure | Azure Data | Azure Data FactorySenior-level Full TimeWork from home, VA, United States R10h ago
-
Senior Director, AI / Machine Learning Software Engineer USD 136K-300KApache Flink | Apache Spark | CI/CD | Data Lineage | Data PrivacyHealth benefits | Paid leave | Paid volunteer timeSenior-level Full TimeNew York, NY, United States13h ago
-
Computer Vision | Data Analysis | Language Models | Language Processing | Large Language ModelsSenior-level Full TimeSeattle, Washington, United States14h ago
-
Classification Algorithms | Data Analysis | Deep learning | Language Models | Language ProcessingSenior-level Full TimeSan Jose, California, United States14h ago
-
Data Engineering | Machine Learning | Machine Learning Pipelines | Python | Recommendation SystemsSenior-level Full TimeSan Jose, California, United States14h ago
-
Data Pipelines | Full Stack | Full-Stack Development | Machine Learning | PythonSenior-level Full TimeSan Jose, California, United States14h ago
-
C++ | Data Analysis | Data Manipulation | Data Processing | Deep learningSenior-level Full TimeMountain View, CA, USA15h ago
-
Algorithms | Audio Software | C++ | Debugging | Embedded SystemsSenior-level Full TimeMountain View, CA, USA15h ago
-
Software Engineer, Machine Learning USD 207K-300KC++ | Data Processing | Experimentation | Information Retrieval | Just-in-TimeSenior-level Full TimeNew York, NY, USA; Mountain View, …15h ago
-
Software Engineer III, AI/ML, Search News Intelligence USD 147K-211KAlgorithms | C++ | Data Processing | Data Structures | Feature EngineeringSenior-level Full TimeMountain View, CA, USA15h ago
-
Software Engineer Manager II, Embedded Systems, Firmware USD 207K-300KAgile project management | Automated testing | C++ | Direct memory access | Embedded operating systemsSenior-level Full TimeSunnyvale, CA, USA15h ago
-
Artificial Intelligence | C++ | CSS | Data Storage | Distributed ComputingSenior-level Full TimeNew York, NY, USA15h ago
-
Customer Engineer, Data Analytics, Google Cloud USD 153K-222KBatch Processing | Big Data | Cloud Architecture | Cloud platform | Customer RequirementsSenior-level Full TimeSunnyvale, CA, USA15h ago
-
C++ | Data Processing | Debugging | Information Retrieval | Language ModelsSenior-level Full TimeMountain View, CA, USA15h ago
-
Algorithms | C++ | Cloud Computing | Cloud platform | Data StructuresSenior-level Full TimeSunnyvale, CA, USA15h ago
-
Cloud Data and AI Engineer, Professional Services USD 127K-183KC++ | Capacity Planning | Cloud Databases | Data Migration | Data PipelinesTravel up to 30 percentMid-level Full TimeReston, VA, USA15h ago
-
Staff Software Engineer, ML Frameworks USD 207K-300KAPIs | Data Processing | Debugging | Fine Tuning | GPU AccelerationSenior-level Full TimeMountain View, CA, USA15h ago
-
Data Scientist USD 67K-150KA/B | A/B Testing | B testing | Clustering | Drift monitoring401k plan | AD and D insurance | Child Life Insurance | Dental insurance | Educational Assistance PlanMid-level Full TimeUnited States17h ago
-
Quantitative Research & Model, Senior Advisor USD 155K-237KAgile | Artificial Intelligence | Cloud Computing | Energy Markets | Exotic options401k match | Dental insurance | Health insurance | Hybrid work | Life insuranceSenior-level Full TimeHOUSTON, US, 7705617h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeDiversity and Inclusion Commitment | Flexible asynchronous culture | Laid-back atmosphere | Portfolio and LinkedIn supportMid-level Full TimeBellevue, WA, USA20h ago
-
Bash | Cloud platform | Data Ingestion | Data Processing | DockerAsynchronous work culture | Flexible management | Remote-friendly, distributed teamMid-level Full TimeDenver, CO, USA20h ago
-
Bash | Cloud platform | Data Processing | Docker | Google CloudMid-level Full TimePalo Alto, CA, USA21h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous culture | Flexible work environment | Remote-friendlyMid-level Full TimeLos Angeles, CA, USA21h ago
-
Bash | Data Processing | Docker | GCP | LinuxAsynchronous culture | Flexible remote workMid-level Full TimeKirkland, WA, USA21h ago
-
Bash | Data Processing | Docker | GCP | Infrastructure as CodeAsynchronous culture | Flexible remote work | Work on impact driven productsMid-level Full TimeSan Jose, CA, USA21h ago