Software Engineer, GDC LLM Serving and GPU Performance
Sunnyvale, CA, USA; Seattle, WA, USA
USD 207K-301K | Senior-level | Full Time
Tasks
- Build performance analysis tooling
- Collaborate with research and SRE teams to deploy LLMs
- Design disaggregated serving architecture
- Enhance LLM serving stack
- Identify performance bottlenecks
- Profile and benchmark LLMs on GPU accelerators
Perks/Benefits
- N/A
Skills/Tech-stack
Benchmarking | Data Processing | Debugging | Distributed Computing | Fine Tuning | GPU Acceleration | Key Value Cache | LLM serving | Machine Learning | Model Deployment | Model Evaluation | Natural Language Processing | Performance Profiling | Reinforcement Learning | Resource allocation