Deep Learning Architect, LLM Inference - New College Grad 2026
US, CA, Santa Clara, United States
USD 124K-241K Senior-level Full Time
Tasks
- Build benchmarking methodologies
- Contribute to deep learning software projects
- Develop client-server LLM applications
- Develop profiling and analysis tools
- Guide inference serving direction
- Improve team efficiency with coding agents
- Optimize inference server performance
- Verify GPU product launch performance
- Characterize large language model inference workloads
Perks/Benefits
- N/A
Skills/Tech-stack
CPU performance | Compiler optimization | Data Visualization | Databases | Deep learning | GPU Performance | Inference Optimization | Language Models | Large Language Models | MCP | Microarchitecture | Model Inference | OpenAI API | Operating Systems | Profiling | PyTorch | SGLang | TRT-LLM | vLLM
Related jobs
- Senior-level Contract
  Baton Rouge, United States (7h ago)
  Skills/Tech-stack: Data Architecture | Data Governance | Data Modeling | Data Systems | Data Visualization
- Research Engineer - LLM Infra training - Seed Infra
  USD 244K-450K Mid-level Full Time
  San Jose, California, United States (9h ago)
  Skills/Tech-stack: Checkpointing | Data Analysis | Distributed Training | Fault Tolerance | GPU memory
- Research Engineer - LLM Infra training - Seed Infra
  USD 232K-427K Mid-level Full Time
  Seattle, Washington, United States (9h ago)
  Skills/Tech-stack: Checkpointing | Data-Driven Optimization | Data-driven | Deep learning | Distributed Training
- Mid-level Full Time
  Seattle, Washington, United States (9h ago)
  Skills/Tech-stack: Causal Inference | Cross-modal fusion | DPO | Data Modeling | Deep learning
- Machine Learning Engineer Graduate (E-Commerce Supply Chain & Logistics) - 2026 Start (BS/MS)
  USD 122K-256K Entry-level Full Time
  San Jose, California, United States (9h ago)
  Skills/Tech-stack: Data Mining | Deep learning | Knowledge graphs | Language Models | Language Processing
- Entry-level Full Time
  San Jose, California, United States (9h ago)
  Skills/Tech-stack: Agentic Systems | Architecture Design | Fine Tuning | Generative AI | Human Feedback
- Partner Engineering GenAI - US
  USD 140K-203K Senior-level Full Time
  Menlo Park, CA | Seattle, WA … (10h ago)
  Skills/Tech-stack: API Integration | Agent Orchestration | Artificial Intelligence | Bias Mitigation | C++
- Computer Science Research - US - IC5
  USD 166K-244K Mid-level Full Time
  Bellevue, WA | Menlo Park, CA (10h ago)
  Skills/Tech-stack: Data Pipelines | Deep learning | Experimentation | Generative Models | Image-to-video
  Perks/Benefits: Knowledge sharing | Mentoring | Open source contributions
- Senior-level Full Time
  Redmond, WA (10h ago)
  Skills/Tech-stack: API Design | Agentic Workflows | C plus plus | C# | Computer Vision
- Senior-level Full Time
  Mountain View, CA, USA (10h ago)
  Skills/Tech-stack: Algorithms | Authentication | C# | Cryptography | Data Structures
- Software Engineer III, AI/ML GenAI, Google Ads
  USD 147K-211K Senior-level Full Time
  Mountain View, CA, USA (10h ago)
  Skills/Tech-stack: C++ | Data Processing | Data Storage | Debugging | Distributed Computing
- 3D Perception Engineer - Autonomy (Droid)
  USD 180K-265K Mid-level Full Time
  South San Francisco, California, USA (17h ago)
  Skills/Tech-stack: 3D Geometry | Aerial survey | Autonomy | CNN | Camera Calibration
  Perks/Benefits: Bonus pay | Dental insurance | Equity compensation | Medical insurance | Paid time off
- Autonomy Perception Engineer - CV / 3D Reconstruction
  USD 180K-265K Mid-level Full Time
  South San Francisco, California, USA (17h ago)
  Skills/Tech-stack: 3D Reconstruction | Camera Calibration | Computer Vision | Convolutional Neural Networks | Data Annotation
  Perks/Benefits: Dental insurance | Equity compensation | Medical insurance | Paid time off | Vision insurance
- Machine Learning Engineer (Active Secret Clearance)
  USD 175K-205K Mid-level Full Time
  Schofield Barracks, Hawaii, United States (20h ago)
  Skills/Tech-stack: Agile | Algorithms | Asynchronous programming | CI/CD | Data Structures
  Perks/Benefits: 401k plan | FSA | HSA | Medical/Dental/Vision insurance | Paid disability insurance
- AI Algorithm Engineer Scientist
  USD 170K-240K Mid-level Full Time
  USA - CA - Santa Clara, … (21h ago)
  Skills/Tech-stack: Agentic Workflows | C# | C++ | CPU Optimization | CUDA
  Perks/Benefits: Health insurance | Hybrid work model | Paid time off | Retirement plan
- AI Solutions Architect
  USD 100K-110K Senior-level Full Time
  Colorado, United States (21h ago)
  Skills/Tech-stack: AI Governance | AIOps | AWS | Agentic AI | Agile
  Perks/Benefits: Discount programs | Fitness classes and recreation center access | Flexible health and dental options | Free RTD EcoPass | Generous paid time off
- Senior-level Full Time
  Mountain View, CA, USA; San Francisco, … (22h ago)
  Skills/Tech-stack: API Design | C++ | Data Mining | Deep learning | Feature Engineering
- Senior Machine Learning Engineer, AI Personalization
  USD 194K-343K Senior-level Full Time
  Bay Area, CA, United States of … (22h ago)
  Skills/Tech-stack: AWS | Agentic Engineering | Automated testing | Code generation | Data Experimentation
  Perks/Benefits: Flexible time off | Medical insurance | Modern family planning | Remote work | Retirement savings plans
- Data Analytics Analyst
  USD 172K-202K Mid-level Full Time
  New York, NY, United States (22h ago)
  Skills/Tech-stack: AWS | Computer Vision | Data Analysis | Data Pipelines | Deep learning
  Perks/Benefits: Backup childcare | Financial coaching | Health insurance | Mental health support | On-site health and wellness centers
- Senior-level Full Time
  Chicago, Illinois, USA R (22h ago)
- Systems Engineer - Data Analysis & Algorithms
  USD 120K-130K Entry-level Full Time
  Santa Clara, CA (23h ago)
  Skills/Tech-stack: Agile | Data Analysis | Data Modeling | Data Visualization | Git
  Perks/Benefits: 401k | Dental insurance | Employee referral program | Flexible spending account | Health savings account
- Senior-level Full Time
  Mountain View, CALIFORNIA, United States (23h ago)
  Skills/Tech-stack: Agentic AI | Information Retrieval | LLM Evaluation | Language Models | Language Processing
  Perks/Benefits: Flexible work environment | Health benefits | Remote work options
- Senior Embedded Systems Engineer
  USD 145K-235K Senior-level Full Time
  Goleta, CA, US (1d ago)
  Skills/Tech-stack: C# | C++ | CAN | Debugging | Digital Logic
  Perks/Benefits: 401k matching | Dental coverage | Employee stock ownership plan | Employer paid medical insurance | HSA contributions
- Applied AI ML Engineer Associate
  USD 175K-215K Senior-level Full Time
  Columbus, OH, United States (1d ago)
  Skills/Tech-stack: API Integration | Autogen | Big Data | CI/CD | CloudFormation
  Perks/Benefits: Backup childcare | Financial coaching | Health care coverage | Mental health support | Retirement savings plan
- Tech Lead, ML Engineer - AV Product engineering
  USD 175K-264K Senior-level Full Time
  Sunnyvale (1d ago)
  Skills/Tech-stack: Action models | C++ | CUDA | Closed Loop | Closed Loop Evaluation
  Perks/Benefits: Hybrid work policy | Mentorship opportunities | On-site collaboration | Work from home flexibility