Deep Learning Architect, LLM Inference - New College Grad 2026
US, CA, Santa Clara, United States
USD 124K-241K Senior-level Full Time
Tasks
- Build benchmarking methodologies
- Contribute to deep learning software projects
- Develop client server LLM applications
- Develop profiling and analysis tools
- Guide inference serving direction
- Improve team efficiency with coding agents
- Optimize inference server performance
- Verify GPU product launch performance
- Workload characterize large language model inference
Perks/Benefits
- N/A
Skills/Tech-stack
CPU performance | Compiler optimization | Data Visualization | Databases | Deep learning | GPU Performance | Inference Optimization | Language Models | Large Language Models | MCP | Microarchitecture | Model Inference | OpenAI API | Operating Systems | Profiling | PyTorch | SGLang | TRT-LLM | VLLM
Education
Regions
Countries
States
Cities
Related jobs
-
AWS | Cloud Run | Cloud platform | Django | DockerOn-site onlySenior-level Contract Full TimeSan Jose, CA, United States12h ago
-
AI Solution Architect – Enterprise AI Platform USD 112K-160KAI Services | AKS | API Development | API Management | Agent OrchestrationHybrid work flexibility | Paid time offSenior-level Full TimeDallas, TX, United States16h ago
-
Data Synthesis | Deep learning | Language Models | Language Processing | Large Language ModelsEntry-level InternshipSan Jose, California, United States17h ago
-
AWS | Alteryx | Amazon SageMaker | Azure | Azure DataMid-level Full TimeNew York, NY, United States17h ago
-
Strategic Intelligence & Advanced Analytics Engineer USD 108K-136KAnomaly Detection | Artificial Intelligence | Azure | Data Pipelines | Data QualityPaid parental leave | Paid time off | Public service loan forgiveness | Tuition reimbursement | Wellness programsMid-level Full TimeTexas-Dallas-5323 Harry Hines Blvd17h ago
-
Fine Tuning | GPU resource management | Intelligent agents | Language Models | Large Language ModelsEntry-level Full TimeSan Jose, California, United States17h ago
-
Senior Software Engineer, Database Internals, AlloyDB USD 174K-252KC# | C++ | Code optimization | Concurrency Control | Database InternalsEntry-level Full TimeSunnyvale, CA, USA18h ago
-
Artificial Intelligence | Data Analysis | Data Structures | Data structures algorithms | Human-in-the-loopSenior-level Full TimeMountain View, CA, USA18h ago
-
Agent tooling | Artificial Intelligence | C++ | Cloud Architecture | Conversational AISecret clearance | TravelSenior-level Full TimeAtlanta, GA, USA; Austin, TX, USA18h ago
-
Software Engineer III, AI/ML GenAI, Google Cloud Compute USD 147K-211KAudio generation | C++ | Computer Vision | Data Processing | DebuggingSenior-level Full TimeSunnyvale, CA, USA18h ago
-
Staff Software Engineer, Embedded Systems/Firmware, Platforms Infrastructure Engineering USD 207K-300KAlgorithms | Data Structures | Distributed Computing | Embedded Systems | Embedded operating systemsSenior-level Full TimeSunnyvale, CA, USA18h ago
-
Senior Software Engineer, Applied AI Commerce USD 174K-252KAutomated Evaluation | C++ | Cloud | Evaluation datasets | GeminiSenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA18h ago
-
Applied AI ML Lead - LLM SUITE ENGINEERING USD 176K-215KAPI Design | AWS | Agentic AI | Caching | Cloud NativeBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeWilmington, DE, United States1d ago
-
AI Data Engineer USD 120K-220KAgent memory | Amazon Web Services | Audio Processing | Batch Processing | Cloud infrastructureAccess to AI tools | Equity | Remote opportunitiesMid-level Full TimeSan Francisco Bay Area1d ago
-
Senior-level Full TimeRaleigh, NC, US1d ago
-
Senior AI Engineer USD 107K-199KAKS | API Design | Alerts | Anomaly Detection | Apache SparkHybrid work environment | Inclusion support | Learning opportunities | Well-being supportSenior-level Full TimeUSA, Massachusetts, Boston, 200 Berkeley Street, …1d ago
-
Entry-level Full TimeUnited States - Remote R1d ago
-
CI/CD | Docker | Drift Detection | Embeddings | Experiment trackingMentorship | Remote workSenior-level Full TimeUnited States - Remote R1d ago
-
Data Engineer USD 85K-141KAPI Gateways | CI/CD | Cloud Databases | Data Governance | Data Lakes401k retirement plan | Adoption Assistance | Flexible spending accounts | Health savings account | Parental leaveMid-level Full TimeClient Office: Aberdeen, MD, United States1d ago
-
Senior Data Engineer USD 82K-172KAWS | Apache Spark | Artificial Intelligence | BERT | BitbucketContinuing education | Family support benefits | Flexible time off | Healthcare benefits | Learning resourcesSenior-level Full Time606 KING OF PRUSSIA PA, United …1d ago
-
Staff AI/ML Engineer USD 108K-227KAWS | Adversarial Networks | Bitbucket | CUDA | CupyFlexible time off | Learning resources | MentoringSenior-level Full Time606 KING OF PRUSSIA PA, United …1d ago
-
Staff AI/ML Engineer (LLMs) USD 108K-227KAWS Bedrock | Agentic AI | Arize Phoenix | Bitbucket | CUDAFlexible time off | Learning and development resourcesSenior-level Full Time606 KING OF PRUSSIA PA, United …1d ago
-
Machine Learning Engineer II USD 131K-184KAzure | Batch inference | Data Pipelines | Databricks | Distributed SystemsContinuous learning | Flexible ways of working | Growth mindset cultureMid-level Full TimeUSA TX Houston Hybrid, United States R1d ago
-
Senior, Data Scientist (Machine Learning Engineer) USD 110K-220KAccessibility guidelines | Airflow | CI/CD | Computer Vision | Container OrchestrationSenior-level Full Time(USA) Crossman Respect Building CA SUNNYVALE …1d ago
-
Agentic AI Machine Learning Engineer USD 99K-225KAPI Integration | Cloud Computing | Computer Vision | Confluent | Deep learningDependent care | Disability insurance | Health insurance | Life insurance | Paid leaveMid-level Full TimeUSA, DC, Washington (901 15th St …1d ago