Deep Learning Architect, LLM Inference - New College Grad 2026
Santa Clara, CA, United States
USD 124K-241K | Senior-level | Full Time
Tasks
- Build benchmarking methodologies
- Contribute to deep learning software projects
- Develop client-server LLM applications
- Develop profiling and analysis tools
- Guide inference serving direction
- Improve team efficiency with coding agents
- Optimize inference server performance
- Verify GPU product launch performance
- Characterize large language model inference workloads
Perks/Benefits
- N/A
Skills/Tech-stack
CPU performance | Compiler optimization | Data Visualization | Databases | Deep learning | GPU Performance | Inference Optimization | Language Models | Large Language Models | MCP | Microarchitecture | Model Inference | OpenAI API | Operating Systems | Profiling | PyTorch | SGLang | TRT-LLM | vLLM