Senior Deep Learning Architect, LLM Inference
Santa Clara, CA, United States
USD 184K-356K | Senior-level | Full Time
Tasks
- Build inference content
- Contribute to deep-learning projects
- Develop performance website
- Establish benchmarking methodologies
- Guide inference serving strategies
- Improve team efficiency with inference tech
- Invent profiling tools
- Verify GPU performance
- Characterize workloads
Skills/Tech-stack
AI Coding Agents | AI coding | Client-Server | Client-server applications | Coding Agents | Compiler optimization | Deep learning | Deep learning inference | Deep learning inference serving | Frameworks | GPU microarchitecture | Generative AI | Inference Serving | MCP | OpenAI API | Performance optimization | Profiling | PyTorch | Schedulers | Server applications | Systems Development