Senior Solutions Architect, Generative AI Deployment and AIOps
US, CA, Santa Clara, United States
USD 184K-287K Senior-level Full Time
Tasks
- Advise customers on generative AI and LLM inference
- Analyze inference performance and power efficiency
- Collaborate with engineering, product, and business teams
- Define high value AI solutions
- Deploy and optimize inference workloads on Kubernetes
- Manage GPU orchestration and MIG in Kubernetes
- Support MLOps adoption and implementation
Perks/Benefits
- N/A
Skills/Tech-stack
C plus plus | C# | Debugging | Deep learning | Distributed Computing | Docker | GPU | GPU MIG | GPU Orchestration | Inference Optimization | Kubernetes | LLM Inference | MLOps | Monitoring | Multi-Instance GPU | Multi-Instance GPU (MIG) | NVIDIA TensorRT | Observability | Parallel Computing | Profiling | PyTorch | Python | TensorFlow
Education
Roles
AI | AI Solutions | AI Solutions Architect | Architect | Solutions Architect
Regions
Countries
States
Cities
Related jobs
-
AI Solutions Architect USD 100K-130KAI Governance | Azure Data | Azure Data Platform | DAX | Data ArchitectureSenior-level Full TimeEast Granby, CT, US17h ago
-
Data Science Faculty Position USD 228K-515KArtificial Intelligence | Causal Inference | Computational Neuroscience | Computational modeling | Data ScienceMid-level Full TimeStanford University18h ago
-
Senior AI Engineer (Contract) USD 122K-156KAI evals | AWS | Agent Orchestration | Agent SDK | Agents SDKFlexible scheduling | Part-time contract | Potential contract extensionSenior-level ContractUnited States18h ago
-
HCL Informix | HCL Vector Blade | LLM | MCP | PythonDedicated buddy | Executive Networking | Intern events | Internship showcase | Onsite orientation travelEntry-level InternshipUS-Remote R19h ago
-
Senior-level Full TimeUS - Milpitas19h ago
-
AI Solutions Intern USD 60K-65KAPI Integration | Agent systems | Automation | GitHub | LangchainDedicated buddy | Executive Networking | Intern events | Internship showcase | Remote internshipEntry-level InternshipUS-Remote R19h ago
-
Principal Data Architect USD 165K-185KAWS | Amazon Bedrock | Amazon SageMaker | Business Intelligence | CI/CD401k match | Company laptop | Dental insurance | Equipment stipend | Flexible spending accountSenior-level Full TimeUSA R19h ago
-
Mid-level Full TimeSão Paulo, Brazil; Denver, CO; Austin, …19h ago
-
Director, Software Engineering - GenAI USD 186K-299KC++ | Cloud Computing | Data Quality | Distributed Systems | Docker401k | Dental insurance | Health insurance | Life insurance | Paid time offExecutive-level Full TimeBellevue, WA, United States20h ago
-
Advanced Technology: AI/ML Research Scientist USD 172K-250KAccess patterns | C# | Differential Equations | Generalization | Gradient methodsMid-level Full TimeSunnyvale, CA; Toronto, Ontario, Canada; Vancouver, …20h ago
-
Technical Director, Generative AI USD 115K-130KAPI | API Integration | Architecture | Automation Pipelines | Content Generation401k | Career growth | Dental insurance | Medical insurance | Paid HolidaysExecutive-level Full TimeNew York, United States21h ago
-
Principal Research Scientist II - Data Architecture, Biotherapeutics and Genetic Medicine USD 137K-203KAWS | Azure | Cloud Native | Cloud Native Architecture | Cloud platform401k | Dental insurance | Medical insurance | Paid time off | Vision insuranceMid-level Full TimeWorcester, MA, United States21h ago
-
Staff Software Engineer-AI Solutions USD 124K-198KAJAX | API Integration | Agent systems | Agentic AI | Business RulesSenior-level Full TimeHighlands Ranch, CO, United States21h ago
-
Intern, Information Technology (AI Engineer) USD 50K-50KAccess Control | Business Intelligence | Data Cataloging | Data Classification | Data GovernanceEntry-level InternshipCharlotte, NC United States21h ago
-
AI Engineer I USD 104K-156KAgentic AI | Apache Spark | Async Processing | Data Processing | Distributed SystemsMid-level Full TimeBoston, MA22h ago
-
Summer Internship - AI Researcher USD 64K-75KBrowser Automation | Caching | DOM Manipulation | Evaluation Pipelines | HCCCollaborative team | Guidance from senior leaders | Remote-first cultureEntry-level InternshipRemote, United States R22h ago
-
Summer Internship - AI Researcher USD 64K-80KAPI Design | Agent systems | Anthropic Claude | Human-in-the-loop | JavaScriptEntry-level InternshipRemote, United States R22h ago
-
AI Intern (Summer 2026) USD 36K-62KAWS | Azure | Computer Vision | Data Preprocessing | Data VisualizationFlexible class enrollment | Hands-on experience | Mentorship | University internship registration supportEntry-level Full Time InternshipOH, United States23h ago
-
Lead Software Engineer - AI Engineer USD 167K-215KAWS | Agile methodologies | Application Resiliency | Authentication | AutomationBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeTampa, FL, United States1d ago
-
Senior Software Architect (Robot Platform) USD 180K-230KApplication Programming | Application Programming Interfaces | CD pipelines | CI/CD | CI/CD pipelines401k match | Education assistance program | Flexible work arrangements | Health screenings | Medical/Dental/VisionSenior-level Full TimePleasanton, CA, United States1d ago
-
Staff Engineer, Innovation USD 175K-250KAWS Batch | AWS Bedrock | AWS Lambda | Agent Frameworks | Amazon S3Senior-level Full TimeUnited States (Remote) R1d ago
-
AI Engineering Intern USD 40K-60KAWS | Agent Frameworks | Agent systems | Application development | BedrockAccess to proprietary datasets | Collaborative work environment | MentorshipEntry-level InternshipBoston, MA1d ago
-
AI Agents | App Development | Azure | Embeddings | Generative AIOnsite Hybrid work arrangementSenior-level Contract Full TimeNew York City, New York, United …1d ago
-
AI Engineer USD 170K-250KAWS | Agent workflows | Autodesk | Azure | Azure DevOpsAdoption Assistance | Birthday off | Direct deposit paychecks | Educational reimbursement | Maternity paternity care plansMid-level Full TimeNaples, United States1d ago
-
Agent Frameworks | Azure | Embeddings | Generative AI | LangchainSenior-level Contract Full TimeNew York City, New York, United …1d ago