Senior Solutions Architect, Generative AI Deployment and AIOps
US, CA, Santa Clara, United States
USD 184K-287K Senior-level Full Time
Tasks
- Advise customers on generative AI and LLM inference
- Analyze inference performance and power efficiency
- Collaborate with engineering, product, and business teams
- Define high value AI solutions
- Deploy and optimize inference workloads on Kubernetes
- Manage GPU orchestration and MIG in Kubernetes
- Support MLOps adoption and implementation
Perks/Benefits
- N/A
Skills/Tech-stack
C plus plus | C# | Debugging | Deep learning | Distributed Computing | Docker | GPU | GPU MIG | GPU Orchestration | Inference Optimization | Kubernetes | LLM Inference | MLOps | Monitoring | Multi-Instance GPU | Multi-Instance GPU (MIG) | NVIDIA TensorRT | Observability | Parallel Computing | Profiling | PyTorch | Python | TensorFlow
Education
Roles
AI | AI Solutions | AI Solutions Architect | Architect | Solutions Architect
Regions
Countries
States
Cities
Related jobs
-
AI Data Platform Lead USD 164K-229KAWS | Airflow | Audit Logging | DBT | Data GovernanceFloating holidays | Wellness daySenior-level Full TimeUnited States9h ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KCMME | MATLAB | NumPy | Pandas | PythonFlexible schedule | Part-time project-based work | Project-based compensationSenior-level FreelanceUnited States - Remote R15h ago
-
Statistics & Python Expert - Freelance AI Trainer USD 146K-146KC# | Combinatorics | Graph theory | MATLAB | NumPyFlexible hours | Part-time opportunities | Project based workSenior-level FreelanceTexas, United States - Remote R15h ago
-
API Integration | Agent systems | Asynchronous processing | Chunking | Cost OptimizationCompetitive salary based on experience | High-impact role | Opportunity to scale AI systems | Strong ownershipMid-level Full TimeAustin, Texas, United States - Remote R15h ago
-
AI Full Stack Engineer - KS001 USD 160K-225KAds API | Agent systems | Anthropic Claude | Cost monitoring | EmbeddingsHigh-impact role | Strong ownershipMid-level Full TimeAustin, Texas, United States - Remote R15h ago
-
Data Scientist Lead USD 175K-210KAWS | Apache Spark | Data Governance | Data Modeling | DatabricksBackup childcare | Financial coaching | Health care coverage | Mental health support | Onsite wellness centersSenior-level Full TimeOH, United States17h ago
-
Senior-level Full TimePalo Alto17h ago
-
Lead AI Engineer - AI & Credit Analytics USD 156K-234KAWS | CI/CD | Data Governance | Generative AI | LLMOpsFlexible time off | Flexible work environment | Hybrid work option | Matching 401k | Medical/Dental/Vision insuranceSenior-level Full TimeCosta Mesa, CA, United States R17h ago
-
Senior-level Full TimePalo Alto17h ago
-
Research Intern, AI & Visual Computing USD 80K-159KARVR | C++ | Computer Graphics | Computer Vision | Image ModelsEntry-level InternshipAustin, TX, US18h ago
-
AI Software Engineer - Greenwood Village, CO Office USD 80K-120KAI Agents | API | Automation | C# | Computer VisionCollaborative environment | Comprehensive benefits package | Employee ownership | Flexible workplace | Innovative cultureEntry-level Full TimeGreenwood Village, Colorado, United States R18h ago
-
Principal Engineer -- AI Architect USD 170K-241KAgentic AI | BigQuery | CI/CD | Cloud Computing | Context engineeringSenior-level Full TimeNew York City, New York, United …19h ago
-
Sr. Delivery Acceleration AI Engineer USD 146K-241KA/B | A/B Testing | AI Agent | AI agent orchestration | API DesignSenior-level Full TimeAustin , Texas, United States19h ago
-
Sr. Delivery Acceleration AI Engineer USD 146K-241KA/B | A/B Testing | AI Agent | AI Agent Development | AI agent orchestrationSenior-level Full TimeBoston , Massachusetts, United States19h ago
-
Sr. Delivery Acceleration AI Engineer USD 137K-241KA/B | A/B Testing | AI Agents | API Design | Artificial IntelligenceSenior-level Full TimeAtlanta, Georgia , United States19h ago
-
Mid-level Full TimeKing George, VA, United States23h ago
-
Senior-level Full TimeDallas, TX, United States1d ago
-
Artificial Intelligence | Competency Mapping | Content Review | Curriculum Development | Data ScienceCross-cultural work experience | Free in-house training | Networking opportunities | Opportunities to develop as a public expert | Remote workSenior-level Part TimeBoston, US1d ago
-
Artificial Intelligence | Data Science | Language Models | Large Language Models | Machine LearningFree training | Networking | Professional development opportunities | Remote workSenior-level Part Timegeorgia, georgia, GE1d ago
-
.NET | AI Foundry | AI Search | Application Insights | Azure AISenior-level Full TimeNew York, New York, United States1d ago
-
AI Governance | Agent systems | Architecture | Context engineering | Data SovereigntySenior-level Full TimeChicago, IL, USA; Atlanta, GA, USA1d ago
-
API Integration | Asynchronous processing | Chatbots | Deep learning | Distributed Systems100% remote | Flexible scheduleMid-level Full TimeAnnapolis, Maryland, United States R1d ago
-
Support Systems Architect USD 216K-240KAutomation | ChatGPT | Dashboards | Data Pipelines | ETLHybrid work model | Relocation assistanceSenior-level Full TimeSan Francisco1d ago
-
Databricks Solutions Architect USD 180K-248KAWS | Apache Spark | Azure | Cloud Native | Cloud platformCareer development | Health insurance | Life insurance | Long-term incentive programs | Retirement planSenior-level Full TimeChicago, IL1d ago
-
Staff ML Architect / Lead consultant USD 150K-226KAI Agents | Apache Airflow | Apache Spark | CI/CD | Fine TuningSenior-level Contract Full TimeChicago, IL1d ago