Solutions Architect, Inference Deployments
USD 152K-241K Senior-level Full Time
Tasks
- Accelerate inference pipelines with TensorRT-LLM and related tools
- Build inference pipelines with NVIDIA Dynamo
- Collaborate with DevOps on Kubernetes orchestration
- Mentor customers and internal teams on deployment and resolution of complex issues
Perks/Benefits
Skills/Tech-stack
AI Inference | AI inference workloads | Disaggregated inference | GPU Operator | GPU Orchestration | GPU memory | GPU memory management | Inference Server | Inference acceleration | Inference workloads | Kubernetes | Low Latency | Low Latency Networking | Memory Management | Model Optimization | Multi-Instance GPU | NIM Operator | NVIDIA GPU | NVIDIA GPU Operator | Neural Networks | Nvidia Dynamo | Open Source | Open-source contributions | Quantization | RDMA | SGLang | Speculative decoding | TensorRT-LLM | Transformer Neural Networks | Triton Inference | Triton Inference Server | UCX | VLLM | WideEP
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Lead Architect , AI Solutions Architecture - PI USD 169K-279KAI Act | AI Foundry | AI RMF | AI Services | API Integration401k match | Health insurance | Mental health counseling | Paid Holidays | Paid time offSenior-level Full TimeHartford - Tower, United States21h ago
-
Lead Architect , AI Solutions Architecture - EDDA USD 169K-279KAI RMF | AWS Bedrock | Agentic AI | Artificial Intelligence | CI/CD401k match | Health insurance | Paid time off | Volunteer rewards | Wellness programSenior-level Full TimeHartford - Tower, United States21h ago
-
Senior Architect, AI Solutions Architecture - ETS USD 139K-230KAWS Bedrock | Agent Orchestration | Artificial Intelligence | CI/CD | Crew AI401k match | Health insurance | Mental health counseling | Paid Holidays | Paid time offSenior-level Full TimeHartford - Tower, United States21h ago
-
Lead Architect , AI Solutions Architecture - Claim USD 169K-279KAI Act | AI Ops | AWS Bedrock | Agent Orchestration | Agentic AI401k match | Employee assistance program | Health insurance | Matching gift program | Paid time offSenior-level Full TimeHartford - Tower, United States21h ago
-
Lead Architect , AI Solutions Architecture - BI/INTL USD 169K-279KAI Act | AI Ops | AI RMF | AI Services | AWS Bedrock401k match | Free counseling services | Health coaching | Health insurance | Matching giftSenior-level Full TimeHartford - Tower, United States21h ago
-
Senior Architect, AI Solutions Architecture - CorpTech USD 139K-230KAI Governance | AI Ops | AWS Bedrock | Agent Orchestration | Agentic AI401k matching | Health insurance | Mental health counseling | Paid Holidays | Paid time offSenior-level Full TimeHartford - Tower, United States21h ago
-
Principal Data Architect USD 197K-337KApache Flink | Apache Kafka | Apache Spark | Artificial Intelligence | Cloud ComputingCareer development opportunities | Learning and development programs | Mentorship | Remote workSenior-level Full TimeChicago, Illinois, USA R1d ago
-
Data Platform Architect - Applied Field Engineering USD 207K-271KAWS | Azure | Cloud platform | Data Modeling | DockerSenior-level Full TimeUS-NY-New York1d ago
-
Advisory AI Architect (FDE Unit) USD 249KArtificial Intelligence | Automated testing | CI/CD | Cloud Migration | DevOpsKnowledge transfer | Mentorship | Remote work | Travel opportunitiesSenior-level Full TimeChicago, Illinois, United States, United States1d ago
-
Enterprise Application and Data Architect USD 133K-207KAPI Design | Apache Spark | Application Architecture | Automated testing | AzureContinuous learning | Health benefits | Retirement planning | Time offSenior-level Full TimeBloomington, MN, United States2d ago
-
Data Architecture - Sr Advisor II USD 110K-186KAES 256 | AWS | Agile | Application development | Best practicesSenior-level Full TimeMilwaukee, Wisconsin, United States2d ago
-
Advisory AI Architect (FDE Unit) USD 249KArtificial Intelligence | Automated testing | CI/CD | Cloud Migration | Data PipelinesHealth benefits | Remote work | Travel for customer projectsSenior-level Full TimeRound Rock, Texas, United States, United …2d ago
-
Principal Data Platform and Software Engineering, VP - State Street Investment Management USD 120K-217KAWS | Agentic AI | CI/CD | Capacity Planning | Cloud Architecture401k company match | Dental insurance | Employee assistance program | Employee networks | Health insuranceSenior-level Full TimeQuincy, Massachusetts, United States2d ago
-
Senior-level Full TimeInnovation Point, United States2d ago
-
Director - Microsoft Cloud & AI Solution Architecture (Communications, Media & Technology) USD 225K-240KARM Templates | Ansible | App Services | Azure DevOps | Azure Firewall401(k) plan matching | Bereavement leave | Disability insurance | Employee assistance program | Employee discount programSenior-level Full TimeLos Angeles, CA, United States R3d ago
-
.Net Core | ARM/Bicep | Ansible | App Service | Application Architecture401k plan with company matching | Disability insurance | Employee assistance program | Health, dental, vision coverage | Life insuranceSenior-level Full TimeNew York, NY, United States R3d ago
-
Principal AI/ML Architect USD 165K-185KAWS Lambda | Airflow | Amazon Bedrock | Amazon Kinesis | Amazon OpenSearch401k with company match | Company issued laptop | Dental insurance | Equipment and office stipend | Flexible spending accountSenior-level Full TimeUSA R3d ago
-
Senior AI Solution Architect USD 124K-168KAWS | AWS Bedrock | AWS CDK | AWS Inferentia | AWS Trainium401k | Certification reimbursement | Cruise and travel privileges | Employee stock purchase plan | Health benefitsSenior-level Full TimeSeattle, WA, United States3d ago
-
Data Analyst - TC - Data & Analytics - Data Arch & Eng - FSO - Manager - Mul Pos - 1699535 USD 178K-178KAWS | Amazon Redshift | Apache Spark | Apache Spark Streaming | Azure Synapse401k plan | Continuous learning | Flexible vacation policy | Hybrid work model | Medical and dental coverageMid-level Full TimeCharlotte, NC, US, 28202 R3d ago
-
Data Analyst-Tech Con - D & A - Data Arch & Eng-FSO-Senior Manager-Multiple Positions - 1698174 USD 200KAWS | Apache Kafka | Azure | Azure Synapse | Cassandra401k plan | Dental coverage | Flexible vacation policy | Hybrid work model | Medical coverageSenior-level Full TimePhoenix, AZ, US, 85004 R3d ago
-
AI Solutions Architect USD 149K-248KAirflow | Anomaly Detection | Azure | Azure Machine Learning | Cause analysisSenior-level Full TimeDurham Blackwell Street, United States3d ago
-
Solutions Architect, LLM Model Builder USD 152K-241KBenchmarking | CUDA | Compression | Distillation | EvaluationSenior-level Full TimeUS, CA, Santa Clara, United States3d ago
-
Solutions Architect, LLM Model Builder USD 152K-241KBenchmarking | CUDA | Compression | Distillation | EvaluationSenior-level Full TimeUS, CA, Santa Clara, United States3d ago
-
Senior Solutions Architect, Generative AI Specialist USD 184K-287KAI Evaluation | Agentic AI | Audio Processing | CUDA | Data GenerationSenior-level Full TimeUS, CA, Santa Clara, United States3d ago
-
Senior Solutions Architect, Generative AI Specialist USD 184K-356KAI Observability | Agent systems | Agentic AI | CUDA | Container OrchestrationComprehensive benefitsSenior-level Full TimeUS, CA, Santa Clara, United States3d ago