Solutions Architect, Inference Deployments
USD 152K-241K Senior-level Full Time
Tasks
- Accelerate inference pipelines with TensorRT-LLM and related tools
- Build inference pipelines with NVIDIA Dynamo
- Collaborate with DevOps on Kubernetes orchestration
- Mentor customers and internal teams on deployment and resolution of complex issues
Perks/Benefits
Skills/Tech-stack
AI Inference | AI inference workloads | Disaggregated inference | GPU Operator | GPU Orchestration | GPU memory | GPU memory management | Inference Server | Inference acceleration | Inference workloads | Kubernetes | Low Latency | Low Latency Networking | Memory Management | Model Optimization | Multi-Instance GPU | NIM Operator | NVIDIA GPU | NVIDIA GPU Operator | Neural Networks | Nvidia Dynamo | Open Source | Open-source contributions | Quantization | RDMA | SGLang | Speculative decoding | TensorRT-LLM | Transformer Neural Networks | Triton Inference | Triton Inference Server | UCX | VLLM | WideEP
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Principal AI/ML Architect USD 168K-252KAWS AgentCore | AWS Athena | AWS Bedrock | AWS Glue | AWS SageMakerSenior-level Full TimeSan Francisco, CA10h ago
-
Senior-level Full TimeUSA, CA, San Diego (1615 Murray …1d ago
-
Senior Staff Embedded Systems Architect, IVI USD 163K-308KAPI Design | AUTOSAR | Automotive Ethernet | Automotive Networks | BLECommunity service paid time off | Employee resource groups | Flexible family care days | Immediate Work Visa Sponsorship | Medical dental vision prescription drug coverageSenior-level Full TimePalo Alto, CA, United States1d ago
-
AI infrastructure | Accelerator Virtualization | Container Runtime | Distributed Systems | GPUSenior-level Full TimeSeattle, WA, USA; Kirkland, WA, USA1d ago
-
Analytics Data Platform Architect USD 170K-200KAWS | Agile | Amazon Redshift | Analytics Integration | Automation401k match | Adoption Assistance | Healthcare Dental Vision | Long-term disability | Paid HolidaysSenior-level Full TimeSan Francisco, CA, United States2d ago
-
Principal AI Data Architect USD 159K-258KAPI Design | AWS | AWS Neptune | Agent systems | Artificial Intelligence401k savings plan | Adoption benefits | Career development | Disability benefits | Employee assistance programSenior-level Full TimeIrving, Texas, United States2d ago
-
Senior Solutions Architect, Autonomous Driving - GenAI USD 184K-356KAWS | Azure | C++ | CUDA-X | Computer VisionSenior-level Full TimeUS, CA, Santa Clara, United States2d ago
-
Principal AI Architect - Agentic Verticals USD 206K-451KC++ | Computer Vision | Hugging Face | Java | LangchainEmployee wellness benefits | Hybrid work | Remote work options | Work-life balanceSenior-level Full TimeSeattle (WA), United States2d ago
-
Principal AI Architect USD 150K-300KAWS | Apache Kafka | Apache Spark | Azure | CassandraContinuing education program | Continuous learning access | Family-friendly perks | Financial wellness programs | Generous time offSenior-level Full TimeUS - NY NYC - 55 …2d ago
-
Senior Software Engineer, Data Platform USD 168K-322KAWS | Azure | Data Lake | Data Warehousing | DockerSenior-level Full TimeUS, CA, Santa Clara, United States3d ago
-
Data Architect USD 150K-160KAI | AWS | Azure | Cloud Architecture | ContainerizationCareer growth | Flexible work | Global exposure | Learning opportunitiesSenior-level Full TimeUnited States3d ago
-
AI Architect USD 121K-204KCloud Architecture | Data analytics | Docker | Keras | KubernetesCareer growth opportunities | Onsite | Security clearance sponsorshipSenior-level Full TimeUSA-VA-Arlington3d ago
-
AI Development and Platform Engineer Officer USD 70K-118KAI Document Intelligence | API Management | AWS Agent Core | AWS Bedrock | AWS LambdaDental insurance | Disability insurance | Employee assistance program | Family care support | Health insuranceSenior-level Full TimeQuincy, Massachusetts, United States4d ago
-
Director II, Engineering USD 225K-274KCloud Native | Containerization | Distributed Systems | Java | KubernetesExecutive-level Full Time*Job Posting Only: USA14d ago
-
Apache Spark | Cloud Platforms | Data Engineering | Data Science | Data analyticsFlexible work arrangements | Health insurance | Professional development opportunities | Travel opportunitiesSenior-level Full TimeAtlanta, Georgia; Chicago, Illinois; New York; …4d ago
-
Apache Spark | Big Data | Big Data Technologies | Cloud Platforms | Data ArchitectureFlexible work hours | Health insurance | Professional development opportunities | Travel opportunitiesSenior-level Full TimeNew Jersey; New York4d ago
-
AI Solutions Architect USD 180K-220KAI | AI/ML Services | Cloud AI | Cloud AI/ML | Cloud AI/ML servicesSenior-level Full TimeNew York, NY, United States5d ago
-
AI Solutions Architect USD 180K-220KAI/ML | AI/ML Services | AI/ML methodologies | Cloud AI | Cloud AI/MLFlexible work hours | Professional Development BenefitsSenior-level Full TimeNew York, NY, United States5d ago
-
Sr Architect USD 160K-200KAgile | Artificial Intelligence | Cybersecurity | Data Structures | Data VisualizationSenior-level Full TimeUS - NEW JERSEY CLIENT SITE, …5d ago
-
AI Architect USD 127K-244KAI architecture | AI ethics | AWS | Agent Orchestration | AzureFlexible working hours | Onsite workplace | Travel supportSenior-level Full TimeChantilly, VA, USA, 201515d ago
-
Head of Data Platform Engineering, MD USD 170K-267KCI/CD | Cloud Architecture | Data Governance | Data Lakes | Data ModelingFlexible work | Health insurance | Paid time off | Professional development | Retirement planExecutive-level Full TimeCR1 - 700 District, United States5d ago
-
Senior Product Architect, Storage USD 224K-356KAI infrastructure | Cache Management | DPU programming | Fabric design | High PerformanceBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States8d ago
-
Digital S/W Eng Lead Analyst -Vice President USD 125K-188KAI/ML | CI/CD | Claude | Docker | Embeddings401k | Dental | Disability insurance | Life insurance | MedicalSenior-level Full Time3800 CITIGROUP CENTER DRIVE BUILDING C …9d ago
-
AI Architect USD 152K-209KDSP | Deep learning | Distillation | GPU | Hardware optimizationFlexible work | Health insurance | Paid time off | Retirement plansSenior-level Full TimeAtlanta, US9d ago
-
Technical Staff-Network Architect USD 230K-297KAI Fabrics | Data Processing | Data Processing Units | Data center | Data center networkingCareer growth opportunities | Flexible work schedule | Health benefitsSenior-level Full TimeRound Rock, Texas, United States, United …9d ago