Platform Support Architect
USD 175K-200K | Senior-level | Full Time
Tasks
- Author and maintain support triage runbooks and checklists
- Build hands-on labs and proofs of concept for RAG and agentic AI use cases
- Collaborate to align reference architectures and best practices across teams
- Collect and interpret logs and telemetry; create minimal repros and defect reports
- Define and validate unified diagnostics bundles
- Develop reusable technical assets, implementation guides, and best-practice playbooks
- Diagnose performance bottlenecks in RAG and agentic AI workflows
- Perform end-to-end triage across GPU, NVAIE, vector DB, Kubernetes, Docker, networking, and storage
- Provide NVIDIA AI Enterprise and vector database support for customer environments
- Provide field feedback to product management and engineering on compatibility, upgrade, rollback, and observability needs
Perks/Benefits
- N/A
Skills/Tech-stack
CI/CD | CUDA | Canary Deployment | Ceph | Ceph RBD | Docker | Elasticsearch | Embeddings | GPFS | GPU Operator | Grafana | Helm | Inference Server | Infinia | InfiniBand | Ingestion pipelines | Kubernetes | Linux | Lustre | MLOps | Milvus | NFS | NVIDIA GPU | NVIDIA GPU Operator | NVIDIA NeMo | NVIDIA NIM | Observability | Prometheus | Prompt engineering | RAG | RDMA | Reranking | Retrieval | Rollback | S3 | SMB | TensorRT | Triton Inference | Triton Inference Server | Vector Database
Education
N/A