Find jobs in AI/ML, Data Science and Big Data
8 results
for DCGM
(Skill/Tech stack)
-
Reliability Engineer, Supercomputing USD 350K-475KBMC | Container Orchestration | DCGM | Debugging | Firmware ManagementDental benefits | Health benefits | Paid parental leave | Relocation support | Unlimited PTOMid-level Full TimeSan Francisco5d ago
-
Sr GenAI Infra Specialist SA, AWS WWSO Startup USD 153K-228KAWS Inferentia | AWS Trainium | Amazon Web Services | Batching | CUDASenior-level Full TimeNew York, New York, USA12d ago
-
Senior Embedded Software Engineer USD 179K-269KACPI | BMC | Bash | Bring-up | C#Career growth and learning opportunities | Collaborative culture | Flexibility | International environment | Opportunity to work on AI projectsSenior-level Full TimeRemote - United States R18d ago
-
Software Engineer - Reliability USD 200K-250KAWS | Bash | Cluster management | Containerization | DCGMSenior-level Full TimeSF Bay Area, CA, Remote, US R26d ago
-
Senior-level Full Time上海28d ago
-
Engineering Manager, Inference Benchmarking — AI Perf USD 224K-356KDCGM | Distributed Systems | GPU Telemetry | GPU observability | HelmSenior-level Full TimeUS, CA, Santa Clara, United States1mo ago
-
AI Platform Engineer INR 1500K-2500KAutomated Evaluation | CI/CD | CUDA | Continuous Checkpointing | Continuous batchingMid-level Full TimeBangalore, India1mo ago
-
Member of Technical Staff - Training Platform USD 150K-300KAnsible | DCGM | FastAPI | GPU Operator | GitOpsConference attendance | Professional development budget | Relocation support | Remote work | Team off-sitesSenior-level Full TimeSan Francisco1mo ago