Manager, Software Engineering - Production AI Inference
Tasks
- Automate production workflows
- Drive new model onboarding
- Implement security readiness and hardening
- Improve observability and operational health
- Integrate serving stack for inference
- Lead production AI inference releases
- Manage issues and dependencies
- Manage release readiness and quality
- Perform performance profiling and optimization
- Plan roadmap and execution rhythm
Perks/Benefits
- N/A
Skills/Tech-stack
AI | Automation | cuBLAS | CUDA | cuDNN | CUTLASS | Distributed Systems | GPUDirect RDMA | Inference engine | Kubernetes | Language Models | Large Language Models | Machine Learning | NCCL | NVLink | Observability | Performance optimization | PyTorch | Security Hardening | TensorRT
Related jobs
- Senior Technical Program Manager, AI; USD 168K-322K; Agile | Aha! | Capacity Planning | Confluence | Data Analysis; Senior-level; Full Time; US, CA, Santa Clara; R; 1d ago
- Senior Product Manager - Agentic Data Analytics; USD 208K-379K; Cost estimation | Data Governance | Databases | Distributed Systems | Evaluation datasets; Benefits | Equity; Senior-level; Full Time; US, CA, Santa Clara; R; 2d ago
- Engineering Manager, LLM Performance; USD 224K-431K; API Development | C++ | CUDA | GPU Architecture | LLM Inference; Mid-level; Full Time; US, CA, Santa Clara; 10d ago
- C plus plus | CUDA | DMA Buffer | Databases | Drivers; Senior-level; Full Time; US, CA, Santa Clara; 14d ago
- Senior Manager, Engineering - AI Developer Tools; USD 272K-431K; Agile | Automation | Go | JavaScript | Python; Senior-level; Full Time; US, CA, Santa Clara; R; 17d ago
- Senior Manager, Robotics Quality Assurance; USD 216K-345K; Digital Twin | Embedded Software | Embedded software testing | Generative AI | Isaac Lab; Equity | Health benefits | Paid time off; Senior-level; Full Time; US, CA, Santa Clara; 17d ago
- Senior Technical Program Manager, Deep Learning Software; USD 168K-322K; Agile | Aha! | Capacity Planning | Confluence | Deep learning; Senior-level; Full Time; US, CA, Santa Clara; R; 18d ago
- Manager, Next-Gen AI Cluster Validation; USD 224K-356K; Ansible | Cluster architecture | Deep learning | Distributed Systems | Go; Mid-level; Full Time; US, CA, Santa Clara; R; 19d ago