Senior Staff AI Platform Engineer
Tasks
- Architect and scale LLM/ML infrastructure
- Build LLM-aware monitoring and AI-assisted incident response
- Define AI-native infrastructure roadmaps
- Design observability for infrastructure health and model performance
- Develop automation and tooling for reliability and scalability
- Implement AI-assisted engineering practices
- Mentor engineers and promote an AI-first culture
- Translate platform capabilities into scalable product solutions
- Troubleshoot distributed systems and AI/ML scaling issues
Skills/Tech-stack
Access Management | Algorithms | Automation | C++ | Complexity Analysis | Data Structures | Distributed Systems | GPU | Go | Hugging Face | Identity and Access Management | Incident Response | Infrastructure as Code | Kubernetes | LLM | Logging | MLOps | Metrics | Model Serving | Network Segmentation | OWASP Top 10 | Observability | Python | Rust | Security Evaluation | Supply Chain Security | Tracing | Vulnerability Management | Weights and Biases