Deep Learning Kernel Software Performance Architect - New College Grad 2026
US, CA, Santa Clara, United States
USD 124K-241K Senior-level Full Time
Tasks
- Analyze and debug using analytical models, simulators, and test suites
- Collaborate with CUDA and AI Compiler teams on performance issues
- Collaborate with hardware architecture performance teams on emerging features
- Debug deep learning and data analytics performance bottlenecks
- Develop scripts and tools for analysis, visualization, and debugging
- Optimize AI/ML training and inference performance layers
- Validate performance of GPU-accelerated architectures
Skills/Tech-stack
AI Compiler | C# | C++ | CUDA | Computer Architecture | Deep learning | GPU Computing | Machine Learning | Parallel Programming | Performance Profiling | Performance debugging | Python