Principal Deep Learning Communication Architect
US, CA, Santa Clara, United States
USD 272K-431K Senior-level Full Time
Tasks
- Co design communication primitives with application developers
- Collaborate on hardware and software co design for networking
- Define technical roadmap for communication libraries
- Design communication primitives and collective algorithms
- Develop analytical models and simulators for system behavior
- Ensure evolution of communication libraries for large language models
- Lead development and scaling for distributed deep learning
- Optimize communication for heterogeneous interconnects
Perks/Benefits
- N/A
Skills/Tech-stack
3D Parallelism | CUDA | Context Parallelism | Data parallelism | DeepSpeed | Expert parallelism | Infiniband | JAX | MPI | Megatron Core | NCCL | NVSHMEM | Pipeline parallelism | PyTorch | PyTorch distributed | RDMA | RoCE | SGLang | Tensor Parallelism | TensorRT-LLM | UCC | UCX | VLLM | XLA | Zero
Education
Regions
Countries
States
Cities
Related jobs
-
Technical Architect – AI, ML & Generative AI USD 142K-240KAWS Bedrock | AWS SageMaker | Agentic AI | Apache Spark | Artificial Intelligence401k | Critical Illness Accident Hospital Indemnity Identity Theft Protection | Dental plans | Life and Accidental Death and Dismemberment | Long-term disabilitySenior-level Full TimeFrisco, United States15h ago
-
AWS | Artificial Intelligence | Azure AI | Data Analysis | DatabricksBackup childcare | Financial coaching | Health care coverage | Mental health support | On-site health and wellness centersSenior-level Full TimeChicago, IL, United States1d ago
-
Algorithm Optimization | CI/CD | Data Modeling | Data queries | Deep Neural NetworksFlexible work arrangement | Learning opportunities | Relocation assistanceSenior-level Full TimeCASDRB08, United States1d ago
-
CI/CD | Deep learning | Docker | Git | MLOpsDental insurance | Disability insurance | Flexible spending accounts | Flexible work arrangements | Health insuranceSenior-level Full TimeCASDRB08, United States1d ago
-
AI Engineering Sr Director or VP, Data Science USD 128K-175KAI Platform | AWS SageMaker | Agent systems | Agentic AI | Azure MLCollaborative culture | Growth opportunities | Impactful technical work | Professional developmentSenior-level Full TimeColumbia, MD, United States1d ago
-
AI Solutions Engineer, East USD 125K-175KAWS | Azure | Cloud platform | Dspy | Generative AI401k plan | Dental insurance | Medical insurance | Mental wellness support | Parental leaveMid-level Full TimeRemote (New York) R1d ago
-
AWS | Airflow | Artificial Intelligence | Azure | CDISCExecutive-level Full TimeNorth Chicago, IL, United States1d ago
-
Senior-level ContractATLANTA, GA1d ago
-
Senior-level Full TimeErie, PA, United States1d ago
-
Principal Machine Learning USD 120K-220KAI Observability | AWS | AWS Bedrock | Agentic AI | Amazon SageMakerSenior-level Full TimeLivonia, MI, United States R2d ago
-
Senior Solutions Architect, Generative AI USD 184K-287KC# | C++ | Deep learning | Distributed Computing | GPUComprehensive benefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States2d ago
-
AI/ML Software Engineer USD 86K-198KAutomated testing | CI/CD | Containerization | Data Ingestion | Data PipelinesDependent care | Disability insurance | Health insurance | Life insurance | Paid leaveMid-level Full TimeUSA, NY, Rome (99 Otis St), …2d ago
-
CMC AI/ML and Automation Scientist USD 129K-209KAWS | Analytical technology | Azure | Big Data | Data ScienceSenior-level Full TimeUS: Indianapolis IN Tech Center North, …2d ago
-
Senior/Principal AI/ML Computational Scientist USD 156K-343KComputer Vision | Data Visualization | Deep learning | Image Processing | Image analysisRelocation benefitsSenior-level Full TimeSouth San Francisco, United States2d ago
-
Solution Architect (AI/LLM Inference) USD 165K-330KArtificial Intelligence | Benchmarking | Embeddings | GPU Selection | Image Generation401k company match | Fertility and family building stipend | Flexible PTO | Medical/Dental/Vision insurance | Paid parental leaveSenior-level Full TimeSan Francisco2d ago
-
Amazon Web Services | Anthropic | Cloud platform | Cohere | Google CloudCross-functional collaboration | Mentoring | Team research discussionsSenior-level Full TimeRemote, US or Europe R2d ago
-
Mid-level Full TimeSan Francisco, CA2d ago
-
A/B | A/B Testing | AWS | Agile | Artificial IntelligenceMid-level Full TimeSan Francisco, CA2d ago
-
Data Pipelines | Distributed Systems | MLOps | Machine Learning | Model DeploymentCompetitive benefits | Hybrid workMid-level Full TimeSunnyvale, California, United States2d ago
-
AI Research Engineer USD 152K-258KContrastive Learning | Deep learning | Distributed Computing | Fine Tuning | Generative AIDental insurance | Flexible-hybrid work | Health insurance | Relocation assistance | Retirement planMid-level Full TimePalo Alto, California, United States2d ago
-
Agile | LLMOps | Language Processing | MLOps | Machine LearningDental vision life insurance after eligibility period | Education and development opportunities | Medical coverage | Paid time off | Public retirement systemMid-level Full TimeUnited States of America-OHIO-Franklin County-Columbus3d ago
-
AI Developer Intern USD 42K-58KClassification | Clustering | Data Analysis | Data Preprocessing | Language ProcessingCompany culture | Feedback and evaluation | Full-time work experience | Meaningful projects | Mentorship and guidanceEntry-level Full Time InternshipChicago, Illinois, United States4d ago
-
AI Solutions Architect - East Region USD 185K-235KAI | AI Enterprise | Amazon Web Services | As-a-Service | CUDA401k plan with company matching | Employee assistance program | Health dental vision care | Life and disability insurance | Paid time offSenior-level Full TimeHartford, CT, United States4d ago
-
AI Solutions Architect - Global USD 185K-235KAI Enterprise | AI architecture | AWS | Artificial Intelligence | As-a-Service401k matching | Dental insurance | Employee assistance program | Health insurance | Paid time offSenior-level Full TimeHartford, CT, United States4d ago
-
AI Solutions Architect - Central Region USD 185K-235KAI Enterprise | AI architecture | AI infrastructure | AWS | Amazon Web Services401k matching | Bereavement leave | Disability insurance | Employee assistance program | Employee discountSenior-level Full TimeChicago, IL, United States4d ago