Software Engineer, SystemML - Scaling / Performance
Tasks
- Develop performance benchmarks and tuners
- Enable reliable scalable distributed ML training
- Improve distributed GPU communication performance
Perks/Benefits
- N/A
Skills/Tech-stack
CUDA | Distributed Systems | GPU Architectures | HPC | NCCL | Performance optimization | Pipeline Parallel | PyTorch | Tensor Parallel
Education
Roles
Regions
Countries
States
Cities
Related jobs
-
Tech Risk and Control [Multiple Positions Available] USD 173K-215KAI | AWS | AWS SageMaker | Aqua Security | AzureFinancial coaching | Health care coverage | Mental health support | On-site wellness | Retirement planSenior-level Full TimePlano, TX, United States7h ago
-
Senior Machine Learning Ops Engineer, ML System USD 136K-359KDistributed Systems | GPU | Machine Learning | NPU | Performance AnalysisGlobal collaboration | International teamSenior-level Full TimeSan Jose, California, United States7h ago
-
Software Engineer III, Cloud Assist USD 147K-211KAI frameworks | Artificial Intelligence | Cloud Computing | Distributed Systems | Machine LearningBenefits | Bonus | EquitySenior-level Full TimeSan Francisco, CA, USA8h ago
-
AI | Algorithms | BigQuery | C++ | Data ProcessingBenefits | Bonus | EquitySenior-level Full TimeMountain View, CA, USA8h ago
-
Software Engineer III, AI/ML GenAI, GCP, Performance USD 147K-211KC++ | Distributed Systems | JAX | Java | Language ProcessingBenefitsSenior-level Full TimeSunnyvale, CA, USA8h ago
-
Senior Software Engineer, GenAI Infrastructure, Cloud AI USD 174K-252KC++ | Cloud APIs | Distributed Systems | Machine Learning | Machine Learning InfrastructureBenefits | Bonus | EquitySenior-level Full TimeSunnyvale, CA, USA; Kirkland, WA, USA8h ago
-
Customer Engineer II, AI Infrastructure, Google Cloud USD 127K-184KAI Infrastructure design | AI infrastructure | Cloud infrastructure | Infrastructure Design | KubernetesBenefitsMid-level Full TimeSunnyvale, CA, USA; Los Angeles, CA, …8h ago
-
Computer Vision | Data Management | Deep learning | Edge AI | Experiment trackingFlexible scheduling | Professional development opportunitiesSenior-level Full TimeBaltimore, Maryland20h ago
-
AI Governance | CI/CD | Cloud Platforms | Dask | Data EngineeringBenefits | Health coverage | Inclusive environmentSenior-level Full TimeNew York, NY, United States20h ago
-
AI Software Engineer USD 125K-170K.NET | .Net Core | AI | ASP.Net Core | ConfluenceHealth insurance | Paid time off | Professional development opportunities | Retirement plans | Work-life balanceMid-level Full TimeUSA-Colorado, United States20h ago
-
Applied Research Scientist / Engineer USD 200K-300KData Curation | Deep learning | Diffusion Models | Distributed Systems | Fine TuningMid-level Full TimePalo Alto, CA, London, UK, Seattle, …1d ago
-
Research Scientist / Engineer – Training Infrastructure USD 200K-300KCUDA | Containerization | Distributed Systems | GPU clusters | LinuxSenior-level Full TimePalo Alto, CA, Remote - International, … R1d ago
-
Big Data | Cloud Computing | Distributed Systems | Hadoop | Hive401k match | Community engagement | Leave buy-back | Medical/Dental/Vision | Profit sharingMid-level Full TimeFt. Meade, Maryland1d ago
-
Principal Software Engineer - Python USD 120K-200KAI | API Development | Cloud services | Distributed Systems | DockerSenior-level Full TimeBoston, Massachusetts, United States1d ago
-
Sr. Back-End Software Engineer - Machine Learning USD 170K-250KC++ | Computer Vision | Linux | Machine Learning | NLP401k matching | Commuter benefits | Dependent coverage | Employee referral program | EquitySenior-level Full TimeSanta Clara, CA1d ago
-
Software Engineer, New Grad USD 120K-150KCloud technologies | Databases | Distributed Systems | Python | RustCollaborative environment | MentorshipEntry-level Full TimeSan Francisco1d ago
-
Senior Software Engineer - ML Infrastructure USD 232K-283KData Pipelines | Distributed Systems | ML Ops | ML Platforms | Machine LearningSenior-level Full TimeSan Francisco1d ago
-
Software Engineer, Infrastructure USD 180K-300KAWS | Azure | Bash | Distributed Systems | GCP401k plan | Free Lunches and Snacks | Health benefits | Learning stipend | Relocation assistanceSenior-level Full TimeRedwood City1d ago
-
Machine Learning Operations Engineer USD 162K-219KAWS | AWS SageMaker | CICD | CVAT | DevOpsCollaborative work culture | Dental insurance | Health insurance | Professional growth opportunities | Vision insuranceMid-level Full TimeBoston, Massachusetts1d ago
-
Data Wrangling | Database Administration | Git | Jupyter Notebooks | PyTorchDental insurance | Disability insurance | Healthcare benefits | Life insurance | Professional developmentSenior-level Full TimeFort Meade, MD1d ago
-
Machine Learning Engineer: Perception and Planning USD 156K-215KC++ | Data Processing | Deep learning | JAX | Machine LearningSenior-level Full TimeOakland, CA1d ago
-
Machine Learning Engineer USD 150K-223KData Preprocessing | Feature Engineering | Healthcare Data | Model Evaluation | PyTorchGrowth opportunities | Industry competitive benefits | Learning programs | Team environmentMid-level Full TimeSan Francisco, California, United States1d ago
-
AI Engineer/Architect USD 149K-184KAI architecture | AI systems | AWS | Autonomous Systems | Azure401k | Flexible work hours | Health insurance | Paid Holidays | Paid family leaveSenior-level Full TimeUSA VA Home Office (VAHOME), United …1d ago
-
(Senior) Machine Learning Engineer USD 132K-215KAWS | CI/CD | Cloud Computing | Computer Vision | Data PreprocessingAnnual incentive | Healthcare coverage | Retirement benefitsSenior-level Full TimeCambridge, MA USA2d ago
-
AI Research Scientist, SysML - FAIR USD 143K-208KArtificial Intelligence | C# | C++ | Co-design | Hardware-Software Co-designMid-level Full TimeMenlo Park, CA | Boston, MA …2d ago