Find jobs in AI/ML, Data Science and Big Data
49 results
for Model Parallelism
(Skill/Tech stack)
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Continuous batching | Cutlass | DeepSpeedCareer growth potential | Full-time benefits | H1B transfer support for qualified candidates | Long-term engagement | Remote workMid-level Full TimeUnited States - Remote R1d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Compiler optimization | Continuous batching | Deep learningMid-level Full TimeUnited States - Remote R1d ago
-
Mid-level Full Time北京 R3d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | Continuous batching | Custom Kernel | Custom kernel development | Cutlass100 percent remote | Benefits package | Full-time employmentMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Continuous batching | Data loading | Data loading optimizationMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 100K-150KC++ | CUDA | Continuous batching | Cutlass | DeepSpeedMid-level Full TimeUnited States - Remote R4d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | CUDA | Continuous batching | CutlassBenefits package | Remote workMid-level Full TimeUnited States - Remote R5d ago
-
AI Performance Optimization Engineer USD 100K-150KBenchmarking | C++ | Compiler optimization | Continuous batching | CutlassCareer growth | Health benefits | Mentorship | Remote workMid-level Full TimeUnited States - Remote R5d ago
-
Senior AI Researcher (Foundation AI) USD 190K-230KCI/CD | Cloud Computing | Context Parallelism | DPO | Data parallelismSenior-level Full TimeBoston, MA11d ago
-
CI/CD | Containerization | Data Pipelines | Data parallelism | Deep learningEmployee discount | Employee sample sales | Flexible benefits allowance | Paid annual leave | Personalised learningSenior-level Full TimeLondon, England, United Kingdom11d ago
-
Senior-level Full TimeMilpitas, CA, United States12d ago
-
AI/ML ASIC Architect USD 163K-249KARM | ASIC architecture | AXI interconnect | Area Optimization | Attention MechanismsSenior-level Full TimeMilpitas, CA, United States12d ago
-
Sr GenAI Infra Specialist SA, AWS WWSO Startup USD 153K-228KAWS | Amazon EC2 | Amazon EKS | Amazon S3 | Cache optimizationInclusive team culture | Mentorship and career growth | Work-life balanceSenior-level Full TimeNew York, New York, USA13d ago
-
Software Engineer, Inference - Multi Modal USD 295K-555KDistributed Systems | GPU | High Throughput | Inference | Language ModelsEntry-level Full TimeSan Francisco16d ago
-
Staff Software Engineer - AI Research Infrastructure USD 199K-270KBackend Services | CI | Cluster management | Data Pipelines | Distributed SystemsSenior-level Full TimeNew York City, New York; San …18d ago
-
Generative AI - ML System Engineering CNY 360K-600KC++ | CUDA | Compilation | Data pipeline | Diffusion ModelsFully remote option | On-site work flexibilitySenior-level Full TimeShanghai R18d ago
-
C plus plus | CI/CD | CUDA | Computer Architecture | Distributed SystemsSenior-level Full TimeTel Aviv-Yafo, Tel Aviv, ISR18d ago
-
Senior MLOps & AI Infrastructure Engineer USD 149K-215KAWS SageMaker | Airflow | Arize | Azure Machine Learning | BashSenior-level Full TimeSan Jose, California, United States, United …18d ago
-
Senior-level Full TimeTel Aviv-Yafo, Tel Aviv, ISR18d ago
-
Large Model Training Acceleration Engineer USD 187K-387KBenchmarking | Data parallelism | Deep learning | Distributed Training | Distributed inferenceMid-level Full TimeSan Jose, California, United States20d ago
-
Audio Inference Engineer, Model Efficiency USD 165K-300KC++ | Deep learning | Distributed inference | GPU Programming | Low-level systemCo-working stipend | Health and dental benefits | Inclusive culture | Mental health budget | Parental leave top-upMid-level Full TimeNew York27d ago
-
Senior Applied AI Researcher (India) INR 2500K-4500KArtificial Intelligence | DPO | Data parallelism | DataLoader | DeepSpeedSenior-level Full TimeIndia/Bengaluru28d ago
-
Senior Applied AI Researcher (Brazil) BRL 271K-370KCI/CD | DPO | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeBrazil/Remote R28d ago
-
Senior Applied AI Researcher (Dublin, CA) USD 190K-300KAutomated testing | Continuous Evaluation | Data parallelism | Deep learning | DeepSpeedSenior-level Full TimeDublin, CA (HQ)28d ago
-
Senior-level Full TimeTaichung - AATT, Taiwan28d ago
-
Inference Server – Product Software Intern USD 100K-140KAPI Design | Backend Development | Batching | C++ | CachingHybrid workEntry-level InternshipBelgrade, Serbia1mo ago
-
Staff Software Engineer - AI Research Infrastructure USD 190K-270KBackend Services | C plus plus | CI/CD | Cloud infrastructure | Cluster managementSenior-level Full TimeNew York City, New York; San …1mo ago
-
Staff Software Engineer - AI Research Infrastructure USD 190K-270KBackend Services | CI testing | Cluster scheduling | Data Pipelines | Distributed SystemsSenior-level Full TimeNew York City, New York1mo ago
-
Intern Researcher – AI Foundation Model Training CAD 58K-104KAI Agent | AI agent systems | Agent systems | Architecture Search | Computational Graph OptimizationEntry-level InternshipMarkham, Ontario, Canada1mo ago
-
Staff Technical Lead for Inference & ML Performance USD 180K-300KCUDA | Compilation | Cutlass | Distributed Serving | Kernel optimizationSenior-level Full TimeSan Francisco1mo ago
-
CI/CD | Containerization | Data Pipelines | Data parallelism | Deep learningEmployee discount | Employee sample sales | Flexible benefits | Flexible benefits allowance | Paid annual leaveSenior-level Full TimeLondon, England, United Kingdom1mo ago
-
CI/CD | Containerization | Data Pipelines | Data parallelism | Deep learningCelebration day | Employee discount | Employee sample sales | Flexible benefits allowance | Paid annual leaveSenior-level Full TimeLondon, England, United Kingdom1mo ago
-
Software Engineer, Machine Learning Infrastructure USD 190K-300KAWS Kinesis | AWS Lambda | AWS SageMaker | Amazon DynamoDB | Amazon EC2Cell phone and internet allowance | Childcare allowance | Dental insurance | Flexible time off | Health insuranceMid-level Full TimeSan Francisco, CA1mo ago
-
Agentic AI | Autogen | BF16 | Big Data | CI/CDSenior-level Full TimeFab 10A, Singapore1mo ago
-
MLOps Engineer EUR 45K-45KAWS | Azure | Azure CycleCloud | Azure Data | Azure Data FactoryCareer plan | Discounted lunch options | Educational budget | Flexible remuneration | Flexible working hoursMid-level Full TimeMadrid, Spain1mo ago
-
Data parallelism | Deep learning | Distributed Training | Model Acceleration | Model BenchmarkingSenior-level Full TimeSan Jose, California, United States1mo ago
-
Computational optimization | Data parallelism | Deep learning | Distributed Training | Generative AIMid-level Full TimeSan Jose, California, United States1mo ago
-
Communication optimization | Data parallelism | Deep learning | Distributed Training | Generative AISenior-level Full TimeSeattle, Washington, United States1mo ago
-
Benchmarking | CUDA | Data parallelism | Distributed Training | Model ParallelismSenior-level Full TimeSan Jose, California, United States1mo ago
-
Principal PMT-ES - AI/ML Training, Annapurna Labs USD 181K-281KAI/ML | Customer Requirements | DPO | Deep learning | Developer experienceCareer growth resources | Flexible organization | Knowledge sharing | Mentorship | Work-life balanceSenior-level Full TimeCupertino, California, USA1mo ago
-
Benchmarking | CUDA | Communication optimization | Data parallelism | Deep learningMid-level Full TimeSeattle, Washington, United States1mo ago
-
Data parallelism | Deep learning | Distributed Training | GPU Acceleration | Model BenchmarkingMid-level Full TimeSan Jose, California, United States1mo ago
-
Applied Scientist 4 USD 120K-251KAutomatic Speech Recognition | C++ | Cloud Computing | Computer Vision | Data AnnotationMid-level Full TimePleasanton, CA, United States1mo ago
-
Machine Learning Systems Engineer USD 160K-253KAWQ | C# | C++ | CUDA | Distributed TrainingDental insurance | Free meals and snacks | Health insurance | Professional development | Unlimited PTOSenior-level Full TimeMenlo Park, CA1mo ago
-
CI/CD | Containerization | Data Pipelines | Data parallelism | Deep learningEmployee discount | Employee sample sales | Flexible benefits allowance | Paid annual leave | Personalised learningSenior-level Full TimeLondon, England, United Kingdom1mo ago
-
AI/ML Software Engineer TWD 140K-500KC++ | Collective Communications | Device orchestration | Diffusion Models | Distributed ComputingSenior-level Full TimeHsinchu, Taiwan1mo ago
-
Machine Learning Engineer - Orchestration USD 212K-450KAutoscaling | Distributed Systems | Embedding | Eviction | GPUSenior-level Full TimeSan Jose, California, United States1mo ago
-
AI acceleration | Communication optimization | Data parallelism | Deep learning | Distributed TrainingSenior-level Full TimeSeattle, Washington, United States1mo ago
-
Software Engineer, Inference – AMD GPU Enablement USD 295K-555KCUDA | Collective communication | Distributed Systems | GPU Kernels | HIPMid-level Full TimeSan Francisco1mo ago