40 jobs for Tensor Parallelism

Sr. Software Engineer, AI Infrastructure USD 144K-236K

Apache Beam | Apache Flink | Apache Spark | C# | C++

Career development | Hybrid work | Stock grants

Senior-level Full Time

Sunnyvale, CA, United States

16h ago

Senior Inference Runtime Engineer TWD 1900K-2500K

CUDA | CUDA profiling | Continuous batching | Distributed inference | GPU Memory Optimization

Flexible work culture | Inclusive environment | Training and mentoring

Senior-level Full Time

Singapore, SG / Penang, MY / …

1d ago

Model Optimization Engineer USD 100K-150K

C++ | CUDA | Continuous batching | Deep learning | DeepSpeed

Senior-level Full Time

United States - Remote R

1d ago

ML Performance Engineer USD 100K-150K

Benchmarking | C++ | Continuous batching | Cutlass | Deep learning

Career growth | Direct W2 employment | Remote work

Senior-level Full Time

Tempe, AZ R

1d ago

Lead Machine Learning Engineer (Foundation Models) SGD 162K-238K

C++ | DPO | Deep learning | DeepSpeed | Distributed Training

Birthday leave | Employee assistance programme | FlexWork | Flexible benefits | Medical insurance

Senior-level Full Time

Singapore, Singapore

4d ago

AI Optimization Engineer USD 100K-150K

Benchmarking | C++ | Cache optimization | Compiler optimization | Continuous batching

Career growth

Senior-level Full Time

United States - Remote R

4d ago

Sr. Software Engineer, AI Infrastructure USD 139K-229K

Apache Beam | Apache Spark | C# | C++ | CUDA

Employee benefits | Hybrid work | Stock grants

Senior-level Full Time

Sunnyvale, CA, United States

4d ago

Model Optimization Engineer USD 100K-150K

Benchmarking | C++ | CUDA | Continuous batching | Cutlass

Senior-level Full Time

United States - Remote R

5d ago

大模型推理架构师 CNY 144K-240K

Ascend C | C plus plus | C# | CUDA | CUDA kernel

Senior-level Full Time

上海

5d ago

Foundation AI Engineer (LLM) CAD 100K-110K

AI Feedback | Attention Mechanisms | Constitutional AI | Constitutional Safety Tuning | Data Curation

Annual health checkups | Healthcare insurance | Opportunity to collaborate with industry professionals | Performance bonuses | Preferential pricing for services

Mid-level Full Time

Hanoi, Vietnam

5d ago

Senior DevOps/ MLOps Engineer USD 100K-150K

AWS | ArgoCD | Bash | CI/CD | CUDA

Annual health checkups | Collaborative learning opportunities | Health insurance | Preferential employee pricing

Senior-level Full Time

Hanoi, Vietnam

5d ago

具身世界模型推理INFRA工程师 - XiaomiRobotics CNY 144K-240K

Diffusion Models | Expert parallelism | FP8 | Machine Learning | Model Inference

Senior-level Full Time

北京

6d ago

ML Framework (MetalLM) Engineer USD 175K-312K

C# | C++ | CUDA | Compiler optimization | Compression

Senior-level Full Time

Cupertino

7d ago

AI Engineer (Managed Services) SGD 85K-138K

ARES | AWQ | Agent Orchestration | Agent systems | Attention Mechanisms

Mid-level Full Time

Singapore

11d ago

优才-具身大模型训练框架工程师-觅蜂子公司 CNY 500K-500K

C++ | CPU Optimization | CUDA | CUDA graph | Communication overlap

Mid-level Full Time

上海 R

11d ago

Senior AI Infrastructure Engineer - Model Training USD 190K-260K

BF16 | C++ | CUDA | Data parallelism | DeepSpeed

401k | Dental and vision plans | Dependent care FSA | Dog-friendly office | FSA

Senior-level Full Time

Mountain View, CA

11d ago

Senior AI scientist INR 3715K-5449K

ALiBi | Adafactor Optimizer | AdamW | Attention Mechanisms | BF16

Senior-level Full Time

Remote - India R

12d ago

AI Engineer (Managed Services) SGD 85K-138K

A/B | A/B Testing | AWQ | AutoAWQ | B testing

Mid-level Full Time

Singapore

12d ago

Tech Lead Manager, Inference USD 207K-300K

Autoscaling | Cache Management | Caching | Continuous batching | Deployment Pipelines

Senior-level Full Time

SF Bay Area, CA

13d ago

Sr. Engineering Manager, AI Runtime USD 228K-297K

Checkpointing | DeepSpeed | Distributed Systems | Elastic Training | FSDP

Senior-level Full Time

Mountain View, California; San Francisco, California

14d ago

Senior Software Engineer I - AI Inference Data Plane INR 3584K-4400K

Continuous batching | Data parallelism | Databases | Distributed Systems | GPU Optimization

Conference reimbursement | Employee assistance program | Employee stock purchase program | Equity compensation | Flexible time off

Senior-level Full Time

Bengaluru

14d ago

Staff ML Engineer - Ads ML Infrastructure USD 175K-312K

Deep learning | Distributed Systems | Feature Stores | GPU Kernels | High Performance

Senior-level Full Time

Cupertino; New York City

14d ago

Staff ML Engineer - Ads ML Infrastructure USD 175K-312K

Deep learning | Distributed Systems | Feature Store | Federated Learning | GPU Kernels

Senior-level Full Time

Cupertino; New York City

14d ago

Senior AI Scientist USD 123K-197K

ALiBi | Adafactor | AdamW | Attention | BF16

Annual bonus opportunity | Company RRSP contribution | Equity awards | Hybrid work | Insurance coverage

Senior-level Full Time

Remote - USA, United States R

15d ago

Gen AI Engineer USD 112K-167K

AWQ | AWS | AWS ECS | AWS EKS | Agile

401k match | Dental insurance | Life insurance | Medical insurance | Paid Holidays

Mid-level Full Time

GA-ATLANTA, 740 W PEACHTREE ST NW, …

15d ago

Senior Software Engineer, RL Post-Training Frameworks EUR 90K-140K

C# | C++ | CPUs | CUDA | Container lifecycle

Comprehensive benefits | Family benefits | Health insurance | Paid time off

Senior-level Full Time

Remote - Germany R

18d ago

Principal LLM Inference Engineer USD 195K-285K

Batching | C# | C++ | CUDA | CUDA kernel

Equity | Flexible working hours | Health insurance | Paid time off

Senior-level Full Time

Santa Clara

20d ago

Staff ML Engineer, Generative Model Performance & Efficiency USD 251K-310K

Data parallelism | Diffusion Models | Efficient Attention | Expert parallelism | Flax

Senior-level Full Time

Mountain View, California, United States, New …

21d ago

Senior Software Architect, AI Networking

C++ | CUDA | Cluster scheduling | Compute scheduling | Deep learning

Senior-level Full Time

Israel, Tel Aviv

27d ago

Software Engineer, Systems ML USD 141K-208K

C plus plus | CUDA | Co-design | Compiler optimization | Deep learning

Senior-level Full Time

Bellevue, WA | Menlo Park, CA …

1mo ago

Senior AI Infra Engineer - Large Model Inference Systems (Multimodal/LLM/VLM) USD 198K-368K

Attention Mechanisms | Batching | CUDA | Data parallelism | Distributed Systems

Senior-level Full Time

San Jose, California, United States

1mo ago

Senior AI Infra Engineer - Large Model Training Infrastructure (LLM/VLM /Agent RL) USD 207K-300K

Attention Mechanisms | Data parallelism | Deep learning | Distributed Training | Language Models

Senior-level Full Time

San Jose, California, United States

1mo ago

AI Engineer EUR 60K-80K

AWQ | AWS | Agent SDK | CI/CD | CUDA

Career growth opportunities | Permanent employment | Remote work option

Mid-level Full Time

Remote - Paris, France R

1mo ago

Staff Software Engineer, AI Runtime USD 190K-265K

Algorithms | Automatic Recovery | Checkpointing | Collective communication | Data Structures

Senior-level Full Time

Mountain View, California; San Francisco, California

1mo ago

LLM Inference Frameworks and Optimization Engineer USD 160K-230K

C++ | CUDA | CUDA graph | Cluster scheduling | Compiler

Equity | Health insurance

Mid-level Full Time

San Francisco, Singapore, Amsterdam

1mo ago

Research Scientist, Efficient Deep Learning - New College Grad 2026 USD 168K-264K

Architecture Search | C++ | CUDA | Computer Vision | Deep learning

Senior-level Full Time

US, CA, Santa Clara, United States

1mo ago

AI Engineer USD 100K-135K

AWQ | AWS | AWS EC2 | Agent Frameworks | CI/CD

401k match | Health insurance | Learning and development stipend | Paid parental leave | Paid time off

Mid-level Full Time

Remote USA - In Tandem R

1mo ago

Senior Software Engineer, AI Runtime USD 160K-225K

Algorithms | Checkpointing | Collective communication | Data Structures | Data parallelism

Senior-level Full Time

Mountain View, California; San Francisco, California

1mo ago

Staff Compiler Engineer - PyTorch + Kernel DSLPLATE USD 163K-253K

Autotuning | Collective Primitives | Cost Based Compilation | Custom ISA | Cutlass

401k | Adoption support stipend | Charitable giving match | Fertility care stipend | Flexible work environment

Senior-level Full Time

San Jose, California, United States

1mo ago

Data/AI Engineer Intern SGD 40K-57K

AI Job Scheduling | Automated testing | C++ | Checkpointing | DeepSpeed

Entry-level Full Time Internship

Singapore-CapitaSky

1mo ago

Find jobs in AI/ML, Data Science and Big Data