aijobs.net

Find jobs in AI/ML, Data Science and Big Data

4 results for Paged Attention (Skill/Tech stack)

ML Performance Engineer USD 100K-150K

Benchmarking | C++ | Compiler optimization | Continuous batching | Custom Kernel

Remote work

Senior-level Full Time

Scottsdale, AZ R

6d ago
Senior Engineer, Inference Data Plane USD 139K-174K

Autoscaling | Continuous batching | Data parallelism | gRPC | Go

Employee assistance program | Flexible time off | Hybrid work model | LinkedIn Learning access | Local Employee Meetups

Senior-level Full Time

Seattle

9d ago
Senior Engineer, Inference Data Plane USD 139K-174K

Continuous batching | Data parallelism | Distributed Systems | gRPC | Go

Employee assistance program | Employee stock purchase program | Equity compensation | Flexible time off | LinkedIn Learning access

Senior-level Full Time

Denver R

9d ago
MaaS Architect CNY 240K-480K

Attention | Batching | C++ | CUDA | Continuous batching

Senior-level Full Time

Shanghai, Beijing

1mo ago