Find jobs in AI/ML, Data Science and Big Data
10 results
for FlashAttention
(Skill/Tech stack)
-
Senior-level Full TimeBeijing Yizhuang, China2d ago
-
A/B | A/B Testing | AUC | AWQ | AWS SageMakerSenior-level Full TimeTel-Aviv, Israel2d ago
-
Senior Software Engineer, LLM Performance USD 180K-339KC++ | CUDA | Cutlass | FlashAttention | FlashInferSenior-level Full TimeSF Bay Area (Hybrid) R3d ago
-
VLA训练infra算法工程师 - XiaomiRobotics CNY 240K-480KBF16 | C++ | CPU/memory optimization | CUDA | Data pipelineMid-level Full Time北京6d ago
-
Miclaw-大模型训练推理方向实习生 CNY 25K-37KAttention Mechanism | C++ | CUDA | Compiler optimization | FlashAttentionEntry-level Internship北京15d ago
-
Entry-level Internship北京15d ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | CUDA profiling | Containerization | Context Parallelism | Data I/OHealth and wellness programs | Hybrid work | Time away from workEntry-level Full TimeMountain View, CA, United States16d ago
-
Software Engineering Manager, LLM Training USD 170K-277KCUDA | Containerization | Data parallelism | Distributed Systems | DockerFlexible-hybrid work | Health and wellness programs | Time offEntry-level Full TimeMountain View, CA, United States17d ago
-
Machine Learning Engineer, AI Models EUR 72K-96KC++ | CUDA | FlashAttention | Kernel Fusion | Memory hierarchySenior-level Full TimeCyprus17d ago
-
Senior AI Software Engineer, Kernel Libraries USD 184K-287KApache TVM | C# | C++ | CUDA | FlashAttentionBenefits | EquitySenior-level Full TimeUS, CA, Santa Clara, United States1mo ago