Member of Technical Staff - Pretraining / Inference Optimization
Remote | Germany | USA
Black Forest Labs
Black Forest Labs is a cutting-edge startup pioneering generative image and video models. Our team, which invented Stable Diffusion, Stable Video Diffusion, and FLUX.1, is currently seeking a strong researcher / engineer to work closely with our research team on pretraining and inference optimization.
Role:
- Finding ideal training strategies (parallelism, precision trade-offs) for a variety of model sizes and compute loads
- Profiling, debugging, and optimizing single and multi-GPU operations using tools such as Nsight or stack trace viewers
- Reasoning about the speed and quality trade-offs of quantization for model inference
- Developing and improving low-level kernel optimizations for state-of-the-art inference and training
- Innovating new ideas that bring us closer to the limits of a GPU
Ideal Experiences:
- Being familiar with the latest and the most effective techniques in optimizing inference and training workloads
- Optimizing for both memory-bound and compute-bound operations
- Understanding GPU memory hierarchy and computation capabilities
- Deep understanding of efficient attention algorithms
- Implementing both forward and backward Triton kernels and ensuring their correctness while considering floating point errors
- Using, for example, pybind to integrate custom-written kernels into a PyTorch framework
Nice to have:
- Experience with Diffusion and Autoregressive models
- Experience in low-level CUDA kernel optimizations
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
2
0
0
Categories:
Deep Learning Jobs
Leadership Jobs
Tags: Autoregressive models CUDA GPU Model inference PyTorch Research Stable Diffusion
Regions:
Remote/Anywhere
Europe
North America
Countries:
Germany
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Data Scientist II jobsData Engineer II jobsSr. Data Engineer jobsPrincipal Data Engineer jobsStaff Data Scientist jobsBusiness Intelligence Analyst jobsStaff Machine Learning Engineer jobsData Science Manager jobsData Manager jobsPrincipal Software Engineer jobsData Science Intern jobsJunior Data Analyst jobsSoftware Engineer II jobsDevOps Engineer jobsBusiness Data Analyst jobsData Specialist jobsData Analyst Intern jobsSr. Data Scientist jobsLead Data Analyst jobsStaff Software Engineer jobsResearch Scientist jobsAI/ML Engineer jobsData Engineer III jobsSenior Backend Engineer jobsBI Analyst jobs
NLP jobsAirflow jobsOpen Source jobsEconomics jobsKafka jobsTerraform jobsKPIs jobsMLOps jobsLinux jobsNoSQL jobsJavaScript jobsPhysics jobsComputer Vision jobsGoogle Cloud jobsData Warehousing jobsRDBMS jobsScikit-learn jobsPostgreSQL jobsBanking jobsGitHub jobsScala jobsData warehouse jobsHadoop jobsStreaming jobsPandas jobs
R&D jobsdbt jobsOracle jobsBigQuery jobsCX jobsClassification jobsLooker jobsDistributed Systems jobsPySpark jobsReact jobsScrum jobsRAG jobsRedshift jobsRobotics jobsJira jobsELT jobsIndustrial jobsMicroservices jobsPrompt engineering jobsGPT jobsSAS jobsNumPy jobsData Mining jobsData strategy jobsMySQL jobs