Member of Technical Staff - Pretraining / Inference Optimization
Remote | Germany | USA
Black Forest Labs is a cutting-edge startup pioneering generative image and video models. Our team, which invented Stable Diffusion, Stable Video Diffusion, and FLUX.1, is currently seeking a strong researcher / engineer to work closely with our research team on pretraining and inference optimization.
Role:
- Finding ideal training strategies (parallelism, precision trade-offs) for a variety of model sizes and compute loads
- Profiling, debugging, and optimizing single and multi-GPU operations using tools such as Nsight or stack trace viewers
- Reasoning about the speed and quality trade-offs of quantization for model inference
- Developing and improving low-level kernel optimizations for state-of-the-art inference and training
- Innovating new ideas that bring us closer to the limits of a GPU
Ideal Experiences:
- Being familiar with the latest and the most effective techniques in optimizing inference and training workloads
- Optimizing for both memory-bound and compute-bound operations
- Understanding GPU memory hierarchy and computation capabilities
- Deep understanding of efficient attention algorithms
- Implementing both forward and backward Triton kernels and ensuring their correctness while considering floating point errors
- Using, for example, pybind to integrate custom-written kernels into a PyTorch framework
Nice to have:
- Experience with Diffusion and Autoregressive models
- Experience in low-level CUDA kernel optimizations
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
9
1
0
Categories:
Deep Learning Jobs
Leadership Jobs
Tags: Autoregressive models CUDA GPU Model inference PyTorch Research Stable Diffusion
Regions:
Remote/Anywhere
Europe
North America
Countries:
Germany
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Data Engineer II jobsBI Developer jobsSr. Data Engineer jobsPrincipal Data Engineer jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsData Science Manager jobsData Manager jobsData Science Intern jobsBusiness Intelligence Analyst jobsSoftware Engineer II jobsDevOps Engineer jobsJunior Data Analyst jobsData Specialist jobsData Analyst Intern jobsBusiness Data Analyst jobsStaff Software Engineer jobsLead Data Analyst jobsSr. Data Scientist jobsAI/ML Engineer jobsSenior Backend Engineer jobsData Governance Analyst jobsData Engineer III jobsResearch Scientist jobs
NLP jobsAirflow jobsOpen Source jobsMLOps jobsKPIs jobsTerraform jobsLinux jobsEconomics jobsJavaScript jobsKafka jobsNoSQL jobsData Warehousing jobsGoogle Cloud jobsComputer Vision jobsGitHub jobsRDBMS jobsPostgreSQL jobsR&D jobsPhysics jobsScikit-learn jobsStreaming jobsData warehouse jobsHadoop jobsScala jobsBanking jobs
dbt jobsPandas jobsBigQuery jobsOracle jobsClassification jobsReact jobsScrum jobsLooker jobsCX jobsRAG jobsDistributed Systems jobsPySpark jobsIndustrial jobsPrompt engineering jobsMicroservices jobsRedshift jobsELT jobsJira jobsRobotics jobsTypeScript jobsGPT jobsSAS jobsOpenAI jobsLangChain jobsNumPy jobs