Machine Learning Engineer, Distributed Training Infrastructure
San Francisco, United States
Twelve Labs
Bring human-like video understanding to any application, whether you have terabytes or petabytes of video.
Who we are
At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.
With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.
We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.
About the role
As Machine Learning Engineer, Distributed Training Infrastructure, you will be responsible for ensuring that compute performance and ease of use never delay our research timeline. You will own strategy and implementation for all compute and training infrastructure optimization, observability, scaling, and orchestration. You will collaborate closely with other engineers and scientists to define and implement your chosen roadmap. This role is a perfect fit for research-minded compute specialists who want to build SOTA video, vision, and video-language modeling systems!
In this role, you will:
- Own our compute strategy end to end
- Partner with researchers to understand our future research roadmap and to identify the scaling limitations that will most imminently block us from achieving our goals
- Be a hands-on leader who is excited to debug perplexing node failures at odd hours
- Mentor junior engineers/researchers, and hold a high bar around code quality / engineering best practices
- Work across teams to understand and manage project priorities and product deliverables, evaluate trade-offs, and drive technical initiatives from ideation to execution to shipment
You may be a good fit if you have:
- 7+ years of industry experience
- Experience owning large-scale distributed training efforts across thousands of accelerators
- Experience with a wide range of HPC-related tools, and strong opinions about how we should build our stack
- A passion for solving the most pressing technical challenges, as opposed to the most intellectually satisfying ones
- Strong Python and infrastructure-as-code expertise
- Excellent communication skills in written and spoken English
Interview process
1) Recruiter Phone Screen
2) Initial Technical Assessment
3) Final round technical assessment & culture interview
4) Reference Checks
We're also excited to share that we'll do global onboarding in Seoul for all new hires (paid company travel!).
Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-to-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at Twelve Labs.
We welcome applicants from all walks of life and are committed to equal-opportunity employment. We cherish and celebrate diversity not just because it is the right thing to do, but because it makes our company much stronger.
Benefits and Perks
- 🤝 An open and inclusive culture and work environment
- 🧑‍💻 Work closely with a collaborative, mission-driven team on cutting-edge AI technology
- 🦷 Full health, dental, and vision benefits
- ✈️ Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Year's.
- 🏙 Remote-flexible, with offices in San Francisco and Seoul and a coworking stipend
- 🛂 Visa support (such as H-1B and OPT transfer for US employees)