ML Compiler Software Engineering Technical Lead

Santa Clara, CA

d-Matrix

d-Matrix delivers the world's most efficient AI computing solutions for generative AI at scale



At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration.

We value humility and believe in direct communication. Our team is inclusive, and our differing perspectives allow for better solutions. We are seeking individuals who are passionate about tackling challenges and driven by execution. Ready to come find your playground? Together, we can help shape the endless possibilities of AI.

Location:

Hybrid, working onsite at our Santa Clara, CA headquarters 3 days per week.

The role: MLIR Software Engineering Technical Lead

What you will do:

The Compiler Technical Lead drives the design and implementation of the MLIR-based compiler framework. In this role, you will oversee the development of the compiler that partitions and maps large-scale NLP models to our scalable, multi-chiplet, parallel processing architecture with hundreds of digital in-memory tensor processors, vector processors, data shaping processors and both on-chip and off-chip memory. The compiler will also coordinate the scheduling of parallel tasks onto the processors, data movement and inter-processor synchronization. The many-pass compiler architecture requires graph optimization passes, constant folding, data reshaping, padding, tiling and various other backend-specific operations. The software will support a split offline/online mapping process with just-in-time mapping to chiplets, processors and DDR memory channels.

This role requires collaborating with the HW and SW architecture team, the PyTorch front-end pre-processing team, the data science numerics team, the AI kernel team, the SW test group, the benchmark group and the teams developing the various simulator and emulation platforms. It is central to the overall efficiency of the solution. As such, we are seeking an AI compiler expert with experience in TVM, Glow or, preferably, the MLIR project. Familiarity with the LLVM project is also important. Experience mapping graph operations to many-core processors (or spatial fabrics) is desirable.

This role does NOT require hardware design or verification experience. That said, an understanding of the trade-offs made by processor architects when implementing accelerators for DNNs, DCNNs, transformer models and attention mechanisms is useful, especially when it comes to mapping very large NLP models to such architectures.

What you will bring:

Minimum:

  • BS in Computer Science or equivalent (MS preferred), with 10+ years of experience in ML compilers.

  • Experience establishing, growing and/or developing engineering teams (and software teams in particular).

  • Experience leading agile development is preferred, including coordinating scrums, managing sprints and tracking project tasks with Kanban boards or similar.

  • Experience running code reviews and bug-tracking meetings, and familiarity with CI/CD flows.

  • Experience managing interdependencies with other teams in order to meet milestones and target levels of performance.

  • Excellent documentation and presentation skills.

This role includes technical leadership responsibilities, specifically team motivation and engagement, goal setting, performance tracking and performance management.


Equal Opportunity Employment Policy

d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.

d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individuals interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of all applicants. Thank you for your understanding and cooperation.


Tags: Agile Architecture CI/CD Computer Science Engineering Generative AI Kanban Machine Learning NLP PyTorch

Region: North America
Country: United States
