Principal Machine Learning Engineer, Speech Recognition
Canada
SoundHound Inc.
Voice AI interfaces for hardware devices, services, vehicles, mobile apps, and more, powered by SoundHound's conversational intelligence solutions.
SoundHound AI believes every person should be able to interact naturally with the products around them by simply talking. With a global reach spanning two dozen languages, we build Voice AI products with conversational intelligence for industries ranging from automotive, restaurants, and retail to enterprise sectors such as financial institutions, insurance, and healthcare. Our solutions empower customers to extend their brand in new and meaningful ways, revolutionizing how businesses connect with their audiences.
The Principal Software Engineer, Machine Learning role on the Automatic Speech Recognition (ASR) Modeling team leads efforts to research and develop production-ready speech recognition solutions, focusing on end-to-end speech recognition and large language models (LLMs). The role's purpose is to continuously innovate and refine the architecture of vital systems. This position requires someone who excels in a challenging, algorithm-driven, and hands-on environment.
In this role, you will:
- Drive innovation and advancements in the core architecture of Automatic Speech Recognition (ASR).
- Lead and contribute to cutting-edge research in Machine Learning (ML), with a focus on ASR, natural language understanding, dialogue generation, and context-aware conversational systems.
- Develop novel algorithms and models to enhance the performance and intelligence of our ASR platform.
- Train and optimize large-scale deep learning ASR models.
- Collaborate closely with cross-disciplinary teams, including machine learning engineers, software developers, and product managers, to translate research insights into practical solutions.
- Mentor and support junior researchers and engineers, fostering a culture of continuous learning and innovation.
- Stay informed on the latest developments in ASR and large language models (LLMs), driving the ongoing enhancement of our technology stack.
We would love to hear from you if:
- You have a Ph.D. in Computer Science and bring at least five years of relevant, hands-on industry experience, or a Master's in Computer Science with at least eight years of industry experience.
- You have a proven track record of impactful research and development in Machine Learning and ASR, with a portfolio of publications and projects that showcase your expertise.
- You are highly skilled in deep learning frameworks (e.g., TensorFlow, PyTorch) and have experience with training large-scale models.
- You have strong programming abilities in Python, enabling you to implement and optimize ASR models effectively.
- You are adept at solving complex problems and developing innovative solutions in the realm of conversational AI.
- You are passionate about staying at the cutting edge of AI research and are driven to push the boundaries of what's possible in conversational AI.
- You excel at teamwork and communication, collaborating seamlessly with cross-functional teams.
We’d be especially excited if:
- You are proficient in C/C++ and possess strong overall software engineering expertise.
- You have industry experience in the field of Automatic Speech Recognition (ASR).
This role is available throughout Canada. Employees within a 100-kilometer radius of our Toronto office are expected to work from the office on three pre-scheduled “core days” each month to encourage cross-team connection and in-person collaboration. Aside from these office-specific “core days,” this job allows for virtual/remote, hybrid, and in-office workplace setting options. In addition to salary and equity, you will receive comprehensive healthcare, paid time off, and other benefits. Our recruiting team will provide a specific salary range based on location and years of experience.
Please note that if your application advances, the first step will be an invitation to complete a pre-assessment.
_____________________________________
SoundHound AI strives to be a values-driven company that is supportive of one another, open and honest, undaunted by challenges, nimble and focused, and determined to excel and win. Diversity, equity, inclusion, and belonging are key to who we are as a company. With a mission to build Voice AI for the world, creating a team with global perspectives is critical to our success. Learn more about our philosophy, benefits, and culture at https://www.soundhound.com/careers.
We care deeply about fostering an environment where everyone is supported and can do their best work. SoundHound ensures that individuals with disabilities are provided reasonable accommodations to participate in the interview process, perform essential job functions, and receive other employment benefits.
To view our job applicant privacy policy, please visit https://static.soundhound.com/corpus/ta/applicantprivacynotice.html.
Come join our growing team and bring your unique voice to our mission!