Senior Staff Research Engineer, Speech Machine Learning

665 Clyde Avenue, Mountain View, CA, USA

Samsung Research America

For more than 70 years, Samsung has been at the forefront of innovation. Our discoveries, inventions and breakthrough products have helped shape the history of the digital revolution. We continue to expand our global reach and open new...

View all jobs at Samsung Research America

Apply now Apply later

Lab Summary:

Bixby is an intelligent personal assistant which is only available as a built-in application on Samsung flagship devices and wearables. This application uses Natural Language Understanding to perform tasks on these devices using voice/ text, including but not limited to making phone calls, sending text messages, setting up meetings, opening apps, setting alarms and timers, getting directions, answering general questions, providing information about restaurants and other businesses, etc.

Position Summary:

For this position we are expanding our AILs voice technology and features to include advanced research and projects in Wake word detection, Automatic Speech Recognition (ASR), that includes Acoustic and Language Modeling, and personalization. We also work on language and gender detection using speech signals, Speaker identification, verification and diarization techniques. At AIL we perform state-of-the-art research in multi-lingual/accents research and bringing those research ideas to production. We are looking for candidates with extensive expertise in Digital Signal/Speech Processing with Speech recognition specialization, demonstrated research expertise by publishing papers in reputed journals/conferences, excellent knowledge of  Deep/Machine Learning with 7+ years of industry experience. Candidates are expected to work in a fast paced environments. 

Position Responsibilities: 

  • Architect and design end to end Automatic Speech Recognition products, applications and solutions for specific business needs and provide implementation guidance during delivery
  • Leverage, customize and implement ASR models, algorithms, and methodologies to improve the overall quality ASR in various applications and systems
  • Analyze and evaluate the performance ASR systems and provide design recommendations
  • Analyze and make right technological choices for generative ai solutions
  • Design and prototype reusable components for LLM based solutions for ASR
  • Architect components of an ASR solution to address Responsible AI & Security
  • Collaborate seamlessly with diverse, cross-functional teams to accurately identify and prioritize requirements, ensuring that the language model meets the needs and expectations of various stakeholders
  • Create and maintain comprehensive technical documentation that comprehensibly captures the intricate details of the language model, facilitating seamless understanding, efficient troubleshooting, and future development
  • Harness the power of transformer architecture, a cutting-edge deep learning model widely employed in natural language processing and computer vision, to optimize the language model's performance and efficiency
  • Exploiting the transformative capabilities of transformer architectures to seamlessly process and reshape vast volumes of data, empowering the language model to achieve unprecedented levels of accuracy and versatility
  • Ensure ethical AI development practices, prioritizing fairness, transparency, and privacy

Required Skills:

  • MS or Ph.D. in Computer Science or Digital Signal Processing or equivalent combination of education, training, and experience
  • 7+ years of relevant professional experience in Machine Learning or relevant field
  • Experience with Tensorflow or Pytorch or similar frameworks
  • Worked on advance architectures such as transformers, conformer and other advanced models for ASR systems
  • Working experience on ASR in large scale production systems
  • Experience in modeling ML algorithms on GPUs at scale
  • Experience with multi-lingual speech, low resource speech research and architectures
  • Working experience on deploying recognition engines on both server and edge devices
  • Experience with Acoustic modeling, noise and ambient modeling, and its effects on ASR
  • Knowledge of state-of-the-art Large Language models such as Deepseek, GPT, BERT variants and other deep fusion techniques is essential
  • Working on WFST, n-gram and other shallow fusion techniques for named entity recognitions
  • Experience on speaker recognition, wakeup and audio-based language recognition is desirable
  • Experience with improving ASR performance in far field and noisy environments
  • Working experience on masking and spectral restoration based noise suppression and speech enhancement techniques
  • Experience in developing advance classification models such as ECAPA-TDNN for speaker, gender classifications
  • Ability to develop project plans and experience to execute them
  • Research expertise in ML and written research publications
  • C/C++, PYTHON, JAVA programming language experience
  • Leadership ability to lead a mid-size team
Our total rewards programs are designed to motivate and engage exceptional talent. The base pay range for roles at this level is listed below, but may be higher or lower in other states due to geographic differentials in the labor market. Within the base pay range, individual rates depend on a number of factors—including the role’s function and location as well as the individual’s knowledge, skills, experience, education and training. This is part of our comprehensive compensation package with annual bonus eligibility and generous benefits to help you live life well.Base Pay Range$197,800—$296,600 USD

Additional Information

Disclosure of Trade Secrets

Samsung has a strict policy on trade secrets. In applying to Samsung and progressing through the recruitment process, you must not disclose any trade secrets of a current or previous employer.

Essential Job Functions

This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.

Samsung Research America is committed to complying with all Federal, State and local laws related to the employment of qualified individuals with disabilities. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the recruiter or email sratalent@samsung.com.

Equal Employment Opportunity

At Samsung, we believe that innovation and growth are driven by an inclusive culture and a diverse workforce. We aim to create a global team where everyone belongs and has equal opportunities, inspiring our talent to be their true selves. Together, we are building a better tomorrow for our customers, partners, and communities.

Samsung Research America is committed to employing a diverse workforce, and  provide Equal Employment Opportunity for all individuals regardless of race, color, religion, gender, age, national origin, marital status, sexual orientation, gender identity, status as a protected veteran, genetic information, status as a qualified individual with a disability, or any other characteristic protected by law.

For more information regarding protection from discrimination under Federal law for applicants and employees, please refer to this link: Pay Transparency

Apply now Apply later

Tags: Architecture ASR BERT Classification Computer Science Computer Vision Deep Learning Generative AI GPT Java LLMs Machine Learning NLP Privacy Python PyTorch Research Responsible AI Security TensorFlow Transformers

Perks/benefits: Career development Conferences Salary bonus

Region: North America
Country: United States

More jobs like this