Researcher, Machine Learning

665 Clyde Avenue, Mountain View, CA, USA

Samsung Research America

For more than 70 years, Samsung has been at the forefront of innovation. Our discoveries, inventions and breakthrough products have helped shape the history of the digital revolution. We continue to expand our global reach and open new...

View all jobs at Samsung Research America

Apply now Apply later

Lab Summary:

Samsung Research America is looking for outstanding researchers to join our Emerging Technologies (ET) Group.  The ET Group is uniquely positioned to be at the heart of Samsung Research America’s innovation engine aimed towards advancing Samsung’s product offerings across smartphones, wearables, TVs, XR devices, and identifying the next big growth drivers for Samsung across a range of emerging technologies.

Position Summary: 

We are looking for highly skilled and motivated researchers/engineers who can contribute to the development of fast LLM inferencing techniques, AI technology stack optimization, neuro-symbolic AI models, temporal knowledge graphs, and multimodal reasoning.

Position Responsibilities

  • Speculative decoding algorithms tailored for LLMs
  • Optimization of GPU/TPU utilization to minimize latency in token generation pipelines
  • Analyzing bottlenecks in model inference (e.g., memory bandwidth, compute constraints) and propose solutions to improve benchmark model performance pre- and post-optimization
  • Lead integration of multiple projects at the intersection of Cognitive Modeling, Machine Learning, Knowledge Representation and Reasoning  
  • Collaborate with a multidisciplinary team of researchers across different teams
  • Stay ahead of industry trends in LLM acceleration (e.g., dynamic batching, quantization, kernel fusion)
  • Publish findings and contribute to open-source projects
  • Generate creative solutions (patents), publish research results in top conferences (papers)

 Required Skills:

  • PhD in C.S., EE or related fields or equivalent combination of education, training and experience
  • 2+ years of work experience after PhD.
  • Expertise in knowledge graph and retrieval augmented generation (RAG) models
  • Experience in Contextual intelligence
  • Experience in large language model (LLM), including Transformer model architecture, attention mechanisms, decoder only LLMs, and autoregressive model optimization
  • Proficiency in PyTorch, CUDA, and distributed training/inference framework (e.g., DeepSpeed, vLLM) 
  • Hands-on experience profiling and optimizing LLMs on GPUs/TPU
  • A strong publication record in top-tier AI and NLP conferences is a plus

Special Attributes:

  • Ability to debug complex, latency-critical systems
  • Strong analytical and problem-solving skills, with a keen attention to detail and a passion for pushing the boundaries of AI capabilities
  • Excellent written and verbal communication skills, with the ability to present complex concepts and research findings in a clear and concise manner
  • Demonstrated ability to work independently as well as collaboratively in a fast-paced research and development environment
Our total rewards programs are designed to motivate and engage exceptional talent. The base pay range for roles at this level is listed below, but may be higher or lower in other states due to geographic differentials in the labor market. Within the base pay range, individual rates depend on a number of factors—including the role’s function and location as well as the individual’s knowledge, skills, experience, education and training. This is part of our comprehensive compensation package with annual bonus eligibility and generous benefits to help you live life well.Base Pay Range$151,200—$207,750 USD

Additional Information

Be careful not to disclose information related to the trade secrets of your previous or current employer(s)

Essential Job Functions

This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.

Samsung Research America is committed to complying with all Federal, State and local laws related to the employment of qualified individuals with disabilities. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the recruiter or email sratalent@samsung.com.

Affirmative Action / Equal Opportunity

Samsung Research America is an Affirmative Action and Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability, or status as a protected veteran.

For more information regarding protection from discrimination under Federal law for applicants and employees, please refer to the links below.

Know Your Rights  |  Pay Transparency

Apply now Apply later
Job stats:  1  0  0

Tags: Architecture Autoregressive models CUDA GPU LLMs Machine Learning Model inference NLP Open Source PhD Pipelines PyTorch RAG Research vLLM

Perks/benefits: Career development Conferences Salary bonus

Region: North America
Country: United States

More jobs like this