Researcher, Machine Learning
665 Clyde Avenue, Mountain View, CA, USA
Samsung Research America
For more than 70 years, Samsung has been at the forefront of innovation. Our discoveries, inventions and breakthrough products have helped shape the history of the digital revolution. We continue to expand our global reach and open new...Lab Summary:
Samsung Research America is looking for outstanding researchers to join our Emerging Technologies (ET) Group. The ET Group is uniquely positioned to be at the heart of Samsung Research America’s innovation engine aimed towards advancing Samsung’s product offerings across smartphones, wearables, TVs, XR devices, and identifying the next big growth drivers for Samsung across a range of emerging technologies.
Position Summary:
We are looking for highly skilled and motivated researchers/engineers who can contribute to the development of fast LLM inferencing techniques, AI technology stack optimization, neuro-symbolic AI models, temporal knowledge graphs, and multimodal reasoning.
Position Responsibilities
- Speculative decoding algorithms tailored for LLMs
- Optimization of GPU/TPU utilization to minimize latency in token generation pipelines
- Analyzing bottlenecks in model inference (e.g., memory bandwidth, compute constraints) and propose solutions to improve benchmark model performance pre- and post-optimization
- Lead integration of multiple projects at the intersection of Cognitive Modeling, Machine Learning, Knowledge Representation and Reasoning
- Collaborate with a multidisciplinary team of researchers across different teams
- Stay ahead of industry trends in LLM acceleration (e.g., dynamic batching, quantization, kernel fusion)
- Publish findings and contribute to open-source projects
- Generate creative solutions (patents), publish research results in top conferences (papers)
Required Skills:
- PhD in C.S., EE or related fields or equivalent combination of education, training and experience
- 2+ years of work experience after PhD.
- Expertise in knowledge graph and retrieval augmented generation (RAG) models
- Experience in Contextual intelligence
- Experience in large language model (LLM), including Transformer model architecture, attention mechanisms, decoder only LLMs, and autoregressive model optimization
- Proficiency in PyTorch, CUDA, and distributed training/inference framework (e.g., DeepSpeed, vLLM)
- Hands-on experience profiling and optimizing LLMs on GPUs/TPU
- A strong publication record in top-tier AI and NLP conferences is a plus
Special Attributes:
- Ability to debug complex, latency-critical systems
- Strong analytical and problem-solving skills, with a keen attention to detail and a passion for pushing the boundaries of AI capabilities
- Excellent written and verbal communication skills, with the ability to present complex concepts and research findings in a clear and concise manner
- Demonstrated ability to work independently as well as collaboratively in a fast-paced research and development environment
Additional Information
Be careful not to disclose information related to the trade secrets of your previous or current employer(s)
Essential Job Functions
This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.
Samsung Research America is committed to complying with all Federal, State and local laws related to the employment of qualified individuals with disabilities. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the recruiter or email sratalent@samsung.com.
Affirmative Action / Equal Opportunity
Samsung Research America is an Affirmative Action and Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability, or status as a protected veteran.
For more information regarding protection from discrimination under Federal law for applicants and employees, please refer to the links below.
Tags: Architecture Autoregressive models CUDA GPU LLMs Machine Learning Model inference NLP Open Source PhD Pipelines PyTorch RAG Research vLLM
Perks/benefits: Career development Conferences Salary bonus
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.