Data for LLMs - Software Engineer Intern: 2025

Multiple Cities

IBM

For more than a century, IBM has been a global technology innovator, leading advances in AI, automation and hybrid cloud solutions that help businesses grow.

Introduction
Want to help prepare and govern data for IBM’s Granite models? We are a group of scientists, engineers, and designers working on the state-of-the-art Data and Model Factory that produces all of IBM’s Granite models. Our work enables and accelerates the entire data pipeline, from data clearance and acquisition to quality-focused data engineering. These data are used for pre-training, fine-tuning, and instruction-tuning, or in RAG solutions powered by IBM Granite. We thrive on open-source innovation, responsible use of data and AI, and collaboration across disciplines, including backend engineering, data science, distributed computing, and natural language processing, among others.

Your Role and Responsibilities
This is a 2025 summer internship. It runs May – August, or June – September for students at quarter-system schools.

During your internship, you can expect to work on challenging engineering problems, often involving large-scale data and models, and produce cutting-edge technology in a diverse and nurturing research environment. You’ll learn and practice how to define problems, build prototypes, test hypotheses, and deploy results. In the past, interns have contributed to open-source projects, built functioning systems and prototypes, and published their results as papers or patents.

Required Technical and Professional Expertise

  • Applicants should be enrolled in a bachelor’s-level program and have a background in a science, technology, engineering, or mathematics discipline.
  • Experience with programming languages and frameworks such as Python, C++, Spark, and Ray.
  • Your areas of experience should include foundation models, machine learning, AI, natural language processing, data engineering, cloud computing, database management, and other computer science and engineering topics.


Preferred Technical and Professional Expertise

  • Your areas of interest should include foundation models, machine learning, AI, natural language processing, data engineering, cloud computing, database management, and other computer science and engineering topics.

Key Job Details
Role: Data for LLMs – Software Engineer Intern: 2025
Location: Multiple Locations (Yorktown Heights, San Jose, Cambridge, Albany)
Category: Software Engineering
Employment Type: Full-time or Part-time
Travel Required: No Travel
Contract Type: Internship
Company: (0147) International Business Machines Corporation
Req ID: 729592BR

Projected Minimum Salary: $71,280 per year
Projected Maximum Salary: $130,680 per year
Date Posted: November 25, 2024

Tags: Computer Science Engineering LLMs Machine Learning NLP Open Source Python RAG Research Spark
