Foundation Models for Data Research Intern: 2025

Multiple Cities

IBM

For more than a century, IBM has been a global technology innovator, leading advances in AI, automation and hybrid cloud solutions that help businesses grow.

View all jobs at IBM

Apply now Apply later

Introduction
IBM Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today’s most complex challenges, whether it’s discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways or enabling retailers to personalize customer service.

Your Role and Responsibilities This is for a 2025 summer internship with the following start dates: May – August or June – September for quarter system schools.

We are broadly interested in further improving the capabilities of foundation models (FMs) for a range of data management tasks such as data discovery, metadata enrichment, data access and retrieval with querying, and automated data-driven insights.
Topics of interest include research on interactive orchestration of data workflows such as natural language to data insights spanning multiple tools and functions, knowledge-driven data discovery and querying with graphs and mutli-modal FMs, step-by-step planning and reasoning for complex data workflows , and low-computational cost inference techniques for FMs to efficiently automate or assist users with data tasks.
We are looking for interns with skills and tasks of interest include:

  • [LLM for code generation] Research for effective use of foundational models for code generation pipelines specific to data tasks such as SQL for data retrieval
  • [Agents and Reasoning] Research for developing novel autonomous agentic systems to compete with Text-to-SQL on public leaderboards like BIRD and Spider 2.0
  • [Knowledge Graphs, Multi-Modal FMs] Research for novel ways to combine foundational models, knowledge graphs, and multi-modal data for improving tasks such as data discovery and automated text-to-sql
  • [FM Inference] Research for improving foundation models inference in terms of both answer generation and computational cost.


Required Technical and Professional Expertise

  • Applicants should be PhD & MS students pursuing graduate studies.
  • Pursuing graduate studies in computer science and related fields.
  • Having at least one Research publication at a top conference in AI.
  • Familiarity and working expertise with large language models.


Preferred Technical and Professional Expertise

  • Familiarity with knowledge graphs, RAG, agentic frameworks.
  • Familiarity with reinforcement learning, knowledge distillation and prompt optimization.
  • Familiarity with SQL.

Key Job Details
Role:Foundation Models for Data Research Intern: 2025 Location: Multiple Locations See All Yorktown Heights San Jose Cambridge Albany Category:Research Employment Type:Full-time OR Part-time Travel Required:No Travel Contract Type:Internship Company:(0147) International Business Machines Corporation Req ID:729601BR

Projected Minimum Salary:$85,320 per year Projected Maximum Salary:$85,320-$156,420/year per year Date Posted:September 29, 2024
Apply now Apply later
  • Share this job via
  • 𝕏
  • or
Job stats:  5  3  0
Category: Research Jobs

Tags: Blockchain Computer Science Data management LLMs PhD Pipelines RAG Reinforcement Learning Research SQL

More jobs like this