Thesis Work 30hp - Merging Code Large Language Models for Enhanced Code Generation

Södertälje, SE, 151 38

Applications have closed

Scania Group

Scania is a world-leading provider of transport solutions, including trucks and buses for heavy transport applications combined with an extensive product-related service offering.

View all jobs at Scania Group

Find more jobs like this Jobs in Sweden

Posted 2 months ago

Thesis projects at Scania are excellent ways of making contacts for your future working life. Many of our current employees started their career with a thesis project.

Background:
Large Language Models (LLMs) have revolutionized code generation, enabling developers to write more efficient and accurate software with the assistance of AI. However, individual models often have limitations in scope, specialization, or performance. Model merging—combining multiple LLMs to leverage their unique strengths—presents a promising avenue to create superior models without the high computational costs associated with fine-tuning. By merging code-specific LLMs, we aim to enhance their capabilities, achieving better performance on diverse coding tasks while maintaining cost-effectiveness.

Target:
This project explores the potential of merging multiple code-focused LLMs to develop a unified model that excels in various code generation tasks. The research will investigate different merging techniques, evaluate their effectiveness, and identify the optimal strategies for combining models to maximize performance. Potential contributions include improved code generation accuracy, increased versatility across programming languages, and reduced reliance on expensive computational resources. Additionally, the project will examine the theoretical and practical challenges of model merging, such as compatibility of model architectures and the preservation of specialized knowledge.

Example of assignments:
•   Develop Merging Algorithms – Study and or develop different algorithms for merging multiple code LLMs.
•   Evaluate Performance – Compare the merged model against individual models and fine-tuned alternatives on various code generation benchmarks to assess improvements in accuracy and efficiency.
•   Develop Benchmark – Develop novel code generation benchmark.

Education:
MSc in Computer Science or similar, with some background in formal methods.

Contact person and supervisor:
Liv Kåreborn, AI Sweden, liv.kareborn@ai.se
Minal Patil, senior researcher, Scania, minal.patil@scania.com
Mattias Nyberg, Adj. prof, KTH / Research Lead, Scania, mattias.nyberg@scania.com

Number of students: 1-3
Time:20 weeks, full time 40 hours per week
Start: Jan 2025
Credits: 30hp

Application:
Enclose CV, personal letter and transcript of grades.
Application shall be registered in both: Thesis project application, and the "Apply"-button on this page

A background check might be conducted for this position. We are conducting interviews continuously and may close the recruitment earlier than the date specified.

Find more jobs like this Jobs in Sweden

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 1 0 0

Category: NLP Jobs

Tags: Architecture Computer Science LLMs Research

Region: Europe

Country: Sweden

More jobs like this

« Back to job search To the top ↑

Explore more career opportunities

Find even more open roles below ordered by popularity of job title or skills/products/technologies used.

Thesis Work 30hp - Merging Code Large Language Models for Enhanced Code Generation

Södertälje, SE, 151 38

Applications have closed

Scania Group

More jobs like this

Language Engineer - Native in Danish (Denmark)

Language Engineer - Native in Swedish (from Sweden)

INTERN - Data Scientist (GPT, SQL, 6 Months)

Internship in Advancing Drug Development with LLMs for Preclinical Studies (m/f/d)

PhD Position in Advanced Student Modeling and Tailored Large Language Models for Personalized Learning in Computer Science

Postdoctoral Researcher Position in Advanced Student Modeling and Tailored Large Language Models for Personalized Learning in Computer Science

Internship in Advancing Drug Development for Ophthalmology with multimodal LLMs (m/f/d)

Bell Labs Internship on Source-aware Language Models (PhD)

Hebrew Language Data Annotator - Barcelona, Spain

Finnish Language Data Annotator - Barcelona

Explore more career opportunities