Data science | computer science internship: AI and LLM for diagnostics
Eindhoven, Building 25, Netherlands
ASML
ASML gives the world's leading chipmakers the power to mass produce patterns on silicon, helping to make computer chips smaller, faster and greener.Introduction
ASML maintains an extensive collection of diagnostic documents called PCS that detail the Problem, Cause, and Solution faced by our engineers while Installing or Servicing the machines. While these documents contain valuable diagnostic information, their unstructured format makes it challenging to scale and standardize diagnostic procedures. A better alternative is an executable diagnostic process called DDFs which offers a more consolidated and structured approach by using dynamic flowcharts to guide engineers through a sequence of actions to identify the failure cases. As our install base and engineer workforce increases, providing service actions in a guided way compared to relying on the engineer's ability to find the correct PCS becomes even more significant. Leveraging our existing PCS to DDF can also help us unlock valuable data mining opportunities, enabling us to understand which PCSs are more used, successful, or relevant ultimately leading to improved and more leaner knowledge base management.
Your assignment
This internship project focuses on developing an intelligent system to automate the conversion of legacy Problem-Cause-Solution (PCS) documents into structured Deterministic Diagnostic Flow (DDF) documents using LLMs. The project aims to streamline diagnostic processes while leveraging the existing organizational knowledge.
Generate diagnostic questions and service action from PCS documents using LLMs. Compare LLMs performance against other baseline models. If necessary, fine tune LLMs using ASML specific data to improve performance. Evaluate the LLMs performance for the question generation problem using standard evaluation techniques. Scaling system to handle large number of PCS documents. Stretch Goal: Constructing control matrix on the questions generated to lead to unique failure modes
Your profile
To be a great match for this internship you:
Are a graduating master student in Computer Science, Data Science, Machine Learning or a similar field.
Are enthusiastic about GenAI and LLMs or related technologies.
Have some basic knowledge of or familiarity with PyTorch and/or TensorFlow and are experienced in Python.
Are able to work independently and autonomously and are pro-active.
Have strong communication skills and are fluent in English (verbal and written).
This is a master graduation internship for a minimum of 6 months, for 4 to 5 days per week (at least 3 days onsite). The start date of this internship is September 2025.
Other requirements you need to meet
You are enrolled at an educational institute for the entire duration of the internship;
You are located in the Netherlands to perform your internship. In case you are currently living/studying outside of the Netherlands, your CV/motivation letter includes the willingness to relocate;
If you are a non-EU citizen, studying in the Netherlands, your university is willing to sign the documents relevant for doing an internship (i.e., Nuffic agreement).
Diversity and inclusion
ASML is an Equal Opportunity Employer that values and respects the importance of a diverse and inclusive workforce. It is the policy of the company to recruit, hire, train and promote persons in all job titles without regard to race, color, religion, sex, age, national origin, veteran status, disability, sexual orientation, or gender identity. We recognize that diversity and inclusion is a driving force in the success of our company.
Need to know more about applying for a job at ASML? Read our frequently asked questions.
Tags: Computer Science Data Mining Generative AI LLMs Machine Learning Python PyTorch TensorFlow
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.