Senior Machine Learning Scientist, Gen AI
Madrid
Adyen
End-to-end payments, data, and financial management in one solution. Meet the financial technology platform that helps you realize your ambitions faster.This is Adyen
Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.
For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and support to ensure they are enabled to truly own their careers. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team. Together, we deliver innovative and ethical solutions that help businesses achieve their ambitions faster.
This is Gen AI
Our team's mission is to create a Generative AI platform at Adyen that supports various applications based on LLMs. This involves developing platform-oriented components for deploying an LLM backend within Adyen's GPU cluster in Kubernetes, with features like monitoring, access control, rate limiting, prompt debugging, and experiment tracking. We mainly use Open Source frameworks like HuggingFace and LangChain, and models like Llama or Mixtral. This involves developing platform components, but also delivering on some of the most promising use cases across different areas within the company. Through use cases like support case routing and sentiment analysis, they showcase AI's adaptability across different domains within the organization, revolutionizing workflows and decision-making processes. Read more about the work of Gen AI here.
What you'll do
You will be responsible for building and interpreting algorithms that power data products at Adyen using Generative AI or NLP techniques. That means leading end-to-end development from prompt engineering, few-shot learning, and fine-tuning self-hosted LLMs if needed. You will work on Natural Language Processing techniques and LLMs to tackle text classification, sentiment analysis, and summarization of Question-Answering retrieval.
- Provide technical guidance and mentorship to other data scientists specialized in different domains.
- Collaborate with cross-functional teams across Adyen to integrate LLM applications in different systems.
- Ensure efficient data preprocessing and ETL pipelines to create features that feed machine learning algorithms during training.
- Set up experiments to adjust modeling decisions, perform exploratory analysis, tune hyperparameters, or validate hypothesis selection for the right metric set for each business problem. Report metrics and monitor performance to keep stakeholders updated and ensure a smooth model deployment in production.
- Iterate with merchants and product audiences, creating algorithms that power state-of-the-art machine learning-based solutions, and be able to explain the reasons behind the executed inference.
Who you are
- You have 5+ years of professional experience as a Machine Learning- or Data Scientist.
- Proven experience developing, training, validating, benchmarking, and monitoring machine learning algorithms, particularly in the natural language processing domain
- Extensive knowledge of machine learning algorithms, including a deep understanding of statistical modeling and Python development tooling and libraries: Pandas, Numpy, Scikit-learn, Pytest, PySpark, SQL, Airflow, and MLflow. Strong experience with machine learning frameworks such as Pytorch and HuggingFace Transformers.
- Familiarity with prompt engineering techniques and frameworks like LangChain, LlamaIndex, or DSpy. Good understanding of LLM models, including other components like VectorDBs and document loaders.
- Knowledge of version control systems (e.g., Git), C/CD, RESTful APIs, containerized applications (Docker), and microservices deployed in Kubernetes. Adherence to coding best practices, including code reusability, documentation, and testing.
- You are an analytical thinker with a knack for understanding operational requirements and converting them into actionable ML solutions.
- Proactively taking the lead in projects, from ideation to deployment, while ensuring stakeholder collaboration.
- You can communicate complex outcomes with clarity over a wide range of audiences.
- We appreciate a forward-thinking mindset driven by experimentation and iterative development. A solid foundation in statistics and mathematics will serve you well in this role.
Data Positions at Adyen
We know companies handle different definitions for their data-related positions; this is for instance dependent on the size of a company. Since the birth of the Data Solution and the growth of all data streams, we categorized and defined all our positions. Have a look at this blogpost to find out!
This role is based out of our Madrid office. We are an office-first company and value in-person collaboration; we do not offer remote-only roles.
Our Diversity, Equity and Inclusion commitments
Our unique approach is a product of our diverse perspectives. This diversity of backgrounds and cultures is essential in helping us maintain our momentum. Our business and technical challenges are unique, and we need as many different voices as possible to join us in solving them - voices like yours. No matter who you are or where you’re from, we welcome you to be your true self at Adyen.
Studies show that women and members of underrepresented communities apply for jobs only if they meet 100% of the qualifications. Does this sound like you? If so, Adyen encourages you to reconsider and apply. We look forward to your application!
What’s next?
Ensuring a smooth and enjoyable candidate experience is critical for us. We aim to get back to you regarding your application within 5 business days. Our interview process tends to take about 4 weeks to complete, but may fluctuate depending on the role. Learn more about our hiring process here. Don’t be afraid to let us know if you need more flexibility.
Adyen is an equal opportunity employer. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, veteran status, genetic information, marital status or any legally protected status.
All your information will be kept confidential according to EEO guidelines.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow APIs Classification Docker Engineering ETL Generative AI Git GPU HuggingFace Kubernetes LangChain LLaMA LLMs Machine Learning Mathematics Microservices MLFlow Model deployment NLP NumPy Open Source Pandas Pipelines Prompt engineering PySpark Python PyTorch Scikit-learn SQL Statistical modeling Statistics Testing Transformers
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.