Software Engineer - Reinforcement Learning
Zurich or Remote (EMEA)
Applications have closed
DFINITY
The DFINITY Foundation is a major contributor to the Internet Computer blockchain.
Employment Type: 6 Month Contract
We are looking for a Software Engineer with a focus on data preparation and AI model training. You will work on assembling, annotating, and cleaning training data, while contributing to reward modeling and supervised fine-tuning tasks.
You might thrive in this role if you:
- Have a deep understanding of machine learning and machine learning applications.
- Have working knowledge of, and experience with, tuning large language models (multimodal) and building evaluations.
- Are willing to dive into large codebases to debug.
- Thrive in a dynamic and technically complex environment.
- Have a track record of delivering novel, outside-the-box solutions under real-world constraints.
Responsibilities
- Data Assembly & Annotation: Gather and annotate training data for AI models, ensuring it meets the quality requirements for reward modeling and supervised fine-tuning.
- Data Cleaning & Processing: Conduct data cleaning and preprocessing to ensure models receive high-quality input.
- Model Training: Participate in the training and fine-tuning of models, ensuring that they meet performance and accuracy standards.
- Collaboration: Work with AI engineers, data scientists, and other team members to ensure efficient workflows and data handling.
- Continuous Improvement: Support iterative improvements to models based on performance monitoring and feedback.
Requirements
- Experience: At least 3 years of experience working in a software engineering role focused on AI/ML tasks.
- Data Expertise: Hands-on experience assembling, annotating, and cleaning training data for machine learning models.
- Technical Skills: Proficiency in Python and experience with AI frameworks like TensorFlow or PyTorch.
- Model Training: Familiarity with model training, reward modeling, and supervised fine-tuning techniques.
- Attention to Detail: Strong focus on data quality and attention to detail when handling large datasets.
Bonus Points
- Experience working with reward modeling for AI systems.
- Familiarity with data labeling tools and techniques for supervised fine-tuning.
- Knowledge of cloud platforms for AI/ML workloads.
About DFINITY and the Internet Computer:
Join our team of over 250 talented individuals, including world-renowned cryptographers, distributed systems engineers, programming language experts, and industry leaders, who are shaping the future of the internet and web3. DFINITY was founded in 2016 by entrepreneur and crypto theoretician Dominic Williams.
All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.
Tags: Blockchain Crypto Data quality Distributed Systems Engineering LLMs Machine Learning ML models Model training Python PyTorch Reinforcement Learning TensorFlow
Perks/benefits: Career development