Software Engineer - Reinforcement Learning

Zurich or Remote (EMEA)

DFINITY

The DFINITY Foundation is a major contributor to the Internet Computer blockchain.

Employment Type: 6-Month Contract

We are looking for a Software Engineer with a focus on data preparation and AI model training. You will work on assembling, annotating, and cleaning training data, while contributing to reward modeling and supervised fine-tuning tasks.

You might thrive in this role if you:

  • Have a deep understanding of machine learning and machine learning applications.
  • Have working knowledge of, and experience with, tuning large language models (multimodal) and building evaluations.
  • Are willing to dive into large codebases to debug.
  • Thrive in a dynamic and technically complex environment.
  • Have a track record of delivering novel, outside-the-box solutions under real-world constraints.

 

Responsibilities

  • Data Assembly & Annotation: Gather and annotate training data for AI models, ensuring it meets the quality requirements for reward modeling and supervised fine-tuning.
  • Data Cleaning & Processing: Conduct data cleaning and preprocessing to ensure models receive high-quality input.
  • Model Training: Participate in the training and fine-tuning of models, ensuring that they meet performance and accuracy standards.
  • Collaboration: Work with AI engineers, data scientists, and other team members to ensure efficient workflows and data handling.
  • Continuous Improvement: Support iterative improvements to models based on performance monitoring and feedback.

 

Requirements

  • Experience: At least 3 years of experience working in a software engineering role focused on AI/ML tasks.
  • Data Expertise: Hands-on experience assembling, annotating, and cleaning training data for machine learning models.
  • Technical Skills: Proficiency in Python and experience with AI frameworks like TensorFlow or PyTorch.
  • Model Training: Familiarity with model training, reward modeling, and supervised fine-tuning techniques.
  • Attention to Detail: Strong focus on data quality and attention to detail when handling large datasets.

 

Bonus Points

  • Experience working with reward modeling for AI systems.
  • Familiarity with data labeling tools and techniques for supervised fine-tuning.
  • Knowledge of cloud platforms for AI/ML workloads.

About DFINITY and the Internet Computer:

DFINITY is a leading contributor to the Internet Computer Protocol (ICP), with a mission to bring the world's compute onto the secure ICP network. Built on its unique third-generation blockchain technology, ICP enables the development and operation of a new generation of unstoppable, tamper-proof, fully decentralized web applications. Its powerful technology can run entire AI models within smart contracts, representing a major advancement for secure AI. Through seamless integration with Bitcoin, Ethereum, and other networks, ICP facilitates multi-chain operations for digital assets and web3.
Join our team of over 250 talented individuals, including world-renowned cryptographers, distributed systems engineers, programming language experts, and industry leaders, who are shaping the future of the internet and web3. DFINITY was founded in 2016 by entrepreneur and crypto theoretician Dominic Williams.

All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Tags: Blockchain Crypto Data quality Distributed Systems Engineering LLMs Machine Learning ML models Model training Python PyTorch Reinforcement Learning TensorFlow

Perks/benefits: Career development

Regions: Remote/Anywhere Africa Europe Middle East
Country: Switzerland