PhD Research Engineer Intern - Language Models for Biological Sequences
London
InstaDeep
InstaDeep delivers AI-powered decision-making systems for the Enterprise. With expertise in both machine intelligence research and concrete business deployments, we provide a competitive advantage to our customers in an AI-first world.
InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.
Join us to be a part of the AI revolution!
As a PhD Intern in the London Research Team you will be responsible for implementing and developing biological language models, model architectures and novel algorithms, as well as identifying and investigating promising research ideas to the most relevant challenges in biological sequence design. Advances in large language models have enabled us to leverage powerful pre-trained representations for in-silico experiment design optimization loops in few-shot and low-data settings. Many challenges remain on how to best combine these tools for improved downstream tasks. We’re looking for a candidate to contribute to the development of state-of-the-art biological language models focused on advancing fitness prediction and multi-round sequence design methodologies.
Right to work: Please note that you will require the legal right to work in the location you are applying for.
Join us to be a part of the AI revolution!
As a PhD Intern in the London Research Team you will be responsible for implementing and developing biological language models, model architectures and novel algorithms, as well as identifying and investigating promising research ideas to the most relevant challenges in biological sequence design. Advances in large language models have enabled us to leverage powerful pre-trained representations for in-silico experiment design optimization loops in few-shot and low-data settings. Many challenges remain on how to best combine these tools for improved downstream tasks. We’re looking for a candidate to contribute to the development of state-of-the-art biological language models focused on advancing fitness prediction and multi-round sequence design methodologies.
Role responsibilities:
- Support the efforts of the Research Team through the development of novel methods and applications under the guidance of our Research Scientists and Engineers.
- Design and implement ML models for proof of concept and benchmarking.
- Write high-quality, maintainable, well-documented, and modular python code.
- Report and present experimental results and research findings, verbally and in writing.
- Contribute to team research and publications.
Requirements
- Currently enrolled in a PhD programme (or recent graduate) in a related STEM discipline.
- Experience coding with deep learning frameworks such as JAX, Pytorch and/or Tensorflow, along with theoretical and practical knowledge in machine learning, deep learning, LLMs, Bayesian optimization and/or non-parametric models.
- Excellent communication skills and collaborative spirit.
- Good programming skills in Python.
- Proven ability to contribute to research communities and/or efforts, as evidenced by publishing scientific papers in leading journals or conferences (JMLR, ICLR, NeurIPS, ICML, etc.).
- Work permit for the UK for the duration of the internship.
Beneficial
- Software engineering skills (git, quality coding standards, …)
- Experience implementing/training/fine-tuning protein language models
- Specific experience in any of the following domains: coding in JAX, and training models using DP/FSDP techniques.
- Experience building machine learning models for biological data.
Right to work: Please note that you will require the legal right to work in the location you are applying for.
Job stats:
14
6
0
Categories:
Big Data Jobs
Engineering Jobs
NLP Jobs
Research Jobs
Tags: Architecture Bayesian Deep Learning Engineering FSDP GCP Git Google Cloud ICLR ICML JAX JMLR LLMs Machine Learning ML models NeurIPS PhD Python PyTorch Research STEM TensorFlow
Perks/benefits: Conferences
Region:
Europe
Country:
United Kingdom
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
BI Developer jobsData Engineer II jobsStaff Data Scientist jobsPrincipal Data Engineer jobsSr. Data Engineer jobsStaff Machine Learning Engineer jobsPrincipal Software Engineer jobsData Science Manager jobsData Manager jobsData Science Intern jobsSoftware Engineer II jobsDevOps Engineer jobsBusiness Intelligence Analyst jobsJunior Data Analyst jobsData Analyst Intern jobsData Specialist jobsBusiness Data Analyst jobsLead Data Analyst jobsStaff Software Engineer jobsSr. Data Scientist jobsSenior Backend Engineer jobsData Governance Analyst jobsAI/ML Engineer jobsData Engineer III jobsResearch Scientist jobs
Consulting jobsAirflow jobsMLOps jobsOpen Source jobsKPIs jobsKafka jobsJavaScript jobsEconomics jobsLinux jobsTerraform jobsNoSQL jobsData Warehousing jobsComputer Vision jobsGoogle Cloud jobsGitHub jobsRDBMS jobsPostgreSQL jobsR&D jobsScikit-learn jobsStreaming jobsPhysics jobsData warehouse jobsBanking jobsHadoop jobsdbt jobs
Scala jobsLooker jobsPandas jobsOracle jobsBigQuery jobsClassification jobsReact jobsRAG jobsCX jobsScrum jobsDistributed Systems jobsPySpark jobsIndustrial jobsPrompt engineering jobsELT jobsJira jobsMicroservices jobsRedshift jobsGPT jobsRobotics jobsTypeScript jobsOpenAI jobsLangChain jobsSAS jobsJenkins jobs