Research Engineer in AI
Cape Town
⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️
InstaDeep
InstaDeep delivers AI-powered decision-making systems for the Enterprise. With expertise in both machine intelligence research and concrete business deployments, we provide a competitive advantage to our customers in an AI-first world.Join us to be a part of the AI revolution!
The Role:We seek a Research Engineer to join our research team and play a key role in advancing our mission to build cutting-edge AI models for decision-making with application to large-scale industrial optimisation. You will be involved in all aspects of the research process, including:Developing and implementing high-quality Python code in a production-level environment.Distributing and accelerating code on large-scale accelerator clusters.Identifying and preprocessing relevant data sources, and building efficient data loading pipelines.Designing and executing training and validation pipelines, and interpreting results.Contributing to the development of research plans and disseminating our work through research papers and participation in conferences.We are a results-oriented, hands-on team that thrives on collaboration. You will have the opportunity to work closely with fellow researchers, applied scientists, and product teams.
Responsibilities:
- Algorithmic Optimisation: Research and understand the latest deep learning literature to implement and optimise state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
- Algorithmic Design: Identify limitations in the application of existing state-of-the-art algorithms for our desired use cases, and propose concrete algorithmic contributions to mitigate those limitations and improve upon existing state-of-the-art.
- Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across different accelerator platforms (GPU/TPU).
- Performance Optimisation: Analyse and profile ML systems under heavy load, pinpointing bottlenecks and implementing targeted optimisations.
- Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
Required Skills:
- A Masters or equivalent in Computer Science, Engineering, Mathematics, or related field.
- Fundamentals of modern Deep Learning and experience with Reinforcement Learning.
- 2+ years of work experience.
- Expertise in Python.
- Experience working with Linux systems.
- Experience with at least one modern machine learning framework (JAX, Tensorflow, PyTorch, etc.)
- Experience with Docker.
Highly Desirable:
- Experience working with JAX and packages within the JAX ecosystem.
- Track record of successfully building and scaling ML models.
- Experience with software profiling, identifying bottlenecks, and delivering efficient solutions.
Right to work: Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Computer Science Deep Learning Distributed Systems Docker Engineering GCP Google Cloud GPU Industrial JAX Linux Machine Learning Mathematics ML models Pipelines Python PyTorch Reinforcement Learning Research TensorFlow
Perks/benefits: Conferences
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.