Senior Software Engineer
Remote
Bodo.ai
Bodo is a next-generation compute engine that can speed up and lower costs of long-running data processing and ETL/ELT jobsOur compiler technology has already empowered some of the world’s most data-driven companies to solve their data challenges with unprecedented speed and scalability. Backed by leaders like Dell Technologies, Snowflake, and AMD, and adopted by Fortune 10 customers, we are just beginning to unlock the true potential of data platforms.
That’s why we call it Transformative Python.
We’re looking for talented engineers with a passion for compilers, HPC, and AI to join us on this exciting journey to reshape the future of data analytics.
We are seeking a talented and experienced Senior Software Engineer to join our team as the Technical Lead. The ideal candidate will have experience designing, developing, and maintaining data pipelines combining data infrastructure and AI training infrastructure, to create an end-to-end product.
About the role
- Architect and design a Python package to help users create scalable pipelines, to go from ‘raw data’ to a trained AI model; including data filtering, data cleaning, data visualization, synthetic data creation, and integration into ML training (incl RL)
- Work with large datasets to develop both generic models as well as fine-tuned AI models, especially LLMs, using the package
- Continually improve the package by incorporating state-of-the-art techniques and frameworks
Experence
- 10+ years experience in data engineering or similar roles, with strong knowledge of designing and implementing complex AI and ML solutions, as a Senior Data Scientist, Machine Learning Engineer, or AI Engineer
- Strong proficiency in building large-scale data processing pipelines with AI training, familiar with distributed workloads (e.g., multiprocessing, MPI, Ray, Dask, Spark)
- Experience developing end-to-end pipelines for model training; from handling structured and unstructured data sources to cleaning and creating synthetic data to actual training
- Experience with AI technologies across the training journey, intimate familiarity with using Pytorch/ Horovod/ TensorflowAbility to take extreme ownership over your work
- Excellent problem-solving and communication skills
- Active GitHub contributions are a big plus
- Built Data pipelines for ML Training (Must, Ideally: Ray)
At Bodo, we embrace the challenge of building transformative technology and are looking for engineers with a passion for making an impact. Studies show that underrepresented groups may hesitate to apply if they don’t meet 100% of the qualifications. We encourage you to apply, even if you feel you don’t check every box. We’re looking for potential and drive just as much as qualifications. We’re excited to see what you’ll bring to Bodo.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Data Analytics Data pipelines Data visualization Engineering GitHub Horovod HPC LLMs Machine Learning Model training Pipelines Python PyTorch Snowflake Spark Unstructured data
Perks/benefits: 401(k) matching Career development Equity / stock options Flex hours Gear Health care Parental leave
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.