Bioinformatics Software Systems Engineer
Boston, MA
Manifold Bio
Our innovative technology combines protein barcoding and high-throughput in vivo capabilities to design drugs with more precise molecular testing.Manifold Bio is a dynamic biotech company building a pipeline of targeted biologics using a novel in vivo-centric discovery approach. Our drug discovery engine is differentiated by massively parallel screening in vivo from the beginning of our discovery process. This unique platform is powered by a proprietary protein barcoding technology that allows multiplexed protein quantitation at unprecedented scale and sensitivity. We combine this and other high-throughput protein engineering approaches with computational design to create antibody-like drugs and other biologics. Our world-class team of protein engineers, biologists, and computational scientists are working together to aim the platform at therapeutic opportunities where precise targeting is the key to overcoming clinical challenges.
Position
Manifold Bio is seeking an exceptional engineer to help lead the build out of our Data Platform at Manifold Bio. This role will be responsible for both identifying and implementing data solutions across our entire stack: identifying the best solutions to first collect, curate and manage our rich data streams and then to make data accessible to computational, ML, and general researchers alike. This role will work closely with Computational Scientists and the CTO, as well as wet lab scientists across the company. You’ll be expected to be strong at both data engineering and designing new system architectures. Manifold Bio runs largely on AWS and Benchling, Benchling Connect and Benchling Vivo–so a familiarity with both AWS and Benchling API/sdk is a plus. A passion for building infrastructure, supporting researchers in the life sciences, and a commitment to strong best practices are all qualities that would be a great fit.
Responsibilities
- Work closely with Manifold’s Computational Team and wet-lab scientists to identify and deploy solutions to augment our ability to capture, store, and make decisions based on our data
- Create tools, models, algorithms and data pipelines to support novel data streams
- Create interfaces for researchers to access data without engineering support
- Present and report on data model and infrastructure updates to the team
- Own interfaces and integrations with partner services, including Benchling
Qualifications
- 5+ years of relevant programming experience (including Python)
- Demonstrated and proven experience modeling and building data solutions
- Experience developing, orchestrating and supporting ETL pipelines
- Cloud computing experience with Amazon Web Services (AWS)
- Experience with data profiling, data quality, master data management, metadata management
- Experience across multiple operating systems: Unix/Linux, Mac, and [tolerance of] Windows
- Detail-oriented with excellent problem identification and problem-solving skills
- Demonstrated ability to work both independently and as part of a team
- A deep passion for data modeling and developing new methods
- PREFERRED: master’s degree, project management experience, relevant certs in data science or project management
- PREFERRED: experience working with Next Generation Sequencing (NGS) data
We value different experiences and ways of thinking and believe the most talented teams are built by bringing together people of diverse cultures, genders, and backgrounds.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture AWS Bioinformatics Data management Data pipelines Data quality Drug discovery Engineering ETL Linux Machine Learning Pipelines Protein engineering Python
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.