Senior Data Scientist, Applied AI

Cambridge, MA USA

Flagship Pioneering, Inc.

We are Flagship Pioneering We are a biotechnology company that invents platforms and builds companies that change the world. Pioneering Partnerships…

View all jobs at Flagship Pioneering, Inc.

Apply now Apply later

Company Summary

Lila Sciences is a privately held, early-stage technology company pioneering the application of artificial intelligence to transform every aspect of the scientific method. Lila is backed by Flagship Pioneering, which brings the courage, long-term vision, and resources needed to realize unreasonable results. Join our mission-driven team and contribute to the future of science.

Our Life Sciences effort is leveraging AI and high-throughput automation for valuable therapeutic discovery and development across biological modalities.

At Lila, we are uniquely cross-functional and collaborative. We are actively reimagining the way teams work together and communicate. Therefore, we seek individuals with an inclusive mindset and a diversity of thought. Our teams thrive in unstructured and creative environments. All voices are heard because we know that experience comes in many forms, skills are transferable, and passion goes a long way.

If this sounds like an environment you’d love to work in, even if you only have some of the experience listed below, please apply.

The Role:

We are seeking a  Senior Data Scientist to join our Applied AI group and lead data-driven initiatives that enhance our Large Language Model (LLM) capabilities and advance our mission toward Scientific Superintelligence. In this role, your primary focus will be collecting, monitoring, and analyzing chat logs to uncover actionable insights for continuous LLM refinement. You will collaborate closely with software engineers, ML researchers, and domain scientists to design analytical workflows, evaluate model performance in real-world settings, and help instill best practices for data-centric decision-making in AI. 

 Key Responsibilities: 

  • Data Collection & Analysis: Gather and preprocess large volumes of internal chat logs, applying statistical methods and NLP techniques to uncover trends, patterns, and areas for LLM improvements. 
  • LLM Evaluation & Optimization: Design and implement experiments to assess model performance, guiding model tuning and feature enhancements based on empirical evidence. 
  • Cross-Functional Collaboration: Work alongside Data Engineers and ML researchers to build robust data pipelines; translate insights into data-driven recommendations for stakeholders across the organization. 
  • Data Visualization & Reporting: Develop dashboards and visualizations that effectively communicate complex findings to both technical and non-technical audiences, facilitating informed decision-making. 
  • Statistical Modeling & ML: Apply machine learning techniques to generate predictive insights, explore generative AI methods, and validate data-driven hypotheses. 
  • Continuous Improvement: Champion best practices in reproducible research, version control, and documentation to ensure reliability and scalability of data workflows. 

Qualifications: 

  • Educational Background: Ph.D. or Master’s degree in a scientific field of study. 
  • Professional Experience: 3+ years of industry experience in data science, analytics, or ML model development—ideally in a production environment. 
  • Technical Proficiency
  • Python & OOP: Strong Python skills with a solid grasp of object-oriented programming principles. 
  • ML & Statistical Methods: Hands-on experience in machine learning, data analysis, and statistical modeling. 
  • NLP: Familiarity with natural language processing techniques, especially for text data analytics and model evaluation. 
  • Data Analysis & Visualization: Proven ability to transform raw data into actionable insights using modern data analysis libraries (e.g., Pandas, Plotly, or similar). 
  • Communication & Collaboration: Exceptional communication skills with the ability to distill complex technical concepts for stakeholders across disciplines. 

Preferred Qualifications: 

  • Experience with ML & Generative AI: Prior work on data pipelines specifically supporting ML or generative AI models; familiarity with the MLOps lifecycle. 
  • Retrieval-Augmented Generation (RAG): Hands-on experience with vector database and RAG techniques for AI systems. 
  • Agentic AI Systems: Exposure to or experience building agent-driven platforms where AI systems autonomously execute complex tasks. 
  • Kubernetes Proficiency: Comfort with container orchestration and scaling using Kubernetes. 
  • Startup Environment: Experience adapting quickly and delivering results in a fast-paced, evolving environment. 
  • Domain Background: Exposure to life sciences, material sciences, or related fields. 

More About Flagship Pioneering

Flagship Pioneering is a biotechnology company that invents and builds platform companies, each with the potential for multiple products that transform human health or sustainability. Since its launch in 2000, Flagship has originated and fostered more than 100 scientific ventures, resulting in more than $90 billion in aggregate value. Many of the companies Flagship has founded have addressed humanity’s most urgent challenges: vaccinating billions of people against COVID-19, curing intractable diseases, improving human health, preempting illness, and feeding the world by improving the resiliency and sustainability of agriculture.  Flagship has been recognized twice on FORTUNE’s “Change the World” list, an annual ranking of companies that have made a positive social and environmental impact through activities that are part of their core business strategies, and has been twice named to Fast Company’s annual list of the World’s Most Innovative Companies. Learn more about Flagship at www.flagshippioneering.com.

Flagship Pioneering and our ecosystem companies are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.

At Flagship, we recognize there is no perfect candidate. If you have some of the experience listed above but not all, please apply anyway. Experience comes in many forms, skills are transferable, and passion goes a long way. We are dedicated to building diverse and inclusive teams and look forward to learning more about your unique background.

Recruitment & Staffing Agencies: Flagship Pioneering and its affiliated Flagship Lab companies (collectively, “FSP”) do not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to FSP or its employees is strictly prohibited unless contacted directly by Flagship Pioneering’s internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of FSP, and FSP will not owe any referral or other fees with respect thereto.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: Data analysis Data Analytics Data pipelines Data visualization Generative AI Kubernetes LLMs Machine Learning ML models MLOps NLP OOP Pandas Pipelines Plotly Python RAG Research Statistical modeling Statistics

Perks/benefits: Career development Startup environment Team events

Region: North America
Country: United States

More jobs like this