Principal Data Scientist

Boston, Massachusetts, United States

Full Time Senior-level / Expert USD 203K - 377K * ^est.

One Door

View all jobs at One Door

Apply now Apply later

Posted 6 hours ago

We are looking for a skilled and proactive Data Scientist/GenAI LLM Engineer to play a pivotal role in driving innovation within our organization and transforming retail visual merchandising. This role will play a key part in driving high-impact projects for leading organizations across specialty retail, logistics, supply chain, and beyond.

At the cutting edge of AI/ML technologies, we empower our clients to harness the value of unstructured data and uncover hidden insights within their enterprise information. As an LLM engineer, you’ll bring your deep expertise in LLM/GenAI technologies to the table. In collaboration with our R&D team, product managers, and engineering leads, you'll prototype, build, test, and scale innovative products powered by GenAI/LLM technologies. You'll also be instrumental in fine-tuning model hyperparameters, optimizing configurations, and ensuring the highest level of model performance to drive impactful outcomes for our clients.

RESPONSIBILITIES

Develop LLM solutions on customer data, such as RAG architectures on enterprise knowledge repos, querying structured data with natural language, agents, and content generation.
Develop end-to-end AI/ML solutions using Python, LLM/GenAI frameworks and tools.
Develop CI/CD pipelines, containerize LLM models, and deploy them on cloud or on-premise. Ensure support and maintenance for all LLM/ML model lifecycle stages, including developing training datasets, fine-tuning, testing, deployment pipelines, and ongoing deviation monitoring.
Design prototypes and POCs to showcase feasibility and value; provide architectural solutions.
Research, design, build, and train innovative LLM applications to address complex real-world problems.
Offer technical guidance to clients implementing LLM technologies.

QUALIFICATIONS

Bachelor’s Degree (final-year students may apply) in Statistics, Applied Mathematics, Computer Science, or a related field
3+ years of hands-on experience with Python; 2+ years of experience with command line scripting; 1+ years of experience building and maintaining scalable API solutions
2+ years of professional experience with NLP; 1+ years of professional experience with Large Language Models (LLM)/GenAI technology (e.g., OpenAI API, GPT-4, Gemini, Llama, Claude, Amazon Bedrock, Langchain, HuggingFace Transformers, PyTorch); 1+ years of experience with prompt engineering and vector databases
2+ years of experience with AWS, GCP, or Microsoft Azure; 2+ years of experience with MLOps and CI/CD pipeline development, containerization, and model deployment in test and production environments
Team player who can communicate complex LLM capabilities and limitations to non-technical stakeholders.

PREFERRED

Master’s or Ph.D. in a relevant field
7+ years of product engineering and/or data science experience
Experience with Ruby on Rails, JavaScript, or Flutter; 2+ years of experience with Snowflake or Databricks
Deep knowledge of a Retail domain or industry, with a focus on NLP/LLM
In-depth understanding of Responsible AI standards and protocols
Applied research background using frameworks to build LLM prototypes; knowledge of best practices for production LLM development

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 1 0 0

Category: Data Science Jobs

Tags: APIs Architecture AWS Azure CI/CD Claude Computer Science Databricks Engineering GCP Gemini Generative AI GPT GPT-4 HuggingFace JavaScript LangChain LLaMA LLMs Machine Learning Mathematics MLOps Model deployment NLP OpenAI Pipelines Prompt engineering Python PyTorch R RAG R&D Research Responsible AI Ruby Snowflake Statistics Testing Transformers Unstructured data