Lead Data Scientist

Delhi

⚠️ We'll shut down after Aug 1st - try foo🦍 for all jobs in tech ⚠️

Full Time Senior-level / Expert USD 38K - 72K * ^est.

HighLevel

HighLevel is the all-in-one sales & marketing platform that agencies can white-label and resell to their clients!

Posted 7 hours ago

About Us:HighLevel is an AI powered, all-in-one white-label sales & marketing platform that empowers agencies, entrepreneurs, and businesses to elevate their digital presence and drive growth. We are proud to support a global and growing community of over 2 million businesses, comprised of agencies, consultants, and businesses of all sizes and industries. HighLevel empowers users with all the tools needed to capture, nurture, and close new leads into repeat customers. As of mid 2025, HighLevel processes over 15 billion API hits and handles more than 2.5 billion message events every day. Our platform manages over 470 terabytes of data distributed across five databases, operates with a network of over 250 microservices, and supports over 1 million domain names.
Our PeopleWith over 1,500 team members across 15+ countries, we operate in a global, remote-first environment. We are building more than software; we are building a global community rooted in creativity, collaboration, and impact. We take pride in cultivating a culture where innovation thrives, ideas are celebrated, and people come first, no matter where they call home.
Our ImpactAs of mid 2025, our platform powers over 1.5 billion messages, helps generate over 200 million leads, and facilitates over 20 million conversations for the more than 2 million businesses we serve each month. Behind those numbers are real people growing their companies, connecting with customers, and making their mark - and we get to help make that happen.
About the Role:As a Senior Data Scientist, you will design and deploy AI-driven systems that support key business functions like Sales, Customer Success, and Product. You’ll own the end-to-end lifecycle from experimentation to production, applying techniques like predictive modeling, real-time scoring, and AI agent orchestration. Working cross-functionally, you’ll translate data into automation and decision-making tools that drive measurable business outcomes.

Requirements:

8+ years in data science, ML, or applied AI roles, ideally within SaaS (B2B or PLG preferred)
Expert in SQL, Python, and modeling frameworks (e.g. scikit-learn, XGBoost, LightGBM)
Proven experience building and deploying predictive models in production (churn, conversion, LTV, usage drop-off)
Experience in fine-tuning models either with FFT or LORA (Or variants of)
Strong hands-on experience with OpenAI models, LangChain, and agent orchestration tools
Demonstrated prompt engineering capability: designing and refining system and task-specific prompts
Experience implementing retrieval-augmented generation (RAG) using embeddings and vector DBs (Pinecone, FAISS, etc.)
Experience testing, training, and deploying models/agents via Cloudflare Workers or equivalent serverless environments
Familiarity with streaming usage data pipelines and real-time behavioral scoring
Strong storytelling skills: you can articulate technical work to non-technical stakeholders clearly and persuasively

Responsibilities:

Develop and fine-tune machine learning models using advanced algorithms like gradient boosting (XGBoost, LightGBM) and lightweight neural networks to better grade customer churn, account health decline, upsell opportunities, and trial conversion rates
Pull data from feature sets across CRM, product usage, support, and NPS. Cleanse and transform data to form a holistic view of account health.
Build production-grade models to predict churn, account health decline, usage slowness, upsell opportunity, and trial conversion
Create real-time scoring mechanisms to alert GTM teams about at-risk customers and under-engaged segments
Use OpenAI models, LangChain (or equivalent) or open source models to build intelligent assistants, auto-analysis agents, and retrieval-based matchers
Design prompts and agent flows to answer RevOps questions, generate insight summaries, and automate interventions
Implement retrieval-augmented generation (RAG) architectures using vector databases (e.g., Pinecone, FAISS)

EEO Statement:At HighLevel, we value diversity. In fact, we understand it makes our organisation stronger. We are committed to inclusive hiring/promotion practices that evaluate skill sets, abilities, and qualifications without regard to any characteristic unrelated to performing the job at the highest level. Our objective is to foster an environment where really talented employees from all walks of life can be their true and whole selves, cherished and welcomed for their differences while providing excellent service to our clients and learning from one another along the way! Reasonable accommodations may be made to enable individuals with disabilities to perform essential functions.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 0 0 0

Categories: Data Science Jobs Leadership Jobs

Tags: APIs Architecture Data pipelines Engineering FAISS LangChain LightGBM LoRA Machine Learning Microservices ML models OpenAI Open Source Pinecone Pipelines Predictive modeling Prompt engineering Python RAG Scikit-learn SQL Streaming Testing XGBoost