Data Scientist
Hyderabad
Matillion
Matillion's unified ELT platform is the next step in data integration. Use AI to build faster pipelines, enhance data productivity and deliver analytics… We are on a mission to power the data productivity of our customers and the world, by helping teams get data business ready, faster. Our technology allows customers to load, transform, sync and orchestrate their data.
We are looking for passionate, high-integrity individuals to help us scale up our growing business. Together, we can make a dent in the universe bigger than ourselves.
With offices in the UK, US and Spain, we are now thrilled to announce the opening of our new office in Hyderabad, India. This marks an exciting milestone in our global expansion, and we are now looking for talented professionals to join us as part of our founding team.
Matillion is a fast-paced, hyper-scale software development company. You will be based in India but working with colleagues globally, specifically across the US, the UK and in Hyderabad.
The Enterprise data team is responsible for producing Matillion's reporting metrics and KPIs. We work closely with Finance colleagues, the product team and Go To Market to interpret the data that we have and provide actionable insight across the business.
The purpose of this role is to:
- Increase the value of strategic information from the data warehouse, Salesforce, Thoughtspot, and DPC Hub
- Develop models to help us understand customer behaviour, specifically onboarding, product usage and churn
- Use our rich data assets to streamline operational processes
What will you be doing?
- Running structured experiments to evaluate and improve LLM performance across generative and task-oriented functions
- Improving our AI evaluation frameworks
- Investigating ways generative AI can be used to improve data quality
- Building some more traditional data science predictive models to forecast customer consumption and churn, and/or to detect anomalies in failing data pipelines
- Keeping current on the latest research and proposing proof of concept projects to explore how it can assist us
- Educating other team members to raise the team's understanding of theoretical concepts and the latest developments
What are we looking for?
Technical/Role Specific - Core Skills
- MSc, PhD, or equivalent experience in ML, NLP, or a related field.
- Strong understanding of LLM internals: transformer architecture, tokenization, embeddings, sampling strategies.
- Python fluency, especially for data science and experimentation (NumPy, Pandas, Matplotlib, Jupyter).
- Experience with LLM tools (e.g. Hugging Face, LangChain, OpenAI API).
- Familiarity with prompt engineering and structured evaluation of generative outputs.
Technical/Role Specific - Preferable Skills
- Any experience of reinforcement learning techniques, even if on a small scale
- Experience of:
  - model evaluation
  - fine-tuning, model distillation, instruction tuning or transfer learning
  - agentic systems (tool use / agentic frameworks)
  - implementing guardrails
  - RAG architecture design and vector search
- Understanding of:
  - model failure modes, fallback strategies, and error recovery
  - LLM performance optimization tradeoffs (latency, cost, accuracy)
  - uncertainty estimation and confidence scoring in generative systems
  - privacy and compliance considerations in AI for SaaS
Personal Capabilities
- Enthusiasm to learn
- Able to coach and mentor those around you to increase their knowledge
- Comfort working across teams
- Ability to translate requirements between data scientists (research focus) and software engineers (product focus)
- Clear communication of challenges, timelines, and possible solutions to stakeholders
- Adaptability to rapid changes in a dynamic tech startup environment
- Enthusiasm for learning new AI/ML Ops tools, libraries, and techniques
- Proactive in diagnosing problems to find the true root cause
- Willingness to experiment and to look for ways to optimise existing systems
- Willingness to pivot quickly in a rapidly evolving generative AI landscape
Our 6 core values guide how we work together and with our customers and partners. We operate a truly flexible and hybrid working culture that promotes work-life balance, and are proud to be able to offer the following benefits:
- Company Equity
- 27 days paid time off
- 12 days of Company Holiday
- 5 days paid volunteering leave
- Group Mediclaim (GMC)
- Enhanced parental leave policies
- MacBook Pro
- Access to various tools to aid your career development
More about Matillion
Thousands of enterprises including Cisco, DocuSign, Slack, and TUI trust Matillion technology to load, transform, sync, and orchestrate their data for a wide range of use cases from insights and operational analytics, to data science, machine learning, and AI.
With over $300M raised from top Silicon Valley investors, we are on a mission to power the data productivity of our customers and the world.
We are passionate about doing things in a smart, considerate way. We're honoured to be named a great place to work for several years running by multiple industry research firms.
We are dual headquartered in Manchester, UK and Denver, Colorado.
We are keen to hear from prospective Matillioners, so even if you don't feel you match all the criteria please apply and a member of our Talent Acquisition team will be in touch. Alternatively, if you are interested in Matillion but don't see a suitable role, please email talent@matillion.com.
Matillion is an equal opportunity employer. We celebrate diversity and we are committed to creating an inclusive environment for all of our team. Matillion prohibits discrimination and harassment of any type. Matillion does not discriminate on the basis of race, colour, religion, age, sex, national origin, disability status, genetics, sexual orientation, gender identity or expression, or any other characteristic protected by law.