Senior Data Scientist, Operations (GenAI)
United Kingdom
Argus Media
Commodity prices and indexes, forecasts, news, and expert analysis for oil, natural gas, fuels, bioenergy, agriculture, and fertilizer sectors around the world.
Senior Data Scientist, Operations (GenAI)
Holborn, London, UK.
Argus is where smart people belong and where they can grow. We answer the challenge of illuminating markets and shaping new futures.
What we’re looking for
Join our Generative AI team to work on groundbreaking projects that shape the future of AI and data science. Your contributions will directly impact the development of innovative solutions used by global industry leaders. You'll play a pivotal role in transforming how our data are seamlessly integrated with AI systems, paving the way for the next generation of customer interactions.
We are seeking an experienced Senior Data Scientist to join our Generative AI team. This role will focus on creating and maintaining AI-ready data, leveraging the deep technical knowledge already established within the London team. You will support text and numerical data extraction, curation, and metadata enhancements, accelerating development. You will also help ensure rapid response times, minimizing potential disruptions.
What will you be doing
- AI-Ready Data Development: Design, develop, and maintain high-quality AI-ready datasets, ensuring data integrity, usability, and scalability to support advanced Generative AI models.
- Advanced Data Processing: Lead hands-on efforts in complex data extraction, cleansing, and curation for diverse text and numerical datasets. Implement sophisticated metadata enrichment strategies to enhance data utility and accessibility for AI systems.
- Algorithm Implementation & Optimization: Implement and optimize state-of-the-art algorithms and pipelines for efficient data processing, feature engineering, and data transformation tailored for LLM and GenAI applications.
- GenAI Application Development: Apply and integrate frameworks like LangChain and Hugging Face Transformers to build modular, scalable, and robust Generative AI data pipelines and applications.
- Prompt Engineering Application: Apply advanced prompt engineering techniques to optimize LLM performance for specific data extraction, summarization, and generation tasks, working closely with the Lead's guidance.
- LLM Evaluation Support: Contribute to the systematic evaluation of Large Language Models (LLMs) outputs, analysing quality, relevance, and accuracy, and supporting the implementation of LLM-as-a-judge frameworks.
- Retrieval-Augmented Generation (RAG) Contribution: Actively contribute to the implementation and optimization of RAG systems, including working with embedding models, vector databases, and, where applicable, knowledge graphs, to enhance data retrieval for GenAI.
- Technical Mentorship: Act as a technical mentor and subject matter expert for junior data scientists, providing guidance on best practices in coding, data handling, and GenAI methodologies.
- Cross-Functional Collaboration: Collaborate effectively with global data science teams, engineering, and product stakeholders to integrate data solutions and ensure alignment with broader company objectives.
- Operational Excellence: Troubleshoot and resolve data-related issues promptly to minimize potential disruptions, ensuring high operational efficiency and responsiveness.
- Documentation & Code Quality: Produce clean, well-documented, production-grade code, adhering to best practices for version control and software engineering.
Skills and Experience
- Academic Background: Advanced degree in AI, statistics, mathematics, computer science, or a related field.
- Programming and Frameworks: Deep experience with Python, TensorFlow or PyTorch, and NLP libraries such as spaCy and Hugging Face.
- GenAI Tools: Practical experience with LangChain, Hugging Face Transformers, and embedding models for building GenAI applications.
- Prompt Engineering: Deep expertise in prompt engineering, including prompt tuning, chaining, and optimization techniques.
- LLM Evaluation: Experience evaluating LLM outputs, including using LLM-as-a-judge methodologies to assess quality and alignment.
- RAG and Knowledge Graphs: Practical understanding and experience using vector databases. In addition, familiarity with graph-based RAG architectures and the use of knowledge graphs to enhance retrieval and reasoning would be a strong plus.
- Cloud: Practical experience with Gemini/OpenAI models and comfortable with cloud platforms such as AWS, Google Cloud, or Azure. Proficient with Docker for containerization.
- Data Engineering: Strong understanding of data extraction, curation, metadata enrichment, and AI-ready dataset creation.
- Collaboration and Communication: Excellent communication skills and a collaborative mindset, with experience working across global teams.
What’s in it for you
Our rapidly growing, award-winning business offers a dynamic environment for talented, entrepreneurial professionals to achieve results and grow their careers. Argus recognizes and rewards successful performance and as an Investor in People, we promote professional development and retain a high-performing team committed to building our success.
- Competitive salary and company bonus scheme
- Group pension scheme
- Group healthcare and life assurance scheme
- Flexible working environment
- 25 days holiday with annual increase up to 30 days
- Subsidised gym membership
- Season ticket travel loans
- Cycle to work scheme
- Extensive internal and external training
About Argus
Argus is the leading independent provider of market intelligence to the global energy and commodity markets. We offer essential price assessments, news, analytics, consulting services, data science tools and industry conferences to illuminate complex and opaque commodity markets. Headquartered in London with over 1,500 staff, Argus is an independent media organisation with 30 offices in the world’s principal commodity trading hubs.
Companies, trading firms and governments in 160 countries around the world trust Argus data to make decisions, analyse situations, manage risk, facilitate trading and for long-term planning. Argus prices are used as trusted benchmarks around the world for pricing transportation, commodities and energy. Founded in 1970, Argus remains a privately held UK-registered company owned by employee shareholders and global growth equity firm General Atlantic.
Argus is committed to ensuring career and personal growth for all its staff and provides extensive training and career development opportunities, as well as participation in employee-led initiatives, including a women’s network. Our core values are Excellence, Integrity, Partnership and Inclusivity.
For more details about the company and to apply please make sure you upload your CV via our website: www.argusmedia.com/en/careers/open-positions
By submitting your job application, you automatically acknowledge and consent to the collection, use and/or disclosure of your personal data to the Company. Argus is an equal opportunity employer. We welcome and encourage diversity in the workplace regardless of race, gender, sexual orientation, gender identity, disability or veteran status.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture AWS Azure Computer Science Consulting Data pipelines Docker Engineering Feature engineering GCP Gemini Generative AI Google Cloud LangChain LLMs Mathematics NLP OpenAI Pipelines Prompt engineering Python PyTorch RAG spaCy Statistics TensorFlow Transformers
Perks/benefits: Career development Competitive pay Conferences Equity / stock options Fitness / gym Flex hours Salary bonus Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.