Data Platform Lead
London, United Kingdom
Corsearch
Combining AI-fueled technology and decades of expertise, Corsearch is revolutionizing how companies establish and protect their brands.
Do you get excited when hearing about trademarks and brand protection news? YES?! So do we! At Corsearch, there’s no pushing trademark solutions and brand protection from our thoughts. We’re thinking about coined trademarks in the car, a detailed design search over lunch, counterfeits while sitting with the in-laws, and anti-piracy while working out
We are a mission-led company, driven by a passion for making the world better and safer for our brand customers and their consumers. It’s what we do. And people come to Corsearch to be challenged, developed, supported, and valued 👍
✅The Role
Technology at Corsearch currently consists of Engineering and Data Science functions, with nearly 150 people globally, aligned to Trademark Solutions or Online Brand Protection Business Units.
Trademark Solutions acquires data from 200 jurisdictions globally by means of data feeds, web crawling and publications/gazettes. The master data system ingests, enriches and stores data in order to support a complex ecosystem of platforms which underpin Corsearch’s managed search, screening & watch services.
The current master data system (“DMS”) is based on C#, MS SQL and MySQL. Corsearch is looking to recruit a new team to build out & transition to a replacement platform using managed services in the cloud.
The ideal candidate will have had prior experience with traditional database-driven software in .Net and Microsoft SQL and more recent experience with modern cloud Lakehouse systems such as Databricks.
✅Responsibilities and Duties
- Lead the design and implementation of data migration strategies from Microsoft SQL Server to Databricks
- Architect and build scalable data pipelines using Delta Lake and Databricks' Lakehouse platform
- Optimize existing ETL/ELT processes and transform them into modern, cloud-native solutions
- Implement data quality frameworks and testing methodologies
- Collaborate with stakeholders to understand business requirements and translate them into technical specifications
- Establish best practices for data modelling, governance, and security in the new Lakehouse environment
- Mentor team members on Databricks best practices and modern data engineering principles
✅ Essential
- Experience in data engineering, with hands-on experience with Databricks
- Strong expertise in SQL and Python programming
- Proven track record of successful data platform migrations
- Experience with Delta Lake, Apache Spark, and other big data technologies
- Deep understanding of data warehousing concepts and dimensional modelling
- Experience with AWS cloud platforms
- Knowledge of data governance, security, and compliance requirements
✅ Preferred Experience
- Databricks certifications (e.g., Databricks Certified Associate Developer)
- Knowledge of dbt, Airflow, or similar modern data tools
- Familiarity with CI/CD practices for data pipelines
- Experience with Microsoft SQL Server migration projects
- Understanding of data mesh and data fabric architectures
✅ Technical Skills
- Languages: SQL, Python
- Platforms: Databricks, AWS
- Technologies: Apache Spark, Delta Lake
- Tools: Git, JIRA, GitHub/GitLab
- Concepts: Data Modelling, ETL/ELT, Data Quality, Data Governance
Corsearch is an equal opportunity and inclusive employer and does not tolerate discrimination of any kind. We are committed to creating a diverse and inclusive workplace where all employees feel valued, respected, and supported.
We welcome applications from all individuals regardless of race, nationality, religion, gender, gender identity or expression, sexual orientation, age, disability, or any other protected characteristic.
Together, we are working proactively to build a workplace where everyone can belong and be at their best selves. Together, we make an Impact.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow Architecture AWS Big Data CI/CD Databricks Data governance Data pipelines Data quality Data Warehousing dbt ELT Engineering ETL Git GitHub GitLab Jira MS SQL MySQL Pipelines Python Security Spark SQL Testing
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.