Senior Data Engineer
London, England, United Kingdom
Applications have closed
Simple Machines
Data Engineered to Life. Engineering and software development for data.Simple Machines. Data Engineered to Life™
Simple Machines is a leading independent boutique technology firm with a global presence, including teams in London, Sydney, San Francisco, and New Zealand. We specialise in creating technology solutions at the intersection of data, AI, machine learning, data engineering, and software engineering. Our mission is to help enterprises, technology companies, and governments better connect with and understand their organisations, their people, their customers, and citizens. We are a team of creative engineers and technologists dedicated to unleashing the potential of data in new and impactful ways. We design and build bespoke data platforms and unique software products, create and deploy intelligent systems, and bring engineering expertise to life by transforming data into actionable insights and tangible outcomes. We engineer data to life™.
The Role:
The Senior Data Engineer at Simple Machines is a dynamic, hands-on role focused on building real-time data pipelines and implementing data mesh architectures to enhance client data interactions. This position blends deep technical expertise in modern data engineering methods with a client-facing consulting approach, enabling clients to effectively manage and utilize their data. Within a team of top-tier engineers, the role involves developing greenfield data solutions that deliver tangible business outcomes across various environments.
Technical Responsibilities:
- Developing Data Solutions: Implement and enhance data-driven solutions integrating with clients' systems using state-of-the-art tools such as Databricks, Snowflake, Google Cloud, and AWS. Embrace modern data architecture philosophies including data products, data contracts, and data mesh to ensure a decentralized and consumer-oriented approach to data management.
- Data Pipeline Development: Develop and optimise high-performance, batch and real-time data pipelines employing advanced streaming technologies like Kafka, and Flink. Utilise workflow orchestration tools such as Dataflow and Airflow.
- Database and Storage Optimisation: Optimise and manage a broad array of database technologies, from traditional relational databases (e.g., PostgreSQL, MySQL) to modern NoSQL solutions (e.g., MongoDB, Cassandra). Focus on strategies that enhance data accessibility, integrity, and performance.
- Big Data Processing & Analytics: Utilise big data frameworks such as Apache Spark and Apache Flink to address challenges associated with large-scale data processing and analysis. These technologies are crucial for managing vast datasets and performing complex data transformations and aggregations.
- Cloud Data Management: Implement and oversee cloud-specific data services including AWS Redshift, S3, Google BigQuery, and Google Cloud Storage. Leverage cloud architectures to improve data sharing and interoperability across different business units.
- Security and Compliance: Ensure all data practices comply with security policies and regulations, embedding security by design in the data infrastructure. Incorporate tools and methodologies recommended for data security and compliance, ensuring robust protection and governance of data assets.
Consulting Responsibilities:
- Client Advisory: Provide expert advice to clients on optimal data practices that align with their business requirements and project goals.
- Training and Empowerment: Educate client teams on the latest technologies and data management strategies, enabling them to efficiently utilize and maintain the solutions we have developed.
- Professional Development: Keep up with the latest industry trends and technological advancements, continually upgrading skills and achieving certifications in the technologies Simple Machines implements across its client base.
Requirements
Ideal Skills and Experience:
- Core Data Engineering Tools & Technologies: Demonstrates proficiency in SQL and Spark, and familiarity with platforms such as Databricks and Snowflake. Well-versed in various storage technologies including AWS S3, Google Cloud BigQuery, Cassandra, MongoDB, Neo4J, and HDFS. Adept in pipeline orchestration tools like AWS Glue, Apache Airflow, and dbt, as well as streaming technologies like Kafka, AWS Kinesis, Google Cloud Pub/Sub, and Azure Event Hubs.
- Data Storage Expertise: Knowledgeable in data warehousing technologies like BigQuery, Snowflake, and Databricks, proficient in managing various data storage formats including Parquet, Delta, ORC, Avro, and JSON to optimize data storage and retrieval.
- Building and Managing Large-scale Data Systems: Experienced in developing and overseeing large-scale data pipelines and data-intensive applications within production environments.
- Data Modelling Expertise: Proficient in data modelling, understanding the implications and trade-offs of various methodologies and approaches.
- Infrastructure Configuration for Data Systems: Competent in setting up data system infrastructures, favouring infrastructure-as-code practices using tools such as Terraform and Pulumi.
- Programming Languages: Proficient in Python and SQL, with additional experience in programming languages like Java, Scala, GoLang, and Rust considered advantageous.
- CI/CD Implementation: Knowledgeable about continuous integration and continuous deployment practices using tools like GitHub Actions and ArgoCD, enhancing software development and quality assurance.
- Testing Tools and Frameworks: Experienced with data quality and testing frameworks such as DBT, Great Expectations, and Soda, ensuring the reliability of complex data systems.
- Commercial Application of Data Engineering Expertise: Demonstrated experience in applying data engineering skills across various industries and organizations in a commercial context.
- Agile Delivery and Project Management: Skilled in agile, scrum, and kanban project delivery methods, ensuring efficient and effective solution development.
- Consulting and Advisory Skills: Experienced in a consultancy or professional services setting, offering expert advice and crafting customized solutions that address client needs. Effective in engaging stakeholders and translating business requirements into practical data engineering strategies.
Professional Experience and Qualifications:
- Professional Experience: At least 5+ years of data engineering or equivalent experience in a commercial, enterprise, or start-up environment. Consulting experience within a technology consultancy or professional services firm is highly beneficial.
- Educational Background: Degree or equivalent experience in computer science or a related field.
Benefits
What We Offer in the UK:
- Salary: Competitive salary and benefits package.
- Pension: Up to 5% employer contribution, matching up to a 5% employee contribution, for a total of up to 10%.
- Annual Leave: 4 weeks standard + 1 week additional annual leave over Christmas shut down period, plus public holidays.
- Your Day - No Questions Asked: One additional day off per year, no explanation required!
- Regular Lunches: Provided at team meet-ups and on workdays at Simple Machines' co-working space.
- Health and Wellbeing Allowance: £1,250 allowance per year to be used for any food and non-alcoholic beverages during business hours, healthcare, gym memberships, sporting goods and accessories, and any wellness appointments.
- Professional Development: £1,500 annual budget for training, courses, and conferences, with potential for additional funding.
- Certifications: £2,500 annual budget for certifications and related courses.
- Equipment Allowance: £1,500 for UK team members, plus Apple MacBook Pro laptops and necessary accessories.
- Company Sick Leave: 10 days per annum, includes coverage for employee’s family.
- Antenatal Support: Paid time off for antenatal appointments, including classes recommended by health professionals.
- Terminal Illness Benefit: Three months' continuance of salary at full pay.
Join Us:
Simple Machines is a diverse and globally distributed team of individual talents. Everyone in the firm is among the best at what they do. That’s why they’re here. We have a collective obsession with the future and a passion to create real change through technology. If you’re someone who’s as passionate as we are about building a world-class technology company specialising in engineering for data, you’ll fit right in.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow Architecture Avro AWS AWS Glue Azure Big Data BigQuery Cassandra CI/CD Computer Science Consulting Databricks Dataflow Data management Data pipelines Data quality Data Warehousing dbt Engineering Flink GCP GitHub Golang Google Cloud HDFS Java JSON Kafka Kanban Kinesis Machine Learning MongoDB MySQL Neo4j NoSQL Parquet Pipelines PostgreSQL Python RDBMS Redshift Rust Scala Scrum Security Snowflake Spark SQL Streaming Terraform Testing
Perks/benefits: Career development Competitive pay Conferences Gear Health care Startup environment Wellness
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.