Data Engineer - AI Startup (Hybrid, Stock Options)
Barcelona, Catalonia, Spain
UNITH
Engage your audiences like never before by combining the power of human-like Digital Avatars and Conversational AI.At UNITH, we're at the forefront of transforming customer journeys with conversational AI. Listed on the ASX, we lead the way in creating lifelike digital humans using synthetic facial movement, voice engineering, and conversational design. With Digital Humans available in over 60 languages and featuring more than 600 voices, we're revolutionizing the conversational AI industry.
Why Join UNITH?
We are seeking a skilled Data Engineer to join our growing Platform team. In this role, you will build and maintain data pipelines that capture, process, and store conversation logs from our digital human platform. You'll work closely with our AI/ML, Backend, and Analytics teams to ensure data flows efficiently throughout our systems and supports both internal and customer-facing analytics. Your initial focus will be on developing API endpoints to provide customers with access to their conversation logs, collaborating with our NLP Engineer to extract valuable insights from these conversations, implementing usage-based pricing metrics with Stripe integration, and building data pipelines for evaluating and tracing digital human conversations.
Key Responsibilities:
- Design, implement, and maintain ETL processes using AWS Glue, Kinesis, and various database technologies to transform raw conversation data into structured formats
- Develop and optimize data pipelines for capturing and processing conversation logs and session data
- Create secure API endpoints that allow customers to access and analyze their conversation logs
- Maintain and improve data pipelines to provide metrics for usage-based pricing, including Stripe integration
- Build data pipelines to support evaluation and traceability of digital human conversations
- Collaborate with our NLP Engineer to extract actionable insights from conversation data and develop methods to share these insights with customers
- Implement data security and privacy measures to ensure compliance with GDPR and ISO 27001 standards
- Collaborate with the Analytics Engineer to create data structures that enable actionable insights
- Ensure data quality, integrity, and security throughout the data lifecycle
- Implement efficient data storage solutions balancing performance, cost, and compliance requirements
- Support the integration of data systems with our broader AWS infrastructure
- Build scalable data architecture that can handle growing volumes of conversation data
- Document data flows, schemas, and processes for team knowledge sharing
- Troubleshoot and resolve data pipeline issues in production environments
Requirements:
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience
- 3+ years of experience in data engineering roles
- Strong experience with AWS data services, particularly Glue, Kinesis, and S3
- Experience with database technologies such as Redshift, Aurora, Postgres, or OpenSearch
- Proficiency in Python and SQL
- Experience with ETL design, implementation, and maintenance
- Knowledge of data modeling and database design principles
- Experience building and maintaining secure API endpoints
- Familiarity with streaming data technologies
- Strong understanding of data privacy regulations (GDPR) and security best practices
- Knowledge of security frameworks like ISO 27001
- Experience with version control systems (Git)
Preferred Qualifications:
- Experience working with conversation or NLP data
- Knowledge of AI/ML data pipelines and requirements
- Experience extracting insights from conversational data
- Experience integrating with payment processing systems like Stripe for usage-based billing
- Experience building systems for AI model evaluation and conversation traceability
- Familiarity with Apache Parquet file format
- Experience with AWS Athena and Lambda
- Understanding of BI tools and dashboarding systems (particularly Metabase)
- Experience with API design and RESTful services
- Experience implementing data anonymization and pseudonymization techniques
- Experience with infrastructure as code tools
- Knowledge of containerization technologies (Docker, Kubernetes)
Benefits:
- Competitive salary package and benefits.
- Lead the way in a fast-growing startup, driving the conversational AI revolution.
- Work with cutting-edge technology, shaping the future of digital experiences.
- Collaborate with a dynamic and talented team in a supportive and inclusive work environment.
- Opportunities for career growth as we expand globally.
- Additional perks include your choice of machine, hybrid working options, a fantastic office with a rooftop terrace in Barcelona, lunch compensation, private health insurance with Alan, and a generous ClassPass discount.
To apply, please submit your resume and a cover letter detailing your relevant experience and your enthusiasm for joining UNITH. For a standout application, you can also reach out directly to anniek@unith.ai.
At UNITH, we believe diversity drives innovation. Our mission to transform customer experiences relies on a team with a wide range of perspectives, experiences, and backgrounds. We are committed to creating an inclusive workplace where everyone feels valued and empowered to thrive.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: APIs Architecture Athena AWS AWS Glue Computer Science Conversational AI Data pipelines Data quality Docker Engineering ETL Git ISO 27001 Kinesis Kubernetes Lambda Machine Learning Metabase NLP OpenSearch Parquet Pipelines PostgreSQL Privacy Python Redshift Security SQL Streaming
Perks/benefits: Competitive pay Equity / stock options Health care Startup environment
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.