Data Engineer
Petah Tikva, Center District, IL
Cellebrite
Cellebrite's Digital Intelligence Suite of Forensic Solutions empowers law enforcement, governments, and enterprises to collect, review, analyze & manage data.
Description
Company Overview:
Cellebrite's (Nasdaq: CLBT) mission is to enable its global customers to protect and save lives by enhancing digital investigations and intelligence gathering to accelerate justice in communities around the world.
Cellebrite's AI-powered Digital Investigation Platform enables customers to lawfully access, collect, analyze and share digital evidence in legally sanctioned investigations while preserving data privacy. Thousands of public safety organizations, intelligence agencies and businesses rely on Cellebrite's digital forensic and investigative solutions (available via cloud, on-premises and hybrid deployments) to close cases faster and safeguard communities. To learn more, visit us at www.cellebrite.com, https://investors.cellebrite.com/investors and find us on social media @Cellebrite.
Position Overview:
We're looking for a highly skilled and opinionated Senior Data Engineer to join our Data Platform team.
If you thrive in cloud-native, high-scale environments and enjoy full ownership of the data lifecycle - from design to deployment - this role is for you.
You'll work closely with data scientists, ML engineers, and product teams to build scalable, production-ready data infrastructure leveraging Apache Spark, AWS services, and Infrastructure-as-Code.
Key Responsibilities:
- Influence Data Architecture: Design scalable, secure data platforms using Spark (EMR), Glue, and AWS event-driven services.
- Develop Scalable Pipelines: Build batch & streaming ETL/ELT pipelines with Spark EMR, Athena, Iceberg, EKS, Lambda.
- Drive Innovation: Introduce patterns like data mesh, serverless analytics, and schema-aware pipelines.
- Cross-Team Collaboration: Work with ML, backend, and product teams to deliver data-powered solutions.
- Operational Excellence: Apply observability, cost control, performance tuning, and CI/CD automation using CloudWatch, Step Functions, Terraform/CDK.
- Security: Implement AWS best practices for IAM, encryption, compliance, and auditability.
Experience:
- 8+ years of hands-on experience in data engineering, with proven responsibility for designing, developing, and maintaining large-scale, distributed data systems in cloud-native environments (preferably AWS).
- End-to-end ownership of complex data architectures - from data ingestion to processing, storage, and delivery in production-grade systems.
- Deep understanding of data modeling, data quality, and pipeline performance optimization.
Technical Expertise:
- Apache Spark: Expertise in writing efficient Spark jobs using PySpark, with experience running workloads on AWS EMR and/or Glue for large-scale ETL and analytical tasks.
- AWS Services: Strong hands-on experience with S3, Lambda, Glue, Step Functions, Kinesis, and Athena - including building event-driven and serverless data pipelines.
- Solid experience in building and maintaining both batch and real-time (streaming) data pipelines and integrating them into production systems.
- Infrastructure as Code (IaC): Proficient in using Terraform, AWS CDK, or SAM to automate deployment and manage scalable data infrastructure.
- Python as a primary development language (with bonus points for TypeScript experience).
- Comfortable working in agile, fast-paced environments, with strong debugging, testing, and performance-tuning capabilities.
Mindset & Skills:
- Strong system design & code quality focus
- Ownership from architecture to monitoring
- Automation-oriented, loves simplicity
- Excellent communication & mentoring abilities
Nice to Have:
- AWS certifications (e.g., Solutions Architect, Data Engineer).
- Experience with ML pipelines or AI-enhanced analytics.
- Familiarity with data governance, data mesh, or self-service platforms.
- Domain experience in cybersecurity, law enforcement, or regulated industries.
Why Join Cellebrite?
- Competitive compensation & full benefits.
- Work on mission-critical systems that protect lives.
- High-impact, collaborative, innovation-driven culture.
- Continuous learning & real career growth opportunities.
Tags: Agile Architecture Athena AWS CI/CD Data governance Data pipelines Data quality ELT Engineering ETL Kinesis Lambda Machine Learning Pipelines Privacy PySpark Python Security Spark Step Functions Streaming Terraform Testing TypeScript
Perks/benefits: Career development Competitive pay Startup environment