Data Engineer (fixed-term contract)

Hanoi, Hanoi, VN

ActiveFence

ActiveFence empowers Trust & Safety and online security professionals in their quest to keep platform users and the public safe from harm.

Description

Job Grade: Short-term contract with the potential for full-time conversion upon evaluation

About the Role:

We are looking for a highly skilled and proactive Data Engineer to join our team for maternity leave cover, with the possibility of transitioning into a full-time role. This position is ideal for an analytical problem-solver who thrives in dynamic environments and can seamlessly integrate technical expertise with business acumen to deliver impactful solutions.

Responsibilities:

  • Collaborate with cross-functional teams to translate business needs into actionable data strategies and scalable solutions.
  • Design and implement end-to-end data workflows to address complex challenges with a focus on maintainability and scalability.
  • Extract and enrich information from diverse data sources, including structured, unstructured, and image-based content, to support violation labeling and classification.
  • Develop and maintain a threat taxonomy that includes categories, keyword patterns, and severity scoring to enable accurate downstream tagging.
  • Apply best practices in data storage and architecture, including Spark-based ingestion into Delta Lake with appropriate schema design, partitioning, validation, error handling, deduplication, and monitoring.
  • Deliver clean, robust, well-documented, and testable Python code for:
      • Process automation
      • Web scraping and API-driven data collection (including text, image, audio, and video)
      • Scalable pipelines for data processing, enrichment, and storage using the Spark ecosystem


Requirements

  • Proven experience solving complex problems and delivering end-to-end solutions in data-driven environments.
  • Hands-on programming experience in Python and SQL.
  • Solid experience with PySpark, Delta Lake, and scalable schema design for big data workflows.
  • Familiarity with structured data formats (JSON, CSV) and experience in data validation and transformation pipelines.
  • Ability to write clean, well-documented, and maintainable code, with clear setup and execution instructions.
  • Strong attention to detail, with the ability to work independently and deliver under tight deadlines.
  • Advantage: experience working with threat intelligence, cybersecurity data, or other security-related datasets.

If you are a forward-thinking data professional who can quickly adapt, innovate, and deliver, we invite you to join our team for this exciting opportunity. Apply today to make an impact!

About ActiveFence

ActiveFence is the leading tool stack for Trust & Safety teams worldwide. By relying on ActiveFence’s end-to-end solution, Trust & Safety teams of all sizes can keep users safe from the widest spectrum of online harms, unwanted content, and malicious behavior, including child safety, disinformation, fraud, hate speech, terror, nudity, and more.

Using cutting-edge AI and a team of world-class subject-matter experts to continuously collect, analyze, and contextualize data, ActiveFence ensures that in an ever-changing world, customers are always two steps ahead of bad actors. As a result, Trust & Safety teams can be proactive and provide maximum protection to users across a multitude of abuse areas, in 70+ languages.

Backed by leading Silicon Valley investors such as CRV and Norwest, ActiveFence has raised $100M to date, employs 300 people worldwide, and has contributed to the online safety of billions of users across the globe.

Category: Engineering Jobs

Tags: APIs Architecture Big Data Classification CSV JSON Pipelines PySpark Python Security Spark SQL

Region: Asia/Pacific
Country: Vietnam
