Lead Data Engineer
USA (Remote)
Full Time Senior-level / Expert USD 160K - 240K
Mysten Labs
Mysten Labs is developing crucial infrastructure to power a more decentralized internet, transforming how people connect and share information online.Mysten Labs believes that decentralized and open protocols are the bedrock of the internet of value. This is why at Mysten Labs, we are creating foundational infrastructure to accelerate the adoption of decentralized protocols based on blockchain technologies.
The Data team is looking to hire a Staff Data Engineer to design and execute on the next iteration of our data systems. This is an exciting opportunity for someone who wants to design web3 analytics systems that run at web2 scale, reliability, and robustness. Interested candidates will have the opportunity to help define the new standards for blockchain analytics on top of the world’s premier object-centric blockchain.
This role requires a combination of high capacity hands-on individual contributor work and planning data architecture. As a Staff Data Engineer you will partner with the heads of Data and Engineering to effectively and smoothly process high-volume data in a reliable and robust way. You will get to touch all parts of the business at Mysten Labs and become the expert on data structures on the Sui blockchain.
Responsibilities:
Design and implement scalable ingestion pipelines with a reusable and modular framework
Build robust, reusable frameworks to ingest data from internal sources (e.g., Prod DBs, cloud buckets, etc) and external APIs or files (e.g., CSVs, webhooks).
Ensure idempotency, backfill support, and error handling in pipeline design.
Optimize and own data warehousing, with clear table definitions, schemas, cost efficiency
Architect a centralized data lake/warehouse with clear schemas and partitioning strategies.
Support both batch and streaming workloads, and optimize for cost and performance.
Enable data discoverability, usability, and governance
Implement or integrate data cataloging and lineage tools
Define naming conventions, documentation standards, and ownership metadata to make data self-serve and intuitive for data scientists, analysts, and product / GTM teams.
Set up a mechanism for scalable access controls (with RBAC or ABAC), PII tagging, and data obfuscation.
Enable approaches for data quality checks, validation pipelines, and alerting for broken or stale data
Develop a strong understanding of how to use on-chain and off-chain data together
Required Qualifications:
5+ years experience in data engineering
Strong SQL and Python
Strong and informed opinions on data orchestrators, catalogs, governance, and testing frameworks
Experience combining in-house and external data
Preferred Qualifications:
Experience with, or interest in learning, Rust
Prior blockchain and cryptocurrency experience
Experience designing and implementing streaming data solutions
Experience or interest in security analytics and data for security teams
Employment is contingent upon the successful completion of a background check, which may include verification of employment history, education credentials, criminal history, and other relevant information.
Regarding the recent rash of technology job scams: Be aware that emails from genuine Mysten Labs group recruiters will always come from the @mystenlabs.com domain or related subdomains (e.g., mystenlabs.com/careers). Remember: you can always verify positions on our job boards at www.mystenlabs.com/careers.
Our team is remote first and we are hiring across the world. Here at Mysten Labs, you’ll be joining a world-class team with tremendous growth potential as we bring the next billion users to web3. We raised a $300M Series B round from top Silicon Valley led venture funds like Jump Crypto, Andreessen Horowitz (a16z), Binance Labs, Redpoint, Lightspeed, Coinbase Ventures, Electric Capital, Standard Crypto, NFX, Slow Ventures, Scribble Ventures, Samsung Next, Lux Capital, among other investment firms and strategic partners. Come join us and build the future of web3!
Tags: APIs Architecture Blockchain Crypto Data quality Data Warehousing Engineering Pipelines Python Rust Security SQL Streaming Testing
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.