Staff Data Engineer
Remote (Brazil)
- Remote-first
- Website
- @brightwheel 𝕏
- GitHub
- Search
brightwheel
See why brightwheel is the best software solution for childcare providers in 2025. The all-in-one platform is built to save you and your staff time.
Our Mission and OpportunityEarly education is one of the greatest determinants of childhood outcomes, is a must for working families, and has a lasting social and economic impact. Brightwheel’s vision is to enable high quality early education for every child — by giving teachers meaningfully more time with students each day, engaging parents in the development of their kids, and supporting the small businesses that make up the backbone of the $175 billion early education market. Brightwheel is the most loved technology brand in early education globally, trusted by thousands of educators and millions of families.
Our TeamWe are a fully remote team with employees across every time zone in the US. Our team is passionate, talented, and customer-focused. Our exceptional investor group includes Addition, Bessemer Venture Partners, Chan Zuckerberg Initiative, GGV Capital, Lowercase Capital, Emerson Collective, and Mark Cuban.
We believe that everyone—from our employees to the students, teachers, and administrators we serve— should be given the opportunity to learn and thrive, whatever their background may be. We celebrate diversity in all forms because it allows our team and the communities we serve to reach their full potential and do their best work. From decision making, to how we operate, we ground ourselves in our Leadership Principles every day.
Who You AreBrightwheel is seeking a Staff Data Engineer to join the Data Engineering team.
As a Staff Data Engineer at Brightwheel, you will play a key role in the implementation and evolution of our web scraping and data platform. You will be a technical leader responsible for crafting and implementing a best in class web scraping strategy and infrastructure. You will build and scale pipelines that garner millions of records, across hundreds of sites, stored as measurable data that enable insights for our Analytics team and our customers.
You are passionate about data engineering and possess deep technical skills. You have built scalable web scraping platforms from the ground up. You have experience juggling multiple projects with shifting priorities while continuing to deliver value to the business. You are a curious, detail oriented, self-starter who wants to take full ownership of high impact projects with visibility throughout the organization.
Our TeamWe are a fully remote team with employees across every time zone in the US. Our team is passionate, talented, and customer-focused. Our exceptional investor group includes Addition, Bessemer Venture Partners, Chan Zuckerberg Initiative, GGV Capital, Lowercase Capital, Emerson Collective, and Mark Cuban.
We believe that everyone—from our employees to the students, teachers, and administrators we serve— should be given the opportunity to learn and thrive, whatever their background may be. We celebrate diversity in all forms because it allows our team and the communities we serve to reach their full potential and do their best work. From decision making, to how we operate, we ground ourselves in our Leadership Principles every day.
Who You AreBrightwheel is seeking a Staff Data Engineer to join the Data Engineering team.
As a Staff Data Engineer at Brightwheel, you will play a key role in the implementation and evolution of our web scraping and data platform. You will be a technical leader responsible for crafting and implementing a best in class web scraping strategy and infrastructure. You will build and scale pipelines that garner millions of records, across hundreds of sites, stored as measurable data that enable insights for our Analytics team and our customers.
You are passionate about data engineering and possess deep technical skills. You have built scalable web scraping platforms from the ground up. You have experience juggling multiple projects with shifting priorities while continuing to deliver value to the business. You are a curious, detail oriented, self-starter who wants to take full ownership of high impact projects with visibility throughout the organization.
What You'll Do
- Use modern tooling to build robust, extensible, and performant web scraping platform
- Build thoughtful and reliable data acquisition and integration solutions to meet business requirements and data sourcing needs.
- Deliver best in class infrastructure solutions for flexible and repeatable applications across disparate sources.
- Troubleshoot, improve and scale existing data pipelines, models and solutions
- Build upon data engineering's CI/CD deployments, and infrastructure-as-code for provisioning AWS and 3rd party (Apify) services.
Qualifications, Skills, & Abilities
- 5+ years of work experience as a data engineer/full stack engineering, coding in Python.
- 5+ years of experience building web scraping tools in python, using Beautiful Soup, Scrapy, Selenium, or similar tooling
- 3-5 years of deployment experience with CI/CD
- Strong experience of HTML, CSS, JavaScript, and browser behavior.
- Experience with RESTful APIs and JSON/XML data formats.
- Knowledge of cloud platforms and containerization technologies (e.g., Docker, Kubernetes).
- Advanced understanding of how at least one big data processing technology works under the hood (e.g. Spark / Hadoop / HDFS / Redshift / BigQuery / Snowflake)
- Excellent analytical, problem solving, and troubleshooting skills to manage complex process and technology issues without guidance
- 2+ years of experience developing in Airflow
- 2+ deploying Infrastructure as Code within AWS or similar
- 2+ deploying microservices and/or APIs within cloud environment
- 1+ years using ML / AI workflows for data enrichment and/or sentiment analysis by integrating scraped data into ML pipelines.
Preferred Experience
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
1
0
0
Categories:
Engineering Jobs
Leadership Jobs
Tags: Airflow APIs AWS Big Data BigQuery CI/CD Data pipelines Docker Engineering Hadoop HDFS JavaScript JSON Kubernetes Machine Learning Microservices Pipelines Python Redshift Selenium Snowflake Spark XML
Perks/benefits: Flex hours
Regions:
Remote/Anywhere
South America
Country:
Brazil
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Sr. Data Engineer jobsData Scientist II jobsStaff Data Scientist jobsBI Developer jobsStaff Machine Learning Engineer jobsPrincipal Data Engineer jobsData Manager jobsSenior AI Engineer jobsJunior Data Analyst jobsBusiness Data Analyst jobsData Science Manager jobsResearch Scientist jobsData Science Intern jobsPrincipal Software Engineer jobsData Specialist jobsLead Data Analyst jobsSoftware Engineer II jobsData Analyst Intern jobsSr. Data Scientist jobsData Engineer III jobsBI Analyst jobsSoftware Engineer, Machine Learning jobsAI/ML Engineer jobsData Analyst II jobsDevOps Engineer jobs
Snowflake jobsEconomics jobsLinux jobsOpen Source jobsData Warehousing jobsAirflow jobsNoSQL jobsGoogle Cloud jobsHadoop jobsKafka jobsComputer Vision jobsRDBMS jobsMLOps jobsBanking jobsJavaScript jobsClassification jobsScikit-learn jobsKPIs jobsPhysics jobsData warehouse jobsScala jobsStreaming jobsOracle jobsTerraform jobsGitHub jobs
Looker jobsSAS jobsPostgreSQL jobsR&D jobsScrum jobsPySpark jobsPandas jobsCX jobsBigQuery jobsData Mining jobsJira jobsDistributed Systems jobsdbt jobsRobotics jobsIndustrial jobsUnstructured data jobsMicroservices jobsRedshift jobsReact jobsData strategy jobsPharma jobsJenkins jobsNumPy jobsE-commerce jobsGPT jobs