Senior Data Engineer
Remote (Brazil)
- Remote-first
brightwheel
See why brightwheel is the best software solution for childcare providers in 2025. The all-in-one platform is built to save you and your staff time.
Our Team
Our team is passionate, talented, and customer-focused. We embody our Leadership Principles in our work and culture. We are a fully remote team with employees across every time zone in the US. Our exceptional investor group includes Addition, Bessemer, Emerson Collective, Lowercase Capital, Mark Cuban, Notable Capital, and others.
Who You Are
Brightwheel is seeking a Senior Data Engineer to join the Data Engineering team.
As a Senior Data Engineer at Brightwheel, you will play a key role in the implementation and evolution of our web scraping and data platform. You will be a technical leader responsible for crafting and implementing a best-in-class web scraping strategy and infrastructure. You will build and scale pipelines that gather millions of records across hundreds of sites and store them as measurable data that enables insights for our Analytics team and our customers.
You are passionate about data engineering and possess deep technical skills. You have built scalable web scraping platforms from the ground up. You have experience juggling multiple projects with shifting priorities while continuing to deliver value to the business. You are a curious, detail-oriented self-starter who wants to take full ownership of high-impact projects with visibility throughout the organization.
What You'll Do
- Use modern tooling to build a robust, extensible, and performant web scraping platform.
- Build thoughtful and reliable data acquisition and integration solutions to meet business requirements and data sourcing needs.
- Deliver best-in-class infrastructure solutions for flexible and repeatable applications across disparate sources.
- Troubleshoot, improve, and scale existing data pipelines, models, and solutions.
- Build upon data engineering's CI/CD deployments and infrastructure-as-code for provisioning AWS and third-party (Apify) services.
Qualifications, Skills, & Abilities
- 3+ years of work experience as a data engineer or full-stack engineer, coding in Python.
- 3+ years of experience building web scraping tools in Python, using Beautiful Soup, Scrapy, Selenium, or similar tooling.
- 1-3 years of deployment experience with CI/CD.
- Strong knowledge of HTML, CSS, JavaScript, and browser behavior.
- Experience with RESTful APIs and JSON/XML data formats.
- Knowledge of cloud platforms and containerization technologies (e.g., Docker, Kubernetes).
- Advanced understanding of how at least one big data processing technology works under the hood (e.g., Spark / Hadoop / HDFS / Redshift / BigQuery / Snowflake).
- Excellent analytical, problem-solving, and troubleshooting skills to manage complex process and technology issues without guidance.
Preferred Experience
- 2+ years of experience developing in Airflow.
- 2+ years deploying Infrastructure as Code within AWS or similar.
- 2+ years deploying microservices and/or APIs within a cloud environment.
- 1+ years using ML / AI workflows for data enrichment and/or sentiment analysis by integrating scraped data into ML pipelines.
Premium Benefits & Wellness Support
We want our team members and their families to thrive. We support this through:
- Healthcare Coverage: Medical, dental, and vision benefits typically valued at $15,000+, with brightwheel providing high coverage for both employees and families
- Generous Paid Parental Leave for growing families
- Flexible Paid Time Off (PTO) to recharge and relax
- 401(k) Enrollment to help you plan for the future
- Monthly Wellness & Productivity Stipend to support your well-being
Work-Life Flexibility
We are a fully remote company, giving you the flexibility to work from where you thrive. Say goodbye to the commuting hassle and reclaim valuable time for what matters most.
Brightwheel is committed to creating a diverse and inclusive work environment and is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, sexual orientation, national origin, genetics, disability, age, or veteran status.
Tags: Airflow APIs AWS Big Data BigQuery CI/CD Data pipelines Docker Engineering Hadoop HDFS JavaScript JSON Kubernetes Machine Learning Microservices Pipelines Python Redshift Selenium Snowflake Spark XML
Perks/benefits: Equity / stock options Flex hours Flex vacation Health care Home office stipend Medical leave Parental leave Wellness