Lead/Senior Data Engineer
US - Remote
phData
phData knows data. We're expert data engineers, data strategists and machine learning implementers. Our managed data services are end to end. Contact us for more information.phData is a remote-first global company with employees based in the United States, Latin America and India. We celebrate the culture of each of our team members and foster a community of technological curiosity, ownership and trust. Even though we're growing extremely fast, we maintain a casual, exciting work environment. We hire top performers and allow you the autonomy to deliver results.
- 5x Snowflake Partner of the Year (2020, 2021, 2022, 2023, 2024)
- Fivetran, dbt, Atlation, Matillion Partner of the Year
- #1 Partner in Snowflake Advanced Certifications
- 600+ Expert Cloud Certifications (Sigma, AWS, Azure, Dataiku, etc)
- Recognized as an award-winning workplace in US, India and LATAM
Required Experience:
- 4+ years as a hands-on Data Engineer and/or Software Engineer designing and implementing data solutions
- Ability to develop end-to-end technical solutions into production — and to help ensure performance, security, scalability, and robust data integration.
- Ability to multitask, prioritize, and work across multiple projects at once.
- Programming expertise in Java, Python and/or Scala, including experience with software development life cycle, including unit and integration testing
- Core cloud data platforms including Snowflake, AWS, Azure, Databricks and GCP
- Experience using SQL and the ability to write, debug, and optimize SQL queries
- Client-facing written and verbal communication skills and experience
- Create and deliver detailed presentations
- Detailed solution documentation (e.g. including POCS and roadmaps, sequence diagrams, class hierarchies, logical system views, etc.)
- 4-year Bachelor's degree in Computer Science or a related field
Prefer any of the following:
- Production experience in core data platforms: Snowflake (including Snowflake Native Apps), AWS, Azure, GCP, Hadoop, Databricks, IICS
- Cloud and Distributed Data Storage: S3, ADLS, HDFS, GCS, Kudu, ElasticSearch/Solr, Cassandra or other NoSQL storage systems
- Data integration technologies: Spark, Kafka, event/streaming, Streamsets, Matillion, Fivetran, NiFi, AWS Data Migration Services, Azure DataFactory, Informatica Intelligent Cloud Services (IICS), Google DataProc or other data integration technologies
- Multiple data sources (e.g. queues, relational databases, files, search, API)
- Complete software development lifecycle experience including design, documentation, implementation, testing, and deployment
- Automated data transformation and data curation: dbt, Spark, Spark streaming, automated pipelines
- Workflow Management and Orchestration: Airflow, AWS Managed Airflow, Luigi, NiFi
Why phData? We offer:
- Remote-First Work Environment
- Casual, award-winning small-business work environment
- Collaborative culture that prizes autonomy, creativity, and transparency
- Competitive comp, excellent benefits, 4 weeks PTO plus 10 Holidays (and other cool perks)
- Accelerated learning and professional development through advanced training and certifications
#LI-DNI
phData celebrates diversity and is committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at phData. We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at People Operations.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow APIs AWS Azure Cassandra Computer Science Databricks Dataproc dbt Elasticsearch FiveTran GCP Hadoop HDFS Informatica Java Kafka Matillion NiFi NoSQL Pinecone Pipelines Python RDBMS Scala SDLC Security Snowflake Spark SQL Streaming Testing
Perks/benefits: Career development
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.