PySpark Data Engineer with Databricks
Tasks
- Build data ingestion and data modeling solutions
- Build orchestration workflows
- Design end-to-end ETL/ELT pipelines
- Develop and maintain Python and PySpark data pipelines
- Develop and operationalize ML workflows with MLflow
- Implement CI/CD for data and ML pipelines
- Implement data quality, validation, reconciliation, and anomaly detection
- Implement pipeline monitoring, logging, alerting, and observability
- Optimize Spark jobs for performance, scalability, and cost
Perks/Benefits
- Employee assistance programs
- Life and disability insurance
- Medical, dental, and vision coverage
- Paid time off
- Retirement savings plan
Skills/Tech-stack
Airflow | Alerting | Anomaly Detection | Apache Spark | CI/CD | Cluster tuning | Data Modeling | Data Quality | Data Validation | Databricks | Databricks Workflows | Distributed Systems | ELT | ETL | Logging | MLflow | Monitoring | Orchestration | PySpark | Python | Snowflake | Spark optimization
Education
N/A