Senior Research Data Engineer (Canada)
Tasks
- Automate quality filtering and synthesis
- Build and own gold data layer
- Build reusable Databricks Spark transformation pipelines
- Capture data provenance and clinical event sequencing
- Curate multimodal datasets with metadata and labels
- Design gold data products for researcher AI needs
- Detect near duplicates
- Ensure point in time correctness
- Generate synthetic data using LLM APIs
- Implement weak supervision and programmatic labeling
- Maintain reproducible dataset snapshots and lineage
- Reverse engineer data semantics
- Support regulated data handling with controlled access
- Transform silver tables into curated gold datasets
- Validate and document datasets
Perks/Benefits
- N/A
Skills/Tech-stack
AWS | Airflow | Azure | Dagster | Data Drift | Data Versioning | Databricks | Databricks Workflows | Deidentification | Delta Lake | Embeddings | Feature Engineering | Generative AI | Git | HIPAA | Hugging Face | Hugging Face Datasets | LLM | LSH | MLflow | MinHash | Parquet | Prefect | PySpark | Python | RAG | SQL | Spark | Tokenization | Train Validation Test | Train validation test split | Unity Catalog | Weak Supervision
Education
N/A
Related jobs
-
Staff AI Engineer - Grafana AI/ML | USA | Remote CAD 186K-230KAWS | Agent Frameworks | Agent workflows | Alerting | AzureCompany funded AI coding assistant budget | Global annual leave policy | Remote workSenior-level Full TimeCanada (Remote) R1d ago
-
Senior AI Engineer - Grafana AI/ML | USA | Remote CAD 129K-217KAWS | Azure | Docker | GCP | GenAIAnnual leave policy | Company funded AI usage budget | Developer productivity support | Global culture | In-person onboardingSenior-level Full TimeCanada (Remote) R1d ago
-
Apache Flink | Apache Kafka | Data Observability | Data Processing | Data QualityFlexible vacation policy | Fully remote friendly | Health, dental, vision coverage | Hybrid flexibility | Learning and development supportSenior-level Full TimeCanada R1d ago
-
Data Engineer, Pricing CAD 108K-135KAWS | Amazon DynamoDB | Amazon S3 | Apache Airflow | Apache HBaseChild care benefits | Dental insurance | Disability insurance | Family building benefits | Flexible paid time offSenior-level Full TimeToronto, Canada R1d ago
-
Senior Data Engineer CAD 119K-154KAPI Gateway | AWS Aurora | AWS CloudWatch | AWS Lambda | AWS RDSHealth insurance | Parental leave | Professional development stipend | Remote workSenior-level Full TimeRemote - Canada R1d ago
-
AI Observability | AWS | Azure | CI/CD | Cloud platformCareer growth opportunities | Equity opportunities | Experimental development environments | Fully remote work within North America | Health, dental, and vision insuranceSenior-level Full TimeCanada R2d ago
-
Senior Machine Learning Engineer, Rider Applied AI CAD 149K-187KAWS | Apache Spark | Cloud platform | Deep learning | DockerChild care benefits | Commuter benefits | Dental insurance | Disability benefits | Family building benefitsSenior-level Full TimeToronto, Canada R2d ago
-
Lead Embedded Developer CAD 116K-155KAlgorithms | Bash | BigQuery | C# | CI/CDBaby bonus | Competitive medical and dental benefits | Electric vehicle purchase incentive | Home office reimbursement | Online learning and networking opportunitiesSenior-level Full TimeOakville, Ontario - Canada R2d ago
-
Sr. Machine Learning Engineer, Content Shopping CAD 143K-189KDataset Management | Graph Representation | Graph Representation Learning | Information Extraction | Language ModelsFlexibility | Remote workSenior-level Full TimeToronto, ON, CA R2d ago
-
Database Engineer (MONGO DB) CAD 108K-149KAWS | Access Control | Auditing | Backup/Restore | BashOn-call rotation | Remote workMid-level Full TimeCanada - Remote R2d ago
-
Staff AI Engineer (Acquia DAM) CAD 130K-165KAWS | Agent systems | Azure | Benchmarking | CI/CDHealth insurance | Paid time off | Parental leave | Recognition programs | Wellness programsSenior-level Full TimeRemote-Canada R2d ago
-
Senior Databricks Engineer USD 180K-247KAWS | Autoscaling | Azure | CI/CD | CachingVisa sponsorshipSenior-level Full TimeCanada R3d ago
-
Senior-level Full TimeCanada R3d ago
-
AI Savvy Data Analyst (English version) CAD 110K-140KAnomaly Detection | BigQuery | Cloud APIs | Colab | Data GovernanceAdditional day off | Birthday day off | Flex days paid | Flexible work hours | Montreal transit pass discountMid-level Full TimeMontréal, QC, Canada R3d ago
-
Bash | Data Pipelines | Distributed Systems | Docker | GCPAccess to cutting-edge technologies | Autonomy | Bonus | Collaborative culture | Distributed-first environmentMid-level Full TimeCanada R3d ago
-
APIs | CI/CD | Cloud platform | Compliance | ContainersAnnual leave | Dental coverage | Health coverage | High autonomy | Home office setup supportSenior-level Full TimeCanada R3d ago
-
Senior Backend/Applied AI Developer CAD 120K-170KAWS | AWS Bedrock | Artificial Intelligence | C# | CI/CDSenior-level Full TimeCanada - Remote R4d ago
-
Senior-level Full TimeCanada R6d ago
-
Mid-level Full TimeVancouver, Canada R6d ago
-
AI Engineer CAD 145K-172KA/B | A/B Testing | API Development | API Integration | B testingCareer growth opportunities | Comprehensive benefits | Hybrid schedule | Inclusive office environment | Robust training programMid-level Full TimeKitchener, Canada R6d ago
-
AI Observability | AWS | Azure | CI/CD | Cost ControlCareer advancement | Fully remote work | Professional development opportunities | Work-life balanceSenior-level Full TimeCanada R6d ago
-
AI Workflow Orchestration | AI workflow | AWS DynamoDB | AWS Lambda | AWS Step FunctionsArchitectural influence | Engineering Led Collaboration | High technical ownership | Learning opportunities | Remote-first work modelSenior-level Full TimeCanada R7d ago
-
Senior Data Engineer CAD 120K-160KAirflow | Azure DevOps | Bash | BigQuery | CI/CDCareer development | Flexible paid time off | Health/dental coverage | In-person gatherings | Learning opportunitiesSenior-level Full TimeToronto, Ontario, Canada - Remote R7d ago
-
Senior Data & Analytics Engineer CAD 105K-128KArtificial Intelligence | Data Analysis | Data Architecture | Data Engineering | Data LineageDental insurance | Flexible remote work | Health insurance | Paid time off | Pension planSenior-level Full TimeCA Victoria, Canada R7d ago
-
Senior Machine Learning Operations Engineer USD 166K-208KAlerting | CI/CD | Canary Deployment | Champion Challenger | Drift DetectionSenior-level Full TimeSan Francisco, CA, New York, NY, … R7d ago