Senior Machine Learning Engineer, AI Platform
Tasks
- Design and manage GPU-based inference and training workloads
- Design, build, and operate AI platform components for ML model training, deployment, and serving
- Implement observability (metrics, logging, tracing, and alerting) for ML services
- Improve the model lifecycle: packaging, versioning, testing, validation, and deployment automation
- Optimize inference systems for throughput, latency, and cost
- Own model serving and inference workflows
- Participate in incident response, on-call rotations, and post-incident reviews
- Partner with product, infrastructure, security, and data teams to deliver scalable platform capabilities
- Propose architectural improvements and mentor engineers through code reviews
Perks/Benefits
- Accidental death and dismemberment insurance
- Birthday day off
- Country-specific holidays
- Disability insurance
- Employee assistance program
- Employee referral bonus
- Home office stipend
- Life insurance
- Medical, dental, and vision coverage
- Paid disability coverage
- Paid parental leave
- Professional development budget
- Retirement contributions
- Well-being stipend
- Wellness days
Skills/Tech-stack
Alerting | Batching | CI/CD | CUDA | Capacity Planning | Cloud Computing | Deployment Automation | Distributed Systems | Docker | GPU | Inference Pipelines | Kubernetes | Latency Optimization | Logging | Machine Learning | Metrics | Model Conversion | Model Packaging | Model Serving | Observability | Performance Tuning | Python | Quantization | Resource Utilization Optimization | Resource Utilization | Throughput Optimization | Tracing | Utilization Optimization | Version Control