Senior Machine Learning Engineer, AI Platform
Tasks
- Design and manage GPU based inference and training workloads
- Design build and operate AI platform components for ML model training deploy and serving
- Implement observability metrics logging tracing and alerting for ML services
- Improve model lifecycle packaging versioning testing validation and deployment automation
- Optimize inference systems for throughput latency and cost
- Own model serving and inference workflows
- Participate in incident response on call rotations and post incident reviews
- Partner with product infrastructure security and data teams for scalable platform capabilities
- Propose architectural improvements and mentor engineers through code reviews
Perks/Benefits
- Accidental death and dismemberment
- Birthday day off
- Country specific holidays
- Disability insurance
- Employee assistance program
- Employee referral bonus
- Home office stipend
- Life insurance
- Medical, dental, and vision coverage
- Paid disability coverage
- Paid parental leave
- Professional development budget
- Retirement contributions
- Well-being stipend
- Wellness days
Skills/Tech-stack
Alerting | Batching | CI/CD | CUDA | Capacity Planning | Cloud Computing | Deployment Automation | Distributed Systems | Docker | GPU | Inference Pipelines | Kubernetes | Latency optimization | Logging | Machine Learning | Metrics | Model Conversion | Model Packaging | Model Serving | Observability | Performance Tuning | Python | Quantization | Resource Utilization Optimization | Resource utilization | Throughput Optimization | Tracing | Utilization Optimization | Version control
Education
Related jobs
-
Senior Staff Data Engineer - Platform Data and Analytics CAD 228K-313KAWS | Alerting | Apache Airflow | Apache Spark | Cloud ComputingSenior-level Full TimeKitchener-Waterloo, ON; Toronto, ON R22h ago
-
API Development | AQL | AWS | Benchmarking | CypherBuddy program | Certification support | Global volunteer program | Holiday days | Hybrid work policySenior-level TemporaryToronto Canada R1d ago
-
Data Manipulation | Distributed Systems | Embeddings | Java | KubernetesCollaborative flat culture | Direct access to technical leadership | Exposure to cutting edge generative AI | Flexible schedule | High autonomyEntry-level Full TimeCanada R1d ago
-
Cloud Data | Cloud Data Platforms | DBT | Data Architecture | Data ProcessingCoworking allowance | Flexible paid time off | Health, dental and vision coverage | Home office stipend | Mental health resourcesSenior-level Full TimeCanada R1d ago
-
Azure Data | Azure Data Lakehouse | Azure SQL | DBT | Data GovernanceCareer growth | Continuous learning | Dental insurance | Flexible work arrangements | Global collaborationSenior-level Full TimeCanada R1d ago
-
API Integration | CRM | Context Management | Distributed Systems | Document stores401k retirement plan | Company offsite and team events | Flexible PTO | Fully remote | Health, dental, and vision insuranceSenior-level Full TimeCanada R1d ago
-
Senior Data Engineer CAD 102K-140KCI/CD | Data Factory | Data Governance | Data Modelling | DataOpsBirthday surprise | Enhanced parental leave | Free breakfast | Free fresh fruit | Health cash planSenior-level Full TimeHalifax, GB - Remote/Hybrid R1d ago
-
AWS Glue | AWS Lambda | AWS S3 | Access Control | Data GovernanceCareer growth opportunities | Collaborative and inclusive work environment | Diverse and inclusive culture | Flexible work arrangements | Permanent remote working modelSenior-level Full TimeCanada R1d ago
-
Big Data Senior Developer, Marketing & Advertising Data (French Services) (Telework/Hybrid) CAD 80K-106KApache Airflow | Apache Spark | Azure | Azure Event | Azure Event HubsBackground checks support for onboarding | Career growth opportunities | Employee resource groups | Flexible hours | Hybrid work environmentSenior-level Contract Full TimeQuebec (36.25), Canada R1d ago
-
API | AWS | AWS Kinesis | Agile | AnsibleCareer growth | Employee benefits | Hybrid work environmentSenior-level Full TimeMontreal 700, Canada R4d ago
-
Mid-level Full TimeCanada Remote - Consultant use only R4d ago
-
Expert AI Enablement Consultant (On-Call) CAD 120K-160KAPI Development | Agentic Workflows | Backend Development | Cloud deployment | Cost Optimization401k matching | Donation matching | Employee stock purchase plan | Flexible work arrangements | Professional development resourcesSenior-level Part TimeCanada Remote Office (CD99) R4d ago
-
Amazon Web Services | Backend Development | Code review | Debugging | Distributed SystemsDental and vision coverage | Employee stock purchase plan | Flexible spending wallets | Health care coverage | Remote-first work flexibilityMid-level Full TimeRemote Canada R5d ago
-
Staff Software Engineer Data - DC Tech Lead USD 148K-242KDBT | Databricks | ETL | PySpark | Python401k match | Flexible remote work | Health and wellness benefits | Home office stipend | Mental health supportSenior-level Full TimeRemote - Toronto, Ontario, Canada R5d ago
-
Senior-level Full TimeRemote- Canada R5d ago
-
Agent ou agente de recherche, conception, intégration et contrôle de systèmes robotiques mobiles CAD 64K-183KAerial robotics | Algorithmic Design | Algorithmic Design Tools | ArduPilot | ArducopterDental insurance | Disability insurance | Health insurance | Hybrid work option | Life insuranceMid-level Full TimeMontréal, QC, CA R5d ago
-
Staff AI Engineer - Grafana AI/ML | USA | Remote CAD 186K-230KAWS | Agent Frameworks | Agent workflows | Alerting | AzureCompany funded AI coding assistant budget | Global annual leave policy | Remote workSenior-level Full TimeCanada (Remote) R5d ago
-
Apache Flink | Apache Kafka | Data Observability | Data Processing | Data QualityFlexible vacation policy | Fully remote friendly | Health, dental, vision coverage | Hybrid flexibility | Learning and development supportSenior-level Full TimeCanada R6d ago
-
Data Engineer, Pricing CAD 108K-135KAWS | Amazon DynamoDB | Amazon S3 | Apache Airflow | Apache HBaseChild care benefits | Dental insurance | Disability insurance | Family building benefits | Flexible paid time offSenior-level Full TimeToronto, Canada R6d ago
-
Senior Data Engineer CAD 119K-154KAPI Gateway | AWS Aurora | AWS CloudWatch | AWS Lambda | AWS RDSHealth insurance | Parental leave | Professional development stipend | Remote workSenior-level Full TimeRemote - Canada R6d ago
-
AI Observability | AWS | Azure | CI/CD | Cloud platformCareer growth opportunities | Equity opportunities | Experimental development environments | Fully remote work within North America | Health, dental, and vision insuranceSenior-level Full TimeCanada R6d ago
-
Senior Machine Learning Engineer, Rider Applied AI CAD 149K-187KAWS | Apache Spark | Cloud platform | Deep learning | DockerChild care benefits | Commuter benefits | Dental insurance | Disability benefits | Family building benefitsSenior-level Full TimeToronto, Canada R6d ago
-
Lead Embedded Developer CAD 116K-155KAlgorithms | Bash | BigQuery | C# | CI/CDBaby bonus | Competitive medical and dental benefits | Electric vehicle purchase incentive | Home office reimbursement | Online learning and networking opportunitiesSenior-level Full TimeOakville, Ontario - Canada R6d ago
-
Sr. Machine Learning Engineer, Content Shopping CAD 143K-189KDataset Management | Graph Representation | Graph Representation Learning | Information Extraction | Language ModelsFlexibility | Remote workSenior-level Full TimeToronto, ON, CA R7d ago
-
Database Engineer (MONGO DB) CAD 108K-149KAWS | Access Control | Auditing | Backup/Restore | BashOn-call rotation | Remote workMid-level Full TimeCanada - Remote R7d ago