Distinguished Engineer - AI Computing System
Tasks
- Analyze training bottlenecks and propose optimizations
- Build AI training frameworks and operator libraries
- Collaborate on software-hardware co-optimization
- Collaborate with researchers on standards and patents
- Design cluster scheduling software architecture
- Develop acceleration libraries
- Develop training cluster technical roadmap
- Lead low-precision training optimization
- Lead parallel strategy tuning
- Optimize training resource allocation
- Plan AI training framework architecture
Skills/Tech-stack
Artificial Intelligence | Cloud Computing | Cluster computing | Cluster scheduling | Co-design | Distributed Systems | GPU Programming | Hardware co-design | Inference Optimization | Language Models | Large Language Models | Low-precision training | Machine Learning | Mixture of Experts | Model Inference | Model Inference Optimization | Multimodal Learning | NPU programming | Parallel Computing | Software-Hardware Co-Design | Software-hardware | Systems engineering | Training Optimization
Education
Bachelor of Engineering | Bachelor of Science | Master of Science