Distinguished Engineer - AI Computing System
Tasks
- Analyze training bottlenecks and propose optimizations
- Build AI training frameworks and operator libraries
- Collaborate on software-hardware co-optimization
- Collaborate with researchers on standards and patents
- Design cluster scheduling software architecture
- Develop acceleration libraries
- Develop training cluster technical roadmap
- Lead low-precision training optimization
- Lead parallel strategy tuning
- Optimize training resource allocation
- Plan AI training framework architecture
Skills/Tech-stack
Artificial Intelligence | Cloud Computing | Cluster computing | Cluster scheduling | Co-design | Distributed Systems | GPU Programming | Hardware co-design | Inference Optimization | Language Models | Large Language Models | Low-precision training | Machine Learning | Mixture of Experts | Model Inference | Model Inference Optimization | Multimodal Learning | NPU programming | Parallel Computing | Software-Hardware Co-Design | Software-hardware | Systems engineering | Training Optimization
Education
Bachelor of Engineering | Bachelor of Science | Master of Science