Senior Engineer - Large Model and Training System Performance Optimization
Tasks
- Accelerate training algorithms for AI models
- Assist with project planning and technology roadmaps
- Collaborate with global research teams
- Develop proofs of concept for training optimization
- File patents for critical algorithms
- Generate research reports and proposals
- Implement mixed precision and model compression
- Optimize training through the choice of optimizers and loss functions
- Publish AI research papers
- Track AI technology trends
Perks/Benefits
- N/A
Skills/Tech-stack
AI acceleration | AI accelerators | Artificial Intelligence | C# | C++ | DLProf | Deep learning | Deep reinforcement learning | DeepSpeed | Distributed Training | Full Stack | Full Stack AI Acceleration | Full-stack AI | GPU Architecture | GPU Optimization | Megatron | Mixed Precision | Model Compression | NPU optimization | Nsight Compute | Nsight Systems | Performance Analysis | Performance optimization | PyTorch | Python | Ray | Reinforcement Learning | System Performance | System Performance Optimization | VeRL