Senior Software Engineer, AI Inference
Tasks
- Analyze profiling results and plan optimization improvements
- Build benchmarking harnesses and automation pipelines
- Collaborate with kernel engineering and OSS teams to drive improvements
- Deploy and tune vLLM serving on GPU clusters
- Design benchmarking campaigns for LLM serving
- Document architectures and recommendations
- Profile GPU performance with Nsight Systems and Nsight Compute
Perks/Benefits
- N/A
Skills/Tech-stack
C++ | Chunked prefill | Continuous batching | Cutlass | Docker | GPU Performance | GPU performance analysis | KV cache | Kubernetes | LLM serving | Linux | Memory hierarchy | Nsight Compute | Nsight Systems | Nvidia Dynamo | Performance Analysis | Pipeline parallelism | Profiling | Python | Roofline modeling | Slurm | Tensor Parallelism | TorchInductor | Triton | VLLM
Education
Related jobs
-
Senior-level Full TimeCanada8h ago
-
Forward Deployed Developer IV, GenAI, Google Cloud CAD 204K-204KAPIs | Agent systems | Agentic Workflows | CrewAI | Data integrationSenior-level Full TimeToronto, ON, Canada12h ago
-
Agile | C++ | Cloud infrastructure | Data Processing | DebuggingSenior-level Full TimeWaterloo, ON, Canada12h ago
-
AI Scientist, Risk Modelling (Questbank) CAD 90K-120KAWS | Agile | Artificial Intelligence | Azure | CI/CDCareer growth and development | Community volunteering opportunities | Health and wellbeing resources | Hybrid work schedule | Paid sick daysMid-level Full TimeToronto, ON, M2N 5M9, CA21h ago
-
Airflow | Apache Kafka | Apache Spark | Azure | Azure EventCompetitive benefits | Employee resource groups | Flexible work schedule | Hybrid work environment | Mentorship opportunitiesSenior-level Full TimeMontreal - MRC (Papineau) (36.25), Canada23h ago
-
Applied AI Backend Software Developer CAD 76K-105KAPI | AWS | Azure | CI/CD | DockerCritical illness insurance | Employee resource groups | Health and dental coverage | Life insurance | Long-term disabilityNone Full TimeOttawa, Canada23h ago
-
Applied AI Full Stack Engineer MacOS CAD 105K-130KAgile | C++ | CI/CD | Design Patterns | DevOps14 Annual Holidays | Critical illness insurance | Employee resource groups | Health and dental coverage | Life insuranceMid-level Full TimeOttawa, Canada23h ago
-
Artificial Intelligence | Automation | C Sharp | C plus plus | ChatGPTCareer development program | Employee share purchase plan | Generous vacation policy | Hybrid work environment | Maternity/parental top-upEntry-level InternshipToronto1d ago
-
Senior Software Engineer, Analytics USD 135K-169KAWS | Debugging | GraphQL | Monitoring | Node401k retirement plan | Commuter and parking accounts | Dental insurance | Disability insurance | Emergency weather supportSenior-level Full TimeRemote - Canada R1d ago
-
Data Engineer CAD 108K-135KAWS | Ansible | Apache Airflow | Apache Spark | ChefChild care benefits | Disability benefits | Family building benefits | Flexible paid time off | Health and dental coverageMid-level Full TimeToronto, Canada R1d ago
-
Data Engineer CAD 108K-135KAWS | Airflow | Data Modeling | Data Quality | Data WarehousingChild care and pet benefits | Family building benefits | Flexible paid time off | Health and dental coverage | Health care savings accountMid-level Full TimeToronto, Canada R1d ago
-
Mid-level Full TimeToronto, Calgary1d ago
-
Staff AI Engineer | Canada | Remote CAD 186K-223KAPI Design | Agent systems | BigQuery | Cloud Functions | Cloud RunAnnual leave policy | In-person onboarding | Remote work | Restricted stock units (RSUs)Senior-level Full TimeCanada (Remote) R1d ago
-
Senior Software Developer, Data Platform Infrastructure CAD 120K-158KAlerting | Apache Spark | CDC | CI/CD | DatabricksSenior-level Full TimeMontreal (Province of Quebec, Canada)1d ago
-
Senior Data Engineer - (Python/Spark/SQL) CAD 130K-144KApache Airflow | Apache Hive | Apache Spark | Apache Zeppelin | Cloud ComputingHybrid work environment | Remote work days per weekSenior-level Full TimeToronto, ON, Canada1d ago
-
Data Integration Engineer CAD 62K-113KApache Spark | Azure | Azure Data | Azure Data Factory | Azure Data LakeHybrid work | Pension matching | Performance bonus | Profit sharing | Vacation benefitsMid-level Full TimeToronto, ON, CA, M5H1H11d ago
-
Senior Data Science Engineer CAD 120K-150KDistributed Computing | Machine Learning | Python | R | SQLSenior-level Full TimeRemote - Canada R1d ago
-
Availability | Cause analysis | Condition-Based Maintenance | Cybersecurity | DNP3Senior-level Full TimeMarkham, Canada R1d ago
-
Senior Machine Learning Engineer CAD 142K-200KBitbucket | CI/CD | Data Drift | Databricks | EmbeddingsHybrid work modelSenior-level Full TimeLOC0001549, Canada1d ago
-
Senior AI Engineer CAD 126K-164KAI orchestration | API Development | Agent Based Workflows | Agent-based | Azure OpenAICareer development | Mentoring programs | Online learning platform | Skill development | Training and onboardingSenior-level Full TimeTD Centre - TD Tower - …1d ago
-
Senior Machine Learning Engineer, vLLM CAD 131K-200KComputer Vision | Deep learning | Graph theory | Language Processing | Linear AlgebraOpen source collaboration | Remote work optionsSenior-level Full TimeRemote CA ON, Canada R1d ago
-
Engineering or Computer Science Intern CAD 67K-85KAutomated testing | DevOps | File automation | JavaScript | License File AutomationEntry-level Full Time InternshipCAN Kanata (2), ON - WR, …1d ago
-
Azure Data Platform Support Engineer CAD 75K-104KAzure Data | Azure Data Factory | Azure Data Lake | Azure Data Lake Storage | Azure WorkbooksBanking benefits | Employee recognition program | Employee share purchase plan | Paid time off | Pension planSenior-level Full TimeCIBC Square Banking Centre, Canada1d ago
-
Data Engineer CAD 90K-130KAWS Glue | Azure Purview | CI/CD | Data Governance | Data ModelingFlexible dress code | Hybrid work modelMid-level Full Time5th Avenue Place, Canada1d ago
-
AI Engineer CAD 146K-191KBigQuery | Cache Augmented Generation | Cloud Platforms | Function Calling | GitOpsCareer growth | Healthcare benefitsSenior-level Full TimeBrampton, Ontario, Canada1d ago