Sr Principal Software Engineer, Quantization (AI2324)
San Jose, California, United States
Applications have closed
SiMa.ai
Introducing the first Machine Learning SoC (MLSoC™) platform, purpose-built to let you effortlessly scale and deploy ML at the embedded edge. Effortless ML, artificial intelligence, MLSoC, Palette software, Edgematic.
Job Title: Sr Principal Software Engineer, Quantization Job Location: San Jose, CA
Job Number: AI2324 Job Description: SiMa.ai is seeking an outstanding researcher working on efficient deep learning to join the MLSoC Platform Architecture team. We are passionate about pushing the boundaries of Edge AI with power efficient inferencing. We are particularly interested in Post Training Quantization and Pruning techniques applied to quantization of CNN and Transformer based Neural Networks for inference primarily on int8 Machine Learning Accelerator (MLA) and on mixed precision MLA. You will work with an amazing team of engineers that pushes the boundaries and your contributions will have a chance to create a real impact in our products. Sr. Principal Engineer Key Responsibilities (including but not limited to):
Job Number: AI2324 Job Description: SiMa.ai is seeking an outstanding researcher working on efficient deep learning to join the MLSoC Platform Architecture team. We are passionate about pushing the boundaries of Edge AI with power efficient inferencing. We are particularly interested in Post Training Quantization and Pruning techniques applied to quantization of CNN and Transformer based Neural Networks for inference primarily on int8 Machine Learning Accelerator (MLA) and on mixed precision MLA. You will work with an amazing team of engineers that pushes the boundaries and your contributions will have a chance to create a real impact in our products. Sr. Principal Engineer Key Responsibilities (including but not limited to):
- Research, design and implement novel methods to improve PTQ techniques for both int8 and mixed-precision (int8 + bf16) quantization.
- Collaborate with other team members to understand the limitations of our Machine Learning Accelerator and adapt your strategy based on their input.
- Prototype PTQ techniques using Fake Quantization in PyTorch, as well as modify internal tools to implement quantized operators to verify accuracy.
- Understand state-of-the-art research in PTQ and apply it to CNN and Transformer based Neural Networks.
- Help define timeline and deliverables and be accountable for them.
- PhD in electrical engineering or computer science with 6+ years research numerical methods and tools in efficient Neural Network inferencing.
- Proficient in techniques like HAWQ2, and RL based methods for Mixed-precision quantization.
- Proficient in state-of-the-art PTQ techniques like Optimum Brain Compression for LLMs.
- Proficient with PyTorch or other Quantization exploration frameworks like Model Compression Toolkit.
- Excellent programming skills in C++, Python.
- Co-authored internal technical presentations, research papers and disclosures/patents on key technical topics
- Noteworthy technical contributions, which were multi-disciplinary and in collaboration with other cross-functional teams.
Job stats:
9
0
0
Categories:
Deep Learning Jobs
Engineering Jobs
Tags: Architecture Classification Computer Science Deep Learning Engineering LLMs Machine Learning PhD Python PyTorch Research
Region:
North America
Country:
United States
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
BI Developer jobsData Engineer II jobsSr. Data Engineer jobsPrincipal Data Engineer jobsStaff Data Scientist jobsPrincipal Software Engineer jobsStaff Machine Learning Engineer jobsData Science Manager jobsBusiness Intelligence Analyst jobsData Manager jobsData Science Intern jobsSoftware Engineer II jobsDevOps Engineer jobsJunior Data Analyst jobsData Analyst Intern jobsData Specialist jobsSr. Data Scientist jobsBusiness Data Analyst jobsStaff Software Engineer jobsLead Data Analyst jobsAI/ML Engineer jobsSenior Backend Engineer jobsData Governance Analyst jobsData Engineer III jobsResearch Scientist jobs
NLP jobsAirflow jobsOpen Source jobsMLOps jobsTerraform jobsKPIs jobsEconomics jobsLinux jobsKafka jobsNoSQL jobsJavaScript jobsData Warehousing jobsComputer Vision jobsGoogle Cloud jobsPostgreSQL jobsRDBMS jobsGitHub jobsScikit-learn jobsStreaming jobsPhysics jobsBanking jobsR&D jobsData warehouse jobsScala jobsHadoop jobs
dbt jobsPandas jobsBigQuery jobsClassification jobsReact jobsOracle jobsLooker jobsScrum jobsDistributed Systems jobsRAG jobsCX jobsPySpark jobsPrompt engineering jobsMicroservices jobsELT jobsRedshift jobsIndustrial jobsJira jobsRobotics jobsGPT jobsTypeScript jobsOpenAI jobsSAS jobsLangChain jobsModel training jobs