AI Model Optimization and Tool Development Engineer (NPU)
Warsaw, Poland
42dot is seeking an AI Model Optimization and Tool Development Engineer (NPU) to focus on optimizing the autonomous driving stack and on-device large language models (LLMs). This role involves developing AI model optimization techniques for NPUs and building toolchains to ensure efficient execution. The engineer will be responsible for optimizing deep learning models for hardware accelerators, designing and developing toolchains that enhance performance, and supporting the advancement of AI technologies such as autonomous driving and LLMs through hardware-aware optimizations. This position plays a crucial role in bridging AI models with hardware accelerators, ensuring seamless integration and optimal efficiency.
Responsibilities
- AI Model Porting and Optimization
- Port AI models for LLM and autonomous driving stacks to NPU hardware and optimize their performance. Improve inference speed by utilizing techniques such as model compression (quantization, pruning, etc.), operator fusion, and memory optimization.
- Toolchain Development
- Design and implement toolchains for porting AI models to NPUs. Integrate with deep learning frameworks such as TensorFlow and PyTorch to provide an efficient workflow. Develop tools for NPU-specific code generation, profiling, and debugging.
- Optimization of Autonomous Driving and LLM Stacks
- Optimize AI modules required for autonomous driving (e.g., object detection, path planning) to ensure compatibility and real-time execution performance. Enhance memory efficiency and speed through LLM inference optimization. Apply model parallelization and distributed execution techniques in multimodal AI stacks.
- Performance Analysis and Improvement
- Analyze AI model runtime performance and identify bottlenecks. Implement techniques to maximize hardware utilization.
- Research and Adoption of New Technologies
- Study the latest advancements in AI model optimization and NPU-related technologies. Experiment with and adopt new techniques to maximize NPU performance.
Qualifications
- Bachelor’s or Master’s degree in Computer Science, AI, or a related field
- At least 3 years of experience in AI model optimization and hardware acceleration
- Experience optimizing AI models using NPUs, GPUs, or ASICs
- Proficiency in deep learning frameworks and model conversion tools such as TensorFlow Lite, ONNX, and PyTorch
- Expertise in model compression and optimization techniques, including quantization, pruning, and lazy evaluation
- Proficiency in programming languages such as CUDA, C++, and Python, with experience in writing hardware-accelerated code
- Strong understanding of memory management and parallel computing techniques
Preferred Qualifications
- Experience with autonomous driving stacks, including SLAM, path planning, and object recognition
- Optimization experience for on-device AI/LLM applications
- Familiarity with compiler technologies such as LLVM and MLIR
- Experience in AI optimization for embedded systems
- Contributions to open-source AI optimization projects
Interview Process
- Application Review → Coding Test → First Interview (~1 hour) → Second Interview (~3 hours) → Final Selection
- The interview process may vary depending on the position and is subject to change based on the schedule and circumstances.
- Applicants will be individually notified of the interview schedule and results via the email provided in their application.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Job stats:
0
0
0
Categories:
Deep Learning Jobs
Engineering Jobs
Tags: Autonomous Driving Computer Science CUDA Deep Learning LLMs ONNX Open Source Python PyTorch Research SLAM TensorFlow
Region:
Europe
Country:
Poland
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.
Data Engineer II jobsSr. Data Engineer jobsStaff Data Scientist jobsPrincipal Data Engineer jobsBI Developer jobsStaff Machine Learning Engineer jobsSenior AI Engineer jobsData Manager jobsData Science Intern jobsPrincipal Software Engineer jobsBusiness Data Analyst jobsJunior Data Analyst jobsData Specialist jobsData Science Manager jobsResearch Scientist jobsSoftware Engineer II jobsLead Data Analyst jobsData Analyst Intern jobsSr. Data Scientist jobsDevOps Engineer jobsData Engineer III jobsJunior Data Engineer jobsAI/ML Engineer jobsBI Analyst jobsSoftware Engineer, Machine Learning jobs
Snowflake jobsEconomics jobsLinux jobsOpen Source jobsKafka jobsNoSQL jobsHadoop jobsData Warehousing jobsAirflow jobsRDBMS jobsBanking jobsPhysics jobsComputer Vision jobsScala jobsMLOps jobsJavaScript jobsKPIs jobsGoogle Cloud jobsClassification jobsData warehouse jobsPostgreSQL jobsScikit-learn jobsGitHub jobsOracle jobsStreaming jobs
Terraform jobsLooker jobsSAS jobsR&D jobsPySpark jobsScrum jobsPandas jobsCX jobsDistributed Systems jobsBigQuery jobsData Mining jobsJira jobsdbt jobsRobotics jobsIndustrial jobsMicroservices jobsReact jobsJenkins jobsRedshift jobsUnstructured data jobsMySQL jobsNumPy jobsRAG jobsData strategy jobsELT jobs