AI Software Solutions Engineer (AI Frameworks, Workloads)

SRR4 - SRR4 - Sarjapur 4

Intel

Intels Innovation in den Bereichen Cloud-Computing, Rechenzentren, Internet der Dinge und PC-Lösungen macht die intelligente und vernetzte digitale Welt von heute möglich.

View all jobs at Intel

Apply now Apply later

Job Details:

Job Description: 

We are looking for a senior contributor to design, develop and optimize AI frameworks for Inference. In this role, you will work with a cross-geo teams to enhance the inference stack to ensure competitive performance on deep learning inference models with a specific focus on the PyTorch framework.

The roles and responsibilities that you would need to performance may include the following:

  • Design and develop SW techniques for AI frameworks - both HW-agnostic and HW-aware
  • Contribute to enhancing and extending the Inference  and Training capabilities in our Software stack
  • Profile deep learning inference workloads as needed and identify optimization opportunities

Qualifications:

  • BTech, MS or PhD in CS or related fields with an overall experience of 5+years
  • Atleast 2 or 3 years of experience working on Inference frameworks/tools for inference for deep learning models and that have been deployed/used by customers
  • Architecture/Design contributions to Inference systems
  • Detailed understanding of machine learning systems optimization and deployment techniques such as quantization
  • Experience with optimization techniques for deployment of Large Language Models (LLMs)
  • Deep implementation knowledge of transformers and inference specific optimizations
  • Programming skills in Advanced C++, Python and parallel programming skills
  • Ability to debug complex issues in multi-layered SW systems
  • Understanding of SW integration across open source frameworks and internal framework layers
  • Strong understanding of computer architecture
  • Effective communication skills and experience with working in a cross-geo setup

Preferred

  • Experience working on and contributing to Inference serving solutions
  • Knowledge of compiler algorithms for heterogeneous systems
  • Knowledge of open source compiler infrastructure like LLVM or gcc
  • Understanding of low-level kernels

          

Job Type:

Experienced Hire

Shift:

Shift 1 (India)

Primary Location: 

India, Bangalore

Additional Locations:

Business group:

The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. * Job posting details (such as work model, location or time type) are subject to change.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  1  1  0

Tags: Architecture Deep Learning LLMs Machine Learning Open Source PhD Python PyTorch Transformers

Region: Asia/Pacific
Country: India

More jobs like this

Explore more career opportunities

Find even more open roles below ordered by popularity of job title or skills/products/technologies used.