AI SW Performance Engineer

POL - Gdansk, Poland

Intel

Stellen Sie KI im vollen Umfang bereit – mit umfassenden Hardware- und Software-Lösungen in der Cloud, in Rechenzentren, am Edge und iClient.

View all jobs at Intel

Apply now Apply later

Job Details:

Job Description: 

Come join us in the DCAI SW Solutions and Ecosystem Enabling Group. We're in search of a highly qualified candidate to join our EMEA team in close partnership with Habana team within our larger DCAI Software Group as an AI Software Performance Engineer.

In this role, you will:

  • Work with internal engineering teams, internal partners, and strategic customers to deliver optimized AI solutions using Habana Gaudi products with the goal of driving adoption of Gaudi product line.
  • Analyze and characterize customers' AI workloads and deliver to the customer a Best Known Configuration (BKC) for optimal performance on Intel/Habana.
  • Research technical trends and utilizes AI expertise to prototype solutions and develop AI software including open source libraries and models.
  • Optimize performance of AI models through deep knowledge and expertise of AI frameworks, algorithms, models, and related hardware.
  • Research, develop, and modify new or existing AI models, code, parameters, and/or quantization to address issues and modify operations to enhance customer performance.
  • Partner with AI algorithm and framework engineers as needed to optimize end-to-end AI models to Intel hardware features.
  • Serve as a trusted technical advisor and provide technical enabling across our portfolio of Strategic Customers.
  • Partner with Intel software and hardware product development teams to accelerate and optimize future products in AI domains by leading application pre-enabling and product hardening.
  • Deliver competitive and differentiated benchmark collateral and identify then drive key workloads into product requirements.
  • Architect innovative projection methodologies and tools.

Qualifications:

The ideal candidate will have the following attributes in addition to the qualifications listed below:

  • Excellent written and verbal communication skills, both technical and nontechnical
  • Major team player with demonstrated experience technically influencing others.
  • Strong problem-solving skills.
  • Passion to exceed expectations and constantly push the envelope.


Minimum Required Qualifications:

  • 2+ years of experience analyzing and optimizing software using Python to improve performance with an ability to identify and resolve performance bottlenecks.
  • 1+ years of experience running deep learning models with PyTorch (or similar deep learning frameworks) and building solutions.
  • 1+ years of experience in using transformers and large language models and understanding architecture of LLM.
  • 1+ years of experience as a user with source control systems Git and familiarized with CI/CD methodologies.


Preferred Qualifications:
Expertise in two or more of the following:

  • Experience with Gaudi products or AI on GPU
  • Experience developing, optimizing performance and integrating LLM (Llama, Mixtral ...)
  • Experience with model servings vLLM or TGI


Having the following qualifications is a plus:

  • Experience with writing low level Deep Learning kernel.
  • Experience with runtime environments with virtualization Linux containers schedulers resource managers.
  • Understanding of effective use of the hardware's memory.
  • Familiar with Habana profiler, Torch profiler, Vtune or similar profiling tools


Education Requirement:

  • Bachelor's degree in Electrical or Computer Engineering, Computer Science, Math, Physics, or related field plus 5 years of industry work experience, OR
  • Master's degree in Electrical or Computer Engineering, Computer Science, Math, Physics, or related field plus 3 years of industry work experience, OR
  • PhD in Electrical or Computer Engineering, Computer Science, Math, Physics, or related field plus 1 years of industry work experience.

What we offer:
At Intel, we offer a collaborative, supportive environment, where your equally brilliant colleagues will push you to be your best. There's no fear of failure - we know that's how innovation happens, and you'll never be bored.
We offer competitive benefits and pay, opportunities for professional development and the flexibility you need to achieve balance. Intel fosters a collaborative environment allowing the brightest minds in the world to come together to achieve exceptional results.

Competitive pay and Benefits:
Including stock programs, Quarterly Bonuses, Employee Pension Plan, Medical plan and life insurance, Peer to peer recognition, Lunch card, Multisport Card/Holiday card, Groups of enthusiasts, Exclusive employee discounts, (online) events and many more.

Opportunities for professional development and growth:
You will work in an international environment within a group of the best professionals in the world, working with the newest technologies. You'll have a chance to take part in advanced development programs, conferences and have free access to a wide library of classroom and online courses, covering both soft and technical skills.

Life and Community:
We offer opportunities for employees to refresh and recharge- flexible working time, benefits and services that support your wellbeing, and the chance to participate in Intel's Great Place to Work program which gathers people who love running, cycling, squash, tennis, cross fit, photography, and many more.

We guarantee you will be working in a safe environment, in an organization which profoundly understands the current health situation worldwide. At both our offices and in your home, the security and wellbeing of you and your family is our utmost responsibility.

Materials important for you - to learn more about Intel.
Learn more about Intel in Poland: https://intel.ly/3eq8QlY

          

Job Type:

Experienced Hire

Shift:

Shift 1 (Poland)

Primary Location: 

Poland, Gdansk

Additional Locations:

Business group:

The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. * Job posting details (such as work model, location or time type) are subject to change.
Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  0  0  0

Tags: Architecture CI/CD Computer Science Deep Learning Engineering Git GPU Linux LLaMA LLMs Mathematics Open Source PhD Physics Python PyTorch Research Security Transformers vLLM

Perks/benefits: Career development Competitive pay Conferences Equity / stock options Flex hours Flex vacation Health care Insurance Startup environment Team events

Region: Europe
Country: Poland

More jobs like this