LLM Model Testing & Validation Engineer

Warszawa, Masovian Voivodeship, Poland

Full Time Senior-level / Expert EUR 95K - 176K * ^est.

Tenstorrent

Tenstorrent is a next-generation computing company that builds computers for AI. Headquartered in the U.S. with offices in Austin, Texas, and Silicon Valley, and global offices in Toronto, Belgrade, Seoul, Tokyo, and Bangalore, Tenstorrent...

View all jobs at Tenstorrent

Apply now Apply later

Posted 3 days ago

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

We are looking for an experienced engineer to drive AI workload productization and benchmarking for Large Language Models (LLMs). This role focuses on making models customer-ready, developing benchmarking infrastructure, and ensuring our AI models deliver industry-leading efficiency and scalability.

This role is on-site, based out of Warsaw, Poland.

We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Responsibilities:

Design and execute comprehensive model testing protocols to ensure robustness and scalability of AI models.
Develop and execute performance and accuracy benchmarking tests for AI workloads across various computational environments.
Analyze and optimize system performance using advanced profiling and tuning techniques.
Conduct competitive analysis and positioning to inform strategic decision-making and product development.
Collaborate with cross-functional teams to integrate best practices and innovations in AI performance optimization.
Integrate LLMs with popular inference server platforms (e.g., vLLM), perform testing and benchmarking using these platforms, and stay up to date with the latest inference server trends to influence strategic decision-making.
Track AI model accuracy and performance in a CI/CD environment. Identify and triage regressions, and implement or drive fixes with other teams to maintain the accuracy and performance of the models.

Experience & Qualifications:

Bachelor's, Master’s, or PhD in Computer Science, Electrical Engineering, Machine Learning, or a related field.
Strong background in AI model benchmarking and profiling.
Experience with scalable AI infrastructure, including distributed computing environments.
Proficiency in Python for AI workload optimization.
Familiarity with LLM frameworks, AI accelerators, and performance tuning methodologies.
Familiarity with Github CI/CD environments is a requirement.
Familiarity with LLM inference servers (e.g. vLLM) is bonus.
Ability to interpret and analyze hardware/software interactions to maximize AI model efficiency.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have been licensing conditions set by the U.S. government.

As this position will have direct and/or indirect access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations, please note that citizenship/permanent residency, asylee and refugee information and supporting documentation will be required and considered as a condition of employment.

If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.

Apply now Apply later

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats: 1 0 0

Category: Engineering Jobs

Tags: CI/CD Computer Science Engineering GitHub LLMs Machine Learning ML infrastructure PhD Python Testing vLLM