System Engineer (Servers Hardware R&D Team)
Mäntsälä, Finland
Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 500 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
The role
Nebius is looking for a System Engineer (Servers Hardware R&D Team). You’re welcome to work in our data center in Mäntsälä, Finland.
Key Responsibilities:
- Participate in the design, deployment, and maintenance of high-performance cloud systems tailored for AI workloads.
- Arrange and perform hardware R&D tests and experiments on site in the data center.
- Troubleshoot and resolve complex system issues related to GPUs, networking (InfiniBand, NVLink), PCIe, and server infrastructure.
- Conduct deep investigations into hardware, software, and networking problems to ensure optimal system performance and reliability.
- Participate in the development of tests and test methodologies for advanced GPU, InfiniBand, and compute systems to benchmark and validate performance.
- Collaborate with cross-functional teams to improve system performance and reliability.
- Monitor system performance and continuously fine-tune configurations for maximum efficiency.
Required Skills & Qualifications:
- Strong knowledge of modern server architecture, especially in high-performance GPU-based environments.
- Hands-on experience with GPUs, networking, NVLink, and PCIe.
- Proficient in Linux systems, with expertise in Python and Bash scripting for automation.
- Demonstrated ability to troubleshoot complex system issues, including hardware, software, and networking problems.
- Experience with deep problem investigation, root cause analysis, and resolving performance issues in cloud-based or high-performance computing environments.
- Strong analytical and problem-solving skills, with a focus on optimizing system performance.
- Basic electronics modification skills, including soldering and wiring.
Nice to Have:
- Knowledge of the Linux kernel and experience with kernel-level troubleshooting.
- Familiarity with electronic measurement equipment: oscilloscope, multimeter.
What we offer
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Hybrid working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
We’re growing and expanding our products every day. If you’re up to the challenge and are as excited about AI and ML as we are, join us!