UNIX Expert for AI/ML GPU-Cluster - Part Time (20 hours) (f/m/d)
Munich, Germany
NXP Semiconductors
Are you inspired by machine learning, data, AI, high performance computing, and are you ready for a new step by maintaining the high-performance computing farm for AI training? Do you want to contribute to enabling stability and reliability of a GPU-cluster to support the experts in enabling real world applications and to gain experience developing innovative ideas and improving the system? Then you will want to be a part of the growing Artificial Intelligence Competence Center at NXP, a leading semiconductor company.
Responsibilities:
At the AI Competence Center, we are looking for a part-time (20h/w basis) Unix expert passionate about GPU-clusters and high-performance computing. Leverage your creativity for realizing a reliable, maintainable, and secure high-performance computing cluster. Developing and deploying new solutions to enhance the usability of the cluster system with automation and scripting. Close collaboration with the local IT-team and our AI/ML experts – the users of the high-performance cluster. With your motivation and ideas, you contribute to providing a reliable setup, and you develop yourself and the team skills further.
Preferred skills:
Very strong experience with UNIX systems and scripting using python, or bash
Experience in network management, including DHCP, DNS, LDAP, Active Directory, SSSD, etc.
Experience with network attached storage systems
Experience with docker, Kubernetes, process management software
Experience with the setup and management of distributed resource management and job scheduling systems (e.g., Slurm or others)
Profound knowledge in AI-deployment and GPU-management including GPU-driver, CUDA-kernel management.
Hands on experience with setup up servers
Preferably some experience with setting up and managing some on-premises GUI monitoring dashboards (e.g., Grafana or others)
Your Profile:
Capability to work independently and accurately
Strong critical and analytical thinking
Good English communication skills for interaction with our multinational team across multiple sites
Looking for challenging the future of high-performance processing
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: CUDA Docker GPU Grafana HPC Kubernetes Machine Learning Python
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.