Lead – HPC Systems
Qatar
Sidra Medicine
Sidra Medicine is a state-of-the-art facility committed to providing women and children in Qatar with world-class tertiary healthcare services.JOB SUMMARY:
The Lead – HPC Systems provides systems programming and management functions for large scale, high performance computing systems. Responsibilities include working as member of systems team on the systems administration, integration and maintenance of high performance computing systems, clusters, as well as other systems and peripherals, including advanced filesystems, enterprise storage systems, virtualization environments, and networks. S/he works with other high-performance computing and systems administration staff and with technical team leaders to accomplish complex system integration, deployment, and administration projects, system performance analyses, problem resolution, and system security initiatives. The incumbent works with senior staff on the development of system management strategies, architectural assessments, system tools, and software for the administration of production systems. S/he provides technical assistance and consultation for researchers, and technical staff on the use of the high-performance computing platforms. The Lead – HPC Systems works closely with other staff and departmental entities to provide a comprehensive support infrastructure for academic, commercial, and government users, from a broad range of research disciplines.
KEY ROLE ACCOUNTABILITIES:
- Provides systems’ support for advanced research computing environment, to include the installation, integration and management of high-performance computer systems, clusters, operating systems, peripherals, and system interfaces, monitors system usage, ensures that the high-performance computing complex is operating at optimal performance and reliability levels, additional duties include consulting, training and the development and maintenance of systems documentation.
- Works in collaboration with senior systems staff to manage the hardware and systems software infrastructure to provide an effective, reliable, high performance, scalable computing environment.
- Participates in the configuration and tuning of batch queuing systems in a massively production environment, collects system utilization statistics, identifies and resolves computer system anomalies and operational problems, and provides systems support for electronic mail, name resolution, and file sharing services.
- Maintains an understanding of state-of-the-art computing systems and peripherals, computer operating systems, and scalable architectures.
- Works with users and other computational professionals in evaluating user requirements, and in the configuration and deployment of computational resources.
- Works with computer hardware and software vendors to maintain an understanding of industry trends and evolving technology.
- Provides consulting and technical support for marketing and outreach activities.
- Serves as project leader on small to medium-sized projects.
- Solves moderately complex problems and tasks independently.
- Performs miscellaneous job-related duties as assigned.
- Works with vendors on new trends and measure to cut cost, time and improve productivity.
- Manages Storage and Backup licenses and support subscriptions.
- Develops and maintains backup and recovery strategy for all Environments.
- Adheres to Sidra’s standards as they appear in the Code of Conduct and Conflict of Interest policies
- Adheres to and promotes Sidra’s Values
QUALIFICATIONS, EXPERIENCE AND SKILLS:
ESSENTIAL PREFERRED Education Bachelor’s Degree in Computer Science or related field Experience7+ years of experience in relevant field inclusive of at least 2+ years of progressive management experience:
- Experience in configuring/managing environments with failover clustering, high availability and disaster recovery.
- Experience with storage and backup technologies and strategies oriented around the backup and restoration of large datasets
- Experience in healthcare-related fields, demonstrated expertise in healthcare operations, health information knowledge, change management and project management
- Experienced with (or equivalent) the following regulations and frameworks: PCI, HIPAA, and ISO/IEC 2700x
- Good understanding of Regulatory Compliance, Risk Management, Privacy
- PMP
- ITIL V3
- Good working knowledge of one or more scripting languages such as csh, Bash, Awk, perl, Python, etc.
- Good working knowledge of high performance computing systems, scalable and Linux operating system.
- Knowledge of advanced data storage technologies and high-speed network interfaces.
- Ability to contribute to the development of technical design decisions involving software or hardware implementation strategies.
- Ability to monitor system usage and performance statistics and to understand the impacts of operating system tuning parameters.
- Understanding of system administration, troubleshooting and performance tuning of RedHat Enterprise Linux, & Windows Servers.
- Skilled experience in the installation and configuration of operating systems and applications software.
- Experience with complex problem resolution procedures, testing and evaluation methods, and programming tools.
- Experience with network security procedures and protocols.
- Ability to assist technical management and Direct in gathering user requirements and planning and designing computer systems.
- Ability to understand and follow established methods and procedures for the integration, testing and installation of system modifications.
- Ability to analyze requirements and determine computational resource impacts.
- Excellent communication skills.
- Demonstrates strong technical judgment.
- Proficiency with Microsoft Office suite
- Fluency in written and spoken English
- Working knowledge of one or more high-level programming languages such as C, C++, or Fortran.
- Knowledge of Virtualization (OVM/VMware)
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Architecture Clustering Computer Science Consulting Fortran HPC ITIL Linux Perl Privacy Python Research Security Statistics Testing
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.