Postdoctoral Appointee — HPC & AI Interconnect
Lemont, IL USA, United States
Full Time Mid-level / Intermediate USD 70K - 117K
Argonne National Laboratory
The Argonne Leadership Computing Facility (ALCF) is dedicated to advancing scientific discoveries and engineering breakthroughs by providing world-class computing facilities in collaboration with the computational science community. We empower researchers to tackle some of the most complex global challenges through our unique blend of supercomputing resources and computational science expertise.
The ALCF is seeking a postdoctoral appointee to join our team focused on designing the communication infrastructure for next-generation High-Performance Computing (HPC) and Artificial Intelligence (AI) systems. This position offers an exciting opportunity to work at the intersection of HPC and AI, addressing critical communication bottlenecks and optimizing network interconnects for large-scale distributed systems.
Objective:
- Develop and optimize workload-specialized interconnects and network-aware communication strategies to enhance the performance of AI and HPC workloads.
- Implement adaptive routing techniques and Software-Defined Networking (SDN) solutions to dynamically manage network congestion and improve communication efficiency.
- Research and develop topology-aware collective communication algorithms to optimize data-intensive operations in scientific and AI applications.
- Investigate machine learning techniques to inform heuristic methods for routing optimization, bridging theoretical insights with practical implementations.
- Address inter-job interference through traffic pattern awareness, creating isolated virtual network domains to maximize resource utilization and minimize performance variability.
Benefit to ALCF:
This postdoctoral position will contribute to the development of scalable and intelligent communication systems for large-scale computing platforms. By optimizing network interconnects and communication strategies, the appointee will help ALCF enhance the performance of AI-driven applications and HPC workloads, ensuring efficient utilization of resources and improved system predictability.
Position Requirements
Required skills and qualifications:
- A recent PhD (completed within the last 5 years) in computer science, electrical engineering, or a related field.
- Strong background in network interconnect design and optimization, with experience in adaptive routing and SDN technologies.
- Proficiency in programming languages such as Python, C/C++, and experience with parallel computing frameworks.
- Effective written and oral communication skills.
- Ability to model Argonne’s Core Values: Impact, Safety, Respect, Integrity, and Teamwork.
Preferred skills and qualifications:
- Experience with MPI and other communication libraries on supercomputers.
- Experience in writing technical papers and presentations.
- Ability to create, maintain, and support high-quality software is essential. The successful candidate will be expected to work with and contribute to domain-specific software and models.
- Experience with version control software such as git is essential.
Job Family
PostdoctoralJob Profile
Postdoctoral AppointeeWorker Type
Long-Term (Fixed Term)Time Type
Full timeThe expected hiring range for this position is $70,758.00-$117,925.00.Please note that the pay range information is a general guideline only. The pay offered to a selected candidate will be determined based on factors such as, but not limited to, the scope and responsibilities of the position, the qualifications of the selected candidate, business considerations, internal equity, and external market pay for comparable jobs. Additionally, comprehensive benefits are part of the total rewards package.
Click here to view Argonne employee benefits!
As an equal employment opportunity employer, and in accordance with our core values of impact, safety, respect, integrity and teamwork, Argonne National Laboratory is committed to a safe and welcoming workplace that fosters collaborative scientific discovery and innovation. Argonne encourages everyone to apply for employment. Argonne is committed to nondiscrimination and considers all qualified applicants for employment without regard to any characteristic protected by law.
Argonne employees, and certain guest researchers and contractors, are subject to particular restrictions related to participation in Foreign Government Sponsored or Affiliated Activities, as defined and detailed in United States Department of Energy Order 486.1A. You will be asked to disclose any such participation in the application phase for review by Argonne's Legal Department.
All Argonne offers of employment are contingent upon a background check that includes an assessment of criminal conviction history conducted on an individualized and case-by-case basis. Please be advised that Argonne positions require upon hire (or may require in the future) for the individual be to obtain a government access authorization that involves additional background check requirements. Failure to obtain or maintain such government access authorization could result in the withdrawal of a job offer or future termination of employment.
Tags: Computer Science Distributed Systems Engineering Git HPC Machine Learning PhD Python Research
Perks/benefits: Career development Equity / stock options
More jobs like this
Explore more career opportunities
Find even more open roles below ordered by popularity of job title or skills/products/technologies used.