Lead HPC Systems Operations Engineer

Penn State University

University Park Campus
Date Announced:
Date Closing:
open until filled
Job Number:
Level/Salary Band:
  • 03 – N – Exempt
  • 04 – O – Exempt
Work Unit:
Vice President for Research
The Institute for CyberScience
Full/Part Time:


The Institute for CyberScience (ICS) seeks a Lead HPC Systems Operations Engineer to join our team. You will be an integral contributor to a dynamic and growing team of specialists supported by an exciting $60 million Penn State investment in advanced research cyberinfrastructure (ICS-ACI). Penn State’s elite researchers use our high-performance research cloud to solve real-world problems by conducting simulations, data mining, and other high-performance computing operations. You will participate as an important member of our community of specialists to support this ground-breaking work.

As our Lead HPC Systems Operations Engineer, you will work as part of a team to:
  • Lead the operations of researcher-focused HPC systems, overseeing a team of systems administrators and engineers
  • Define enhancements and provide engineered system solutions for HPC operations
  • Identify requirements and lead implementation projects to improve HPC operations and researcher utilization
  • Deliver HPC systems documentation and compliance to support research and education
  • Investigate complex HPC system problems and/or user issues and lead innovative solution approaches

This job will be filled as a level 3 or level 4, depending upon the successful candidate's competencies, education, and experience. A level 3 typically requires a Bachelor's degree or higher in an Engineering or Science discipline (Master's degree preferred) plus five years of related experience, or an equivalent combination of education and experience. Additional experience and/or education and competencies are required for higher level jobs.

Desired skills include:
  • Knowledge and operation of cyberinfrastructure, including large-scale, multi-user compute clusters; high-speed networks (e.g. InfiniBand); parallel file systems (e.g. GPFS); and cluster resource managers and schedulers (e.g. Moab, PBS, SLURM)
  • Experience with Linux operating systems (e.g. RHEL); monitoring tools (e.g. Nagios, SolarWinds); automated configuration management (e.g. Puppet); and scripting languages (e.g. Python, bash)
  • Application of accepted engineering practices that enable the design, development, implementation, and analysis of engineered systems, software, and interconnects
  • Ability to explain concepts to users with varied HPC experience
  • Strong interpersonal skills and the ability to work well in a team environment

Experience with the following is a plus: cloud computing platforms, such as OpenStack; distributed Windows computing infrastructure; network-based services (e.g. DNS, LDAP, NFS); virtualization technologies and concepts (e.g. VMware); software installation and maintenance in a multi-user Linux environment.

Training, education, and professional development opportunities are available and encouraged. To learn more about working for ICS, please visit http://ics.psu.edu/careers.

These salary bands have been established to provide salary guidelines for staff positions.

Salary Band   Minimum    Midpoint   Maximum
A             $16,584    $24,456    $32,328
B             $18,240    $26,904    $35,556
C             $19,728    $29,592    $39,456
D             $21,708    $32,568    $43,416
E             $24,312    $36,468    $48,612
F             $27,228    $40,848    $54,456
G             $30,012    $45,744    $61,500
H             $34,188    $52,140    $70,080
I             $38,988    $59,424    $79,908
J             $43,716    $67,740    $91,812
K             $50,712    $78,600    $106,488
L             $58,836    $91,176    $123,528
M             $68,232    $105,756   $143,292
N             $80,508    $124,788   $169,068
O             $93,492    $147,252   $201,024
P             $110,340   $173,760   $237,192
Q             $126,396   $199,056   $271,728
R             $151,668   $238,872   $326,088