High-Performance Computing System Administrator | Yale University - Military Veterans
at HERC - Metro New York & Southern Connecticut
Configure, deploy, and support HPC clusters to support university research. Install, administer and maintain hardware, system software, networking, accounts, and security measures to maintain performance, stability, and security. Troubleshoot and fix issues with HPC hardware. Deploy and support large-scale data storage and backup for critical research data. Diagnose and correct system issues, whether these be issues with correct operation or performance. Reinstate integrity of systems as quickly as possible following an outage in order to minimize downtime. Manage end-user accounts. Triage and solve user-submitted tickets related to HPC infrastructure. Track system health and resource usage using monitoring software, and respond to issues. Develop and maintain documentation for team members and occasionally for end users. Research developments in HPC architectures and new technologies, processes, and methodologies. Update and patch system software and firmware and software as needed to maintain performance and security. Participate in determination of specifications for new systems, and tailor these to meet research needs. Perform on-site installations and maintenance at data centers. Apply technical expertise to identify and resolving system deficiencies. Provide system services and analyze system performance for stakeholders and intended end users. Perform other duties as assigned. Required Skill/ability 1: Proven expertise with Linux operating system distributions. Required Skill/ability 2: Expertise with bash and at least one other scripting language. Demonstrated expertise with Linux system administration, including OS, networking, storage, and security. Required Skill/ability 3: Proven ability to work in team environment in fast-moving technology field. Required Skill/ability 4: Excellent verbal and writing skills. Ability to interact well with team members and end users. Ability to work independently and across units. Required Skill/ability 5: Attention to detail with the proven ability to take the care necessary to be entrusted with a system that hundreds of users depend on for research computation and the storage of research data. Preferred Education: HPC clusters, preferably with administration thereof Computational accelerators such as GPUs Cluster provisioning and management tools Batch schedulers Technology in a research environment High-speed networking, e.g., InfiniBand Large storage systems and parallel file systems such as GPFS and Lustre Server hardware component replacement Working in a data-center or server-room environment Work Week: Standard (M-F equal number of hours per day) Posting Position Title: System Administrator (HPC) University Job Title: High-Performance Computing System Administrator Preferred Education, Experience and Skills: HPC clusters, preferably with administration thereof Computational accelerators such as GPUs Cluster provisioning and management tools Batch schedulers Technology in a research environment High-speed networking, e.g., InfiniBand Large storage systems and parallel file systems such as GPFS and Lustre Server hardware component replacement Working in a data-center or server-room environment Bachelor's Degree in a related field and a minimum of four years of related work experience or an equivalent combination of education and experience.
New Haven, CT
The Higher Education Recruitment Consortium (HERC) is a national nonprofit network of higher education and affiliated employers, committed to institutional collaboration, creating diverse workplaces, and assisting dual career couples. Searching for a job in higher ed? Our job board hosts over 30,000 faculty and staff jobs at workplaces that value diversity, equity, and inclusion. Set up your job seeker account today at: http://www.hercjobs.org For our member institutions, we offer recruitment and retention resources, vibrant regional networks, and a new online community of practice, HERConnect. All of our resources can help you advance inclusive excellence at your institution.