DIRECTOR- DEPARTMENTAL COMPUTING
University of Washington - seattle, WA
Apply NowJob Description
If you are using a screen reader and experience any difficulty accessing our web pages, please call or email UWHires and we will be happy to assist you. Benefits: As a UW employee, you will enjoy generous benefits and work/life programs. For a complete description of our benefits for this position, please visit our website, click here. As a UW employee, you have a unique opportunity to change lives on our campuses, in our state and around the world. UW employees offer their boundless energy, creative problem-solving skills, and dedication to build stronger minds and a healthier world. The Department of Biochemistry has an outstanding opportunity for a Director-Departmental Computing to join their team. POSITION PURPOSE Located at the University of Washington in Seattle, the Institute for Protein Design is a unique interdisciplinary environment where world-class researchers create new biomolecules. Our mission is to create proteins that solve modern challenges in medicine, technology, and sustainability by leveraging computational and AI tools for protein design. As such the computational infrastructure of the institute is critical to the overall success of the IPD. The Director of Computing at the Institute for Protein Design (IPD) is a key leadership role within the institute and will manage a team of Professional Staff in IT that will oversee the smooth running of the IPD's high performance compute infrastructure. They will have responsibility for planning and managing the computing infrastructure, ensuring cyber security, developing internal software tools for the institute, and ensuring that they remain at the cutting edge. POSITION COMPLEXITIES The significant expansion of IPD's computational infrastructure necessitates a dedicated team to ensure that the computational infrastructure of the IPD is maintained as state-of-the-art, as it is of central importance to the success of the IPD as a whole. The IPD has ~350 research and students spread across five sites on campus with dry-lab office and wet-lab spaces, plus a presence in two data centers where it co-locates all of its computing equipment. Over the past three years, IPD's computational infrastructure has grown from 100 to 1000 GPUs, representing an overall investment in excess of $4.5M, exceeding other computational resources within the UW. DUTIES AND RESPONSIBILITIES Compute Infrastructure (50%) High performance computing (HPC) cluster Growth strategy and capacity planning for the on-premise HPC cluster with currently ~14,000 CPU-cores and ~1,000 GPUs located in two data centers. Selection and requisition of hardware and its lifecycle management, including compute nodes, file servers, and networking hardware. Selection and configuration of software packages, both for IT and scientific software, with particular emphasis on GPU-accelerated machine learning tools. Configuration, management, troubleshooting and extension of the Slurm job scheduler, for detailed resource utilization reports and bottleneck identification. Storage capacity planning, vendor selection and implementation. Performance tuning, including identifying and addressing bottlenecks. Network security Responsible for the overall security of the IPD network. Disaster recovery planning and implementation, including onsite and offsite backups. Drafting and implementing remote access policies, including the selection of applicable technology. Implementing and performing security and rate retention audits. Liaison between the IPD and other UW and non-UW HPC centers. Point of contact for various equipment vendors for compute and storage. Software development (30%) Supervise 2 FTEs that develop custom software tools for the IPD for: Infrastructure and user management. Management and processing of scientific data and lab workflows. Set up and maintain automation tools for the management of Linux workstations and servers, including cloning from custom software images. Other (20%) Directly supervise 2 FTEs that manage and maintain: About 300 lab desktop workstations across five sites on campus, including equipment procurement and lifecycle management. About 100 lab instrument computers, with help from additional undergraduate student assistants, as needed. User-facing services such as the Rosetta@home distributed computing project. Create reports and projections for the IPD leadership. Lead Responsibilities: May delegate tasks to IPD trainees as necessary and provide advice/consultation to IPD trainees and staff as needed. MINIMUM REQUIREMENTS Bachelor's Degree in Computer Science, Information Management, Engineering or related field and 10 years or more of experience in a related field. Equivalent education/experience will substitute for all minimum qualifications except when there are legal requirements, such as a license/certification/registration. ADDITIONAL REQUIREMENTS Broad knowledge of hardware and software technologies with a strong emphasis on Unix server management. Excellent understanding of networking concepts and the hardware and software level, such as IP, subnetting, routing, VLANs, DHCP, VPNs, tunnels, and firewalls. Experience with the management of an HPC compute environment. Hands-on experience with management of petabyte-scale storage systems. DESIRED QUALIFICATIONS Ph.D in Computer Science, Information Management or related field strongly preferred. Demonstrated experience with the management of heterogeneous OS environments with the Linux, Windows and OSX operating systems. Experience working with server-grade hardware, including installation and maintenance of data center hardware. Experience with ZFS, network and/or distributed file systems. Experience with automation tools such as Ansible. Experience with software development of services and web-based applications. Experience developing and using software for protein structure prediction and design. Good familiarity with scripting languages (e.g., bash or python). Interest and/or experience with forensic network analysis tools (Wireshark, tcpdump). Background in life sciences. Experience with machine learning software packages (pytorch, tensorflow). Application Process: The application process may include completion of a variety of online assessments to obtain additional information that will be used in the evaluation process. These assessments may include Work Authorization, Cover Letter and/or others. Any assessments that you need to complete will appear on your screen as soon as you select "Apply to this position". Once you begin an assessment, it must be completed at that time; if you do not complete the assessment, you will be prompted to do so the next time you access your "My Jobs" page. Committed to attracting and retaining a diverse staff, the University of Washington will honor your experiences, perspectives and unique identity. Together, our community strives to create and maintain working and learning environments that are inclusive, equitable and welcoming. To request disability accommodation in the application process, contact the Disability Services Office at or . Applicants considered for this position will be required to disclose if they are the subject of any substantiated findings or current investigations related to sexual misconduct at their current employment and past employment. Disclosure is required under Washington state law. #J-18808-Ljbffr
Created: 2024-11-05