CASY-MSCCN Jobs

CASY-MSCCN Logo

Job Information

University of Washington DIRECTOR- DEPARTMENTAL COMPUTING in Seattle, Washington

Req #: 239072

Department: BIOCHEMISTRY

Posting Date: 09/23/2024

Closing Info: Open Until Filled

Salary: $16,000 - $18,000 per month

Shift: First Shift

Notes: As a UW employee, you will enjoy generous benefits and work/life programs. For a complete description of our benefits for this position, please visit our website, click here. (https://hr.uw.edu/benefits/wp-content/uploads/sites/3/2018/02/benefits-professional-staff-librarians-academic-staff-20230701_a11y.pdf)

As a UW employee, you have a unique opportunity to change lives on our campuses, in our state and around the world. UW employees offer their boundless energy, creative problem-solving skills, and dedication to build stronger minds and a healthier world.

UW faculty and staff also enjoy outstanding benefits, professional growth opportunities and unique resources in an environment noted for diversity, intellectual excitement, artistic pursuits, and natural beauty.

The Department of Biochemistry has an outstanding opportunity for a Director-Departmental Computing to join their team. POSITION PURPOSE Located at the University of Washington in Seattle, the Institute for Protein Design is a unique interdisciplinary environment where world-class researchers create new biomolecules. Our mission is to create proteins that solve modern challenges in medicine, technology, and sustainability by leveraging computational and AI tools for protein design. As such the computational infrastructure of the institute is critical to the overall success of the IPD.

The Director of Computing at the Institute for Protein Design (IPD) is a key leadership role within the institute and will manage a team of Professional Staff in IT that will oversee the smooth running of the IPD’s high performance compute infrastructure. They will have responsibility for planning and managing the computing infrastructure, ensuring cyber security, developing internal software tools for the institute, and ensuring that they remain at the cutting edge.

POSITION COMPLEXITIES The significant expansion of IPD’s computational infrastructure necessitates a dedicated team to ensure that the computational infrastructure of the IPD is maintained as state-of-the-art, as it is of central importance to the success of the IPD as a whole.

The IPD has 350 research and students spread across five site on campus with dry-lab office and wet-lab spaces, plus a presence in two data centers where it co-locates all of its computing equipment. Over the past three years, IPD’s computational infrastructure has grown from 100 to 1000 GPUs, representing an overall investment in excess of $4.5M, exceeding other computational resources within the UW.

DUTIES AND RESPONSIBILITIES Compute Infrastructure (50%)

  • High performance computing (HPC) cluster

  • Growth strategy and capacity planning for the on-premise HPC cluster with currently 14,000 CPU-cores and 1,000 GPUs located in two data centers.

  • Selection and requisition of hardware and its lifecycle management, including compute nodes, file servers, and networking hardware.

  • Selection and configuration of software packages, both for IT and scientific software, with particular emphasis on GPU-accelerated machine learning tools.

  • Configuration, management, troubleshooting and extension of the Slurm job scheduler, for detailed resource utilization reports and bottleneck identification.

  • Storage capacity planning, vendor selection and implementation.

  • Performance tuning, including identifying and addressing bottlenecks.

  • Network security

  • Responsible for the overall security of the IPD network.

  • Disaster recovery planning and implementation, including onsite and offsite backups.

  • Drafting and implementing remote access policies, including the selection of applicable technology.

  • Implementing and performing security and rate retention audits.

  • Liaison between the IPD and other UW and non-UW HPC centers.

  • Point of contact for various equipment vendors for compute and storage.Software development (30%)

  • Supervise 2 FTEs that develop custom software tools for the IPD for:

  • Infrastructure and user management.

  • Management and processing of scientific data and lab workflows.

  • Set up and maintain automation tools for the management of Linux workstations and servers, including cloning from custom software images.Other (20%)

  • Directly supervise 2 FTEs that manage and maintain:

  • About 300 lab desktop workstations across five sites on campus, including equipment procurement and lifecycle management.

  • About 100 lab instrument computers, with help from additional undergraduate student assistants, as needed.

  • User-facing services such as the Rosetta@home distributed computing project.

  • Create reports and projections for the IPD leadership.Lead Responsibilities:

  • May delegate tasks to IPD trainees as necessary and provide advice/consultation to IPD trainees and staff as needed. MINIMUM REQUIREMENTS

  • Bachelor's Degree in Computer Science, Information Management, Engineering or related field and 10 years or more or experience in related field. Equivalent education/experience will substitute for all minimum qualifications except when there are legal requirements, such as a license/certification/registration. ADDITIONAL REQUIREMENTS

  • Broad knowledge of hardware and software technologies with a strong emphasis on Unix server management.

  • Excellent understanding of networking concepts and the hardware and software level, such as IP, subnetting, routing, VLANs, DHCP, VPNs, tunnels, and firewalls.

  • Experience with the management of an HPC compute environment.

  • Hands-on experience with management of petabyte-scale storage systems. DESIRED QUALIFICATIONS

  • Ph.D in Computer Science, Information Management or related field strongly preferred.

  • Demonstrated experience with the management of heterogeneous OS environments with the Linux, Windows and OSX operating systems.

  • Experience working with server-grade hardware, including installation and maintenance of data center hardware.

  • Experience with ZFS, network and/or distributed file systems.

  • Experience with automation tools such as Ansible.

  • Experience with software development of services and web-based applications.

  • Experience developing and using software for protein structure prediction and design.

  • Good familiarity with scripting languages (e.g., bash or python).

  • Interest and/or experience with forensic network analysis tools (Wireshark, tcpdump).

  • Background in life sciences.

  • Experience with machine learning software packages (pytorch, tensorflow). Application Process: The application process may include completion of a variety of online assessments to obtain additional information that will be used in the evaluation process. These assessments may include Work Authorization, Cover Letter and/or others. Any assessments that you need to complete will appear on your screen as soon as you select “Apply to this position”. Once you begin an assessment, it must be completed at that time; if you do not complete the assessment, you will be prompted to do so the next time you access your “My Jobs” page. If you select to take it later, it will appear on your "My Jobs" page to take when you are access ready. Please note that your application will not be reviewed, and you will not be considered for this position until all required assessments have been completed.

University of Washington is an affirmative action and equal opportunity employer. All qualified applicants will receive consideration for employment without regard to, among other things, race, religion, color, national origin, sexual orientation, gender identity, sex, age, protected veteran or disabled status, or genetic information.

DirectEmployers