FAS Research Computing (FASRC) is seeking a Systems Engineer to put their Linux skills to work building, supporting, and maintaining a range of research computing systems (storage, cluster computing, network, database, virtualized systems, etc.) to meet the bleeding-edge computational needs of faculty and researchers.
The system engineer will participate in group projects such as building large Lustre-based storage systems, rearchitecting/redesigning client storage solutions, working with networking staff and compute cluster management to update Infiniband connectivity to current technology, and working with Software Engineering and client support teams to facilitate projects.
Department Description:
FAS Research Computing continues to evolve, expand services, and support its leading research faculty and their collaborators around the world. These services include maintaining a Top 100 academic high-performance computing cluster, cloud computing, storage, databases, instrumentation core facility workstations, and other development platforms. We directly engage with researchers through help requests, monitoring, office hours, training, and in-depth consultations. Research Computing has numerous other successful collaborations, including building the MGHPCC (http://www.mghpcc.org/) in Holyoke, MA, with leading partner universities. Research Computing at Harvard has a proven track record of building partnerships to accelerate research and collaboration.
This is a full-time position with flexible hours and a hybrid in-person/remote work schedule option to be agreed upon at hire. The selected candidate will periodically need to be on campus as business needs require. All remote work must be performed in a state where Harvard is registered to do business (CA, CT, MA, MD, ME, NH, NY, RI, and VT).
Occasional travel to Boston and Holyoke data centers.
Basic Qualifications
Minimum of five years’ post-secondary education or relevant work experience
Additional Qualifications and Skills
Two years minimum experience in deploying and managing data center class Linux systems (e.g., storage, cluster computing, network, database, virtualized systems);
Broad knowledge of and experience in the deployment and management of Linux systems (e.g., storage, cluster computing, network, database, virtualized systems);
Experience with configuration management (e.g., Puppet, Ansible) and version control (e.g., GitLab);
Experience in managing database systems;
Basic knowledge of TCP/IP networking protocols;
Demonstrated ability to be productive in team environments, with a service mindset approach, and the ability to act as a trusted advisor.