The Rockefeller University, the world’s premier research university, seeks a DevOps Engineer/ Senior Systems Administrator to join the Information Technology group.
As the DevOps Engineer/ Senior Systems Administrator will develop and evangelize the Department's adoption of ansible-based DevOps best practices for a broad spectrum of Lunix System Administration and scientific computing tasks.
You will also:
participate in the Department's adoption of DevOps best practices for a broad spectrum of Linux System Administration and scientific computing tasks
advocate a DevOps approach to systems administration and product development, demonstrating best practice by example
assist in defining and maintaining a configuration management system based on Ansible
participate in the implementation and management of a CI/CD environment, and automate provisioning, deployment, configuration management, monitoring and all other infrastructure and ops activities
participate in all aspects of the planning, evaluation, selection, installation, and maintenance of data storage technologies and data lifecycle management applications, including petascale enterprise-class storage technologies (e.g. OneFS, GPFS, VNX) in a multi-protocol environment and enterprise backup applications (e.g. Networker(DLP), Amanda, Avamar)
assist in the planning and implementation of the Department's real time performance monitoring systems (including Nagios, Splunk and influxDB), maintain and continuously expand a library of scientific and numerical software, including open-source and commercial applications, with a particular emphasis on reproducibility of scientific workflows, collaborate with laboratories and core facilities, design, implement, and perform on-going maintenance of data and compute-intensive systems, software pipelines, and infrastructure, and design, implement, and support data collection, analysis, processing, visualization, dissemination, and long-term archiving solutions
Bachelor's degree or equivalent experience in computer science, applied mathematics, engineering or a related technical field required; graduate degree preferred
Experience with large scale monitoring and reporting tools (e.g. Nagios, OMD, ELK, Splunk, New Relic, Grafana), collaborative TDD/BDD approach and automated test tools (e.g. Cucumber, Selenium), and at least one configuration management system (Ansible, Puppet, Chef, SaltStack, or similar), required
Experience in an academic computer environment or scientific laboratory highly desired
Must have fluency and a minimum of 5 years of demonstrated experience with the Linux operating system, some experience working in an Agile environment, familiarity with virtualization technologies and related tools (e.g. Vagrant, VirtualBox, VMware, KVM, Packer.io), and experience in software development and/or DevOps/SRE practices, including proficiency with Git and CI/CD tools/workflows
Strong shell scripting and programming skills (two or more of BASH, Python, Ruby, Perl, or similar) for systems administration purposes necessary. Basic familiarity with networking principles and protocols (e.g. TCP/IP, SSH, HTTP[S], DNS, DHCP, NAT, SNMP, PXE), experience building and installing open-source software with complex dependencies, and working knowledge of computer hardware (desktops, servers, storage, networking, and display), as well as desktop operating systems (Windows and Max OS X), required
Must have the ability to organize and coordinate both short and long term projects, and independently prioritize based on overall objectives of multiple groups. Proven ability and passion for continuous learning, ability to critically evaluate new technologies, and strong analytical and troubleshooting skills required
Familiarity with provisioning and container technologies (e.g. Docker, OpenStack, Kubernetes) and with X windows, VNC, and other remote visualization technologies preferred. Some familiarity with commercial cloud infrastructures (e.g. Amazon, Azure, Google Compute Engine) and working knowledge of web based application development and hosting (e.g. Rails, Angular, NodeJS), including common protocols aused on the web, and common web hosting technologies (e.g. Apache, Nginx) highly desired. Must be an excellent team player and possess a positive, constructive approach with an emphasis on collaboration and good execution
How to Apply
To apply to this job, please visit the URL below and apply to job code ‘IRC20127’.
The Rockefeller University is an Equal Opportunity Employer -Minorities/Women/Disabled/Veterans
Additional Salary Information: Commensurate with experience.