Sysman

From NJIT-ARCS HPC Wiki
Jump to: navigation, search


ARCS Systems Manager, Level XXX

Responsibilities

  • Manage public-access and research laboratories of Linux workstations
  • Provide installation and maintenance of Linux and MacOS desktops and laptops
  • Install and configure hardware, troubleshoot hardware problems
  • Provide AFS and Kerberos support for Linux and MacOS workstations and laptops
  • Write and maintain scripts to automate system administration functions
  • Act as backup to for the datbases supported by ARCS - e.g., MySQL, Oracle, MongoDB
  • Ensure that all systems are fully patched
  • Monitor for file system intrusion (intrusion detection)
  • Monitor security advisories for operating system and infrastructure software, and take appropriate actions to safeguard resources
  • Respond to alerts form system monitoring tools
  • Perform day-to-day operational procedures to keep infrastructure running in a reliable and secure fashion
  • Upgrade firmware as needed
  • Support users, in-person or remotely
  • Restore files from backups as needed
  • Manage and/or resolve service tickets
  • Team with system administrators on troubleshooting of desktops, laptops, servers, and virtualization infrastructure
  • Provide accurate, comprehensive and timely documentation, as well as change and problem management information
  • Interact with hardware and software vendors, maintenance vendors, students, faculty and staff
  • Participate in the installation and testing of new infrastructure and software
  • Provide on-call support for infrastructure platform production issues

Qualifications

Essential Qualifications

  • Bachelor's degree in Computer Science or related field with 2-5 years of experience, or 5-10 years of relevant experience in lieu of a bachelor's degree
  • Experience in the installation, maintenance, operation, tuning, and troubleshooting of Linux and related systems and software
  • Experience with programming and shell scripting languages
  • Experience with system automation tools such as Ansible, Puppet, or Chef
  • Ability to manage time effectively, working both independently and as part of a team; ability to focus on priority issues
  • Ability to complete detailed technical work based on both documentation and information provided by others
  • Enthusiasm for learning new skills and adapting to a dynamic environment
  • Strong interpersonal skills, enthusiasm for customer service, and the ability to work with students, staff, and faculty from diverse backgrounds
  • Excellent written and oral communication skills

Preferred Qualifications

  • Experience with at least one high-level programming language
  • Experiemce with systems automation using scripting
  • Experience using open source software tools (e.g., Nagios, Munin)