HPC Administrator II
Date: Mar 19, 2023
Location: Savannah, GA, US
Company: Gulfstream Aerospace Corporation
HPC Administrator II in GAC Savannah
Unique Skills:
HPC cluster support and design
Education and Experience Requirements
Job Description
- Assume responsibility for the day-to-day operations of Gulfstream's production HPC cluster.
- Troubleshoot and maintain the InfiniBand and Ethernet networks.
- Understand, maintain, and support high performance parallel storage systems.
- Assist end users running applications on the HPC cluster.
- Provide third-level support for end users who experience problems on engineering workstations and remote visualization systems.
- Assist with the mentoring and professional development of junior level HPC Administrators.
- Manage, maintain, monitor, and control interactive and batch processes, both scheduled and unscheduled (including on-request processing).
- Complete engineering-defined batch processing and backups in the correct sequence and within the established time periods.
- Continually suggest improvements to processing capability and efficiency through system tuning and other hardware and software optimizations.
- Perform regular monitoring of utilization needs and efficiencies, and report regularly on tuning initiatives.
- Perform proactive failure trend analysis and root cause analysis for all system failures.
- Produce trend reports to highlight production issues and follow predetermined action and escalation procedures when issues are encountered.
- Monitor, verify, and make appropriate adjustments to support proper application execution.
- Provide technical solutions that meet the performance and processing objectives of the business areas.
- Perform upgrades thoroughly and accurately, in compliance with corporate policies and industry best practices.
- Provide leadership to junior level HPC Administrators during system upgrades and outages.
- Maintain technical relationships with multiple hardware and software vendors.
- Work multiple operational windows as required to support business objectives.
- Provide on-call support 24x7.
- Develop and implement technical standards, hardware standards, and software standards.
- Experience managing InfiniBand-based Linux HPC clusters and high performance parallel storage, and configuring and managing cluster scheduling software.
- Experience managing High Performance Computing low-latency, high-bandwidth interconnects.
- Experience supporting Linux-based scientific workstations running visualization applications.
Additional Information
Requisition Number: 208390
Category: Information Systems
Percentage of Travel: Up to 25%
Shift: First
Employment Type: Full-time
Posting End Date: 02/06/2023
Equal Opportunity Employer/Veterans/Disabled.
Gulfstream does not provide work visa sponsorship for this position, unless the applicant is a currently sponsored Gulfstream employee.
Copyright © 2020 Gulfstream Aerospace Corporation. All Rights Reserved. A General Dynamics Company.
Gulfstream Aerospace Corporation, a wholly owned subsidiary of General Dynamics (NYSE: GD), designs, develops, manufactures, markets, services, and supports the world's most technologically advanced business jet aircraft.
Nearest Major Market: Savannah
Job Segment:
Computer Science, Engineer, Linux, Aerospace, Information Systems, Technology, Engineering, Aviation