Senior Linux IT Specialist
Addison Group - Houston, TX
Apply NowJob Description
Senior Linux IT Specialist Location: Houston, TX Salary: $90k-110k We are seeking a highly experienced and skilled Senior Linux IT Specialist to join our IT team. This role will play a vital part in contributing to the global HPC and Digital Platform team, ensuring seamless HPC and Cloud infrastructure performance. The successful candidate will have a proven track record in Linux administration, with a strong understanding of system administration, troubleshooting, and IT service management. Key Responsibilities: Install, configure, maintain, and repair Linux-based hardware and software, ensuring high functionality, performance, and reliability. Organize and schedule upgrades, perform routine maintenance, and ensure system security and privacy compliance. Provide expert-level technical support to users and junior team members, including mentoring and training to enhance skills and knowledge. Respond to support tickets, troubleshoot complex issues, and resolve system crashes, performance degradation, and security breaches. Monitor systems and services through performance monitoring, log analysis, and proactive issue detection. Continuously improve services through automation, optimization, and standardization, while staying updated on technology trends and best practices. Configure and manage SLURM job queues, troubleshoot job scheduling issues, and support high-performance computing environments. Collaborate with cross-functional teams to align IT services with user and business needs and ensure compliance with company IT policies and standards. Develop and maintain technical documentation, including system diagrams, configuration files, and troubleshooting guides, to facilitate knowledge sharing and operational continuity. Demonstrate strong project and time management skills to prioritize tasks effectively in a dynamic environment. Support in-house software applications, adhering to organizational standards and procedures. Skills & Competencies: Essential: 5+ years of experience in Linux administration, preferably in an HPC environment. Strong understanding of system administration, troubleshooting, and IT service management. Experience with automation/configuration management using either Puppet, Chef, Salt, Ansible, Gitlab, or an equivalent. Ability to use a wide variety of open-source technologies and cloud services. Experience with Docker and container orchestration. Familiarity with code and script (Bash, Python, Perl); shell scripting. Excellent troubleshooting and problem-solving skills. Desirable: Experience with DevOps methodologies and tools such as OpenStack, Kubernetes, CI/CD, etc. Experience in cloud administration, virtualization, and hardware maintenance (Storage/CPU/GPU). Certifications like CCNA or CompTIA Network+ are a plus. ITIL Foundation level certification. Knowledge of GPUs. Experience with High Performance Computing (HPC) and clustering technology (object storage, parallel file systems, RAID storage). Understanding of networks, RAID, and tape subsystems. Experience in a high-volume critical production service environment. Virtualization knowledge and experience. Qualifications and Experience: Bachelor's degree in IT, Computer Science, Computer Engineering, or a related field (or equivalent work experience). 5+ years of extensive experience in Linux administration, preferably in an HPC environment. Experience in scripting and automation, cloud technologies, and DevOps practices is preferred. Knowledge of internet security and data privacy principles. Excellent communications, presentation, and customer service skills, and must have an outstanding track record of meeting customer expectations. Must be detail-oriented and work well in a team environment. Must have legal right to live and work in the United States. #J-18808-Ljbffr
Created: 2025-02-01