HPC Engineer/Architect
Net2Source Inc. - new york city, NY
Apply NowJob Description
Net2Source Inc. is an award-winning total workforce solutions company recognized by Staffing Industry Analysts for our accelerated growth of 300% in the last 3 years with over 5500+ employees globally, with over 30+ locations in the US and global operations in 32 countries. We believe in providing staffing solutions to address the current talent gap - Right Talent - Right Time - Right Place - Right Price and acting as a Career Coach to our consultants.Job title: HPC EngineerArchitectJob Location: New York, NY 10065 (Hybrid)Contract Job Summary:You will support day-to-day operations of large-scale parallel file systems, deploy and maintain Linux HPC infrastructure across multiple data centers, and assist HPC engineers and architects with day-to-day operations and tickets.• Support day-to-day operations of large-scale parallel file systems• Deploy and Maintain Linux HPC infrastructure across multiple datacenters• Assist HPC engineers and architects with day-to-day operations and ticketsRequired Skills:• Linux Operating Systems (RHELCentOS), Parallel file system (GPFS), Job Scheduler LSFSlurm• Ansible, Python, Shell scripting• GPU-based compute infrastructure (including CUDA)• CentOS 4.5• HPCCResponsibilities:• Design, architect and oversee implementation of Linux based HPC clusters and storage• Deploy physical hardware using HPC deployment tools and configuration and orchestration tools (Ansible)• Parallel file system (GPFS) performance tuning, monitoring and troubleshooting• Perform systems benchmarking, and developing automated tests for the HPC environment, ensuring the reliability and efficiency of our computational infrastructure• InfiniBand network maintenance and troubleshooting• Automate and monitor the HPC user lifecycle process• Slurm installation, configuration, performance tuning and troubleshooting• Plan, design and implement a transition from the LSF scheduler to Slurm• Manage the Slurm scheduler and translate Research policies into scheduler configurations• Consult with faculty and students to develop research pipelines for use on the HPC cluster• Develop and maintain user lifecycle software suite in Python, implement CICD pipeline• Test and automate upgrades of critical system applications using Ansible and shell scripts.• The ability to communicate effectively with clinicians, researchers, and other team members to develop technological solutions is keyQualifications:• Experience working in large-scale research based HPC environment• Proven experience working with distributed file storage solutions (i.e., GPFS)• Experience with deploying and troubleshooting Linux Operating Systems (RHELCentOS)• Experience with Scripting and Automation (Ansible, Python, Shell Scripting)• Solid understanding of job schedulers (LSFSLURM)• Experience with GPU-based compute infrastructure (including CUDA)Why work with us - At Net2Source, we believe everyone has an opportunity to lead. We see the importance of your perspective and your ability to create value. We want you to fit in"”with an inclusive culture, focus on work-life fit and well-being, and a supportive, connected environment; but we also want you to stand out"”with opportunities to have a strategic impact, innovate, and take necessary steps to make your mark. We help clients with new skilling, talent strategy, leadership development, employee experience, transformational change management and beyond.Equal Employment Opportunity Statement:Net2Source is an Equal Opportunity Employer. We believe that no one should be discriminated against because of their differences, such as age, disability, ethnicity, gender, gender identity and expression, religion or sexual orientation. Our rich diversity makes us more innovative, more competitive, and more creative, which helps us better serve our clients and our communities. All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law.Awards and Accolades:• America's Most Honored Businesses (Top 10%)• Awarded by USPAAC for Fastest Growing Business in the US• 12th Fastest Growing Staffing Company in USA by Staffing industry Analysts in the US (2020, 2019, 2020)• Fastest 50 by NJ Biz (2020, 2019, 2020)• INC 5000 Fastest growing for 8 consecutive years in a row (only 1.26% companies make it to this list)• Top 100 by Dallas Business Journal (2020 and 2019)• Proven Supplier of the Year by Workforce Logiq (2020 and 2019)• 2019 Spirit of Alliance Award by Agile1• 2018 Best of the Best Platinum Award by Agile1• 2018 TechServe Alliance Excellence Awards Winner• 2017 Best of the Best Gold Award by Agile1(Act1 Group)Thanks & RegardsAbhishek KumarSr. Technical
Created: 2025-02-20