Principal Research Software Engineer - Khoury College ...
Northeastern University - Boston, MA
Job Description
About the Opportunity
We are seeking a highly skilled Principal Research Software Engineer with experience in Machine Learning and Large Language Model interpretability research methods to assist in developing the National Deep Inference Fabric, an open-source research computing infrastructure project for deep learning interpretability. You will be responsible for full-stack development, doing both back-end and front-end software development to help create a robust, high-throughput, highly usable, and flexible multi-tenant AI inference service to enable research nationwide. Day-to-day activities include solving the security, stability, integration, and performance issues involved in providing a large-scale research inference service for open-source AI models. We are looking for someone who can implement state-of-the-art parallel GPU inference methods and incorporate them into a system with job scheduling, routing, quota management, authentication, authorization, and telemetry to create a high-performance computing infrastructure. This person should be an expert in Python and the internals of PyTorch, along with Unix/Linux service development, HPC/cloud environments, and all other aspects of the software development life cycle.

Responsibilities:
Design, build, and maintain a high-performance backend compute infrastructure to handle large volumes of traffic and throughput requests from researchers.
Address software and hardware issues related to security, compliance, privacy, reliability, scalability, performance, load testing, and load balancing.
Independently implement critical components such as scheduling, routing, quota management, authentication, authorization, log analysis, and telemetry for efficient backend operations.
Lead and actively participate in full-stack web development activities, encompassing both front-end and back-end tasks.
Implement and manage a robust CI/CD pipeline, following Agile methodology, to facilitate the seamless and efficient deployment of code into production. This includes actively participating in bug/feature triaging and prioritization processes and meticulously documenting release notes. Additionally, conduct thorough testing of software to ensure code quality and reliability before promoting changes to the production environment.
Implement changes, enhancements, and optimizations to ensure a seamless and user-friendly experience for researchers interacting with the platform.
Apply deep experience in Machine Learning and Large Language Model interpretability research methods to contribute to the ongoing research computing infrastructure project.
Collaborate with researchers to understand their requirements and integrate different open-source machine learning models into the platform effectively.
Stay abreast of advancements in the field of machine learning, interpretability research, and PyTorch to ensure the platform aligns with the latest industry standards and practices.

Qualifications:
Six or more years of experience as a Staff Full-Stack Software Engineer or Research/Staff Scientist.
Master's degree (or higher) in artificial intelligence, computer science, or equivalent.
Knowledge of Machine Learning methods, including Transformer Language Models.
Excellent software development skills, including expertise with PyTorch and Python internals.
Strong written and verbal communication skills; a team player who can think and work independently.

Documents to Submit:
CV and cover letter

About Khoury College:
Khoury College is committed to building a diverse and inclusive community among students, faculty, and staff. We strongly encourage applicants from underrepresented groups, individuals with disabilities, veterans, and those with diverse experiences and backgrounds.
We invite you to join a fast-moving, ambitious college with an underlying mission that is driven by our motto of "CS for Everyone."

Position Type
Research

Additional Information
Northeastern University considers factors such as candidate work experience, education, and skills when extending an offer. Northeastern has a comprehensive benefits package for benefit-eligible employees. This includes medical, vision, dental, paid time off, tuition assistance, wellness and life, and retirement, as well as commuting and transportation. Visit for more information.

Northeastern University is an equal opportunity employer, seeking to recruit and support a broadly diverse community of faculty and staff. Northeastern values and celebrates diversity in all its forms and strives to foster an inclusive culture built on respect that affirms inter-group relations and builds cohesion. All qualified applicants are encouraged to apply and will receive consideration for employment without regard to race, religion, color, national origin, age, sex, sexual orientation, disability status, or any other characteristic protected by applicable law. To learn more about Northeastern University's commitment and support of diversity and inclusion, please see
Created: 2024-11-05