Senior Site Reliability Engineer
Huxley - boston, MA
Apply NowJob Description
We are hiring a highly experienced Site Reliability Engineer to join a Boston-based startup! Led by industry-leading executives working on building the future of technology, You'll join an international team responsible for the development, maintenance, evolution, and operation of cloud-service infrastructure for an organization and its enterprise clients. This role involves cross-functional collaboration with customers, developers, the product team, and other stakeholders. Responsibilities Maintain the health of cloud-based development and production environments through monitoring and daily administrative tasks. Manage CDNs, DNS, web servers, and other supporting Internet services. Ensure best practices for reliability, fault tolerance, availability, latency, performance, efficiency, monitoring, emergency response, and capacity planning. Implement security practices such as npm audit, encryption, and threat modeling. Be on-call and respond to issues identified by alerts and reported incidents. Automate infrastructure and operational activities by anticipating failures and coding appropriate responses. Develop tools to balance access needs without being overly broad. Adapt to various cloud environments and perform custom work appropriate to the scale of the problem, keeping up to date with changes and assisting other teams in understanding new opportunities. Understand the requirements and metrics needed to achieve performance goals while finding cost-effective ways to deliver performance. Build systems to monitor evolution, operational levels, and the potential costs and benefits of different solutions. Find cost-effective methods to improve availability and assess performance issues. Collaborate with developers to deliver software releases, configuration updates, and other release requirements while developing self-service tools. Determine appropriate approaches to networking problems and build in resiliency. Monitor capacity management by anticipating upcoming issues and focusing on design portability to sidestep problems as needed. Ensure delivery focuses on continuous integration, testing, reliability, chaos engineering, micro-services, containers, orchestration, and cluster management. About You You are a problem solver who enjoys team collaboration and proactively anticipates customer needs. You are familiar with the software development life cycle and comfortable with tools such as source control and setting up a development environment. You have experience with systems engineering at scale, understanding how things fail, determining data/configuration placement, and evaluating cost-effectiveness. You are an experienced programmer who solves problems using code, makes pragmatic language selection decisions, follows local styles when fixing or extending components, and writes readable, maintainable code. You excel at selecting tools to solve problems and making that tooling adaptable to change. Please note this is a hybrid on-site/remote role based in Boston, MA. EOE Statement: Specialist Staffing Group is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status. To find out more about Huxley, please visit
Created: 2024-10-19