Sr. Manager, Production Support
Charles Schwab Corporation - Westlake, TX
Apply NowJob Description
Your Opportunity At Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us "challenge the status quo" and transform the finance industry together. As a Sr. Manager, Production Support, you will be leading and mentoring a team of Site Reliability Engineers (SRE) while fostering a culture of continuous improvement and innovation focused on reliability, performance, automation, and operational support. This role will also involve collaborating with cross-functional teams to ensure alignment on reliability and performance goals as well as staying current on industry trends and best practices to ensure our systems and processes remain in line with SRE tenets. Essential Functions * Lead the team in their SRE maturity journey and overall individual performance management. * Oversee Production Engineering efforts to ensure systems are designed for operational excellence and reliability. * Provide leadership in incident management and root cause analysis to resolve production issues and prevent recurrence. * Establish and maintain operational support practices, including monitoring, alerting, and incident response. * Conduct post-mortem reviews to identify areas for improvement and implement solutions to enhance system reliability. * Implement and promote performance engineering practices to ensure optimal system performance. * Develop and execute strategies for destructive testing to identify potential points of failure and improve system resilience. What you have Required Qualifications * 8+ years of experience leading a Production Support/SRE team responsible for enterprise applications, infrastructure, and systems. * 8+ years of experience in measuring, tracking, improving, and reporting on SLO/SLA's/KPI's. * 5+ years of experience working Enterprise ITSM Business Processes. * Availability for after-hours calls/incident management. * ITIL v. 4 Experience with Enterprise Systems that includes but not limited to: * Event and Incident Management * Release and deployment * Enterprise Change Management experience * Experience managing multi-shift based teams. * Recent experience leading an Operations organization that focuses on event and incident management. * Experience monitoring tools with a focus on ITIL capabilities. * Experience with GitHub, Bamboo, Bitbucket, Splunk, ThousandEyes, and AppDynamics.
Created: 2024-10-13