Principal Site Reliability Engineer (Advanced Threat ...
Palo Alto Networks - Santa Clara, CA
Job Description
Our Mission
At Palo Alto Networks® everything starts and ends with our mission:
Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are
We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contribute to our collective success. Our values were crowdsourced by employees and are brought to life through each of us every day - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career
We are looking for an exceptional Principal Site Reliability Engineer to enhance our ATP Infra team.
This role will work on producing mission-critical platforms, tools, and processes that will ensure the highest levels of availability and reliability of all our applications. We need creative and innovative problem solvers who can partner with our developers and researchers to make the services more usable. The ideal candidate will possess a deep understanding of cloud infrastructure, particularly within the Google Cloud Platform (GCP), and have a proactive approach to exploring new tools/frameworks to elevate our infrastructure automation, stability and scalability.

Your Impact
- Write automation code for provisioning and operating infrastructure at massive scale
- Design, build and operate cloud infrastructure to enable reliable and rapid deployment of microservices with effective monitoring and resilient operations
- Work with development teams to make sure the applications are production ready, scalable and reliable from the ground up
- Identify and drive opportunities to improve automation for code deployment, management, and visibility of application services
- Develop tools and frameworks to automate operational tasks and the deployment of machines, services, and applications
- Establish end-to-end monitoring and alerting on all critical components of the application
- Participate in the on-call rotation supporting the platform and/or the production application
- Direct root cause analysis of critical business and production issues
- Develop and mentor other SREs on standard methodology, from infrastructure orchestration to troubleshooting application services in production
- Represent SRE in design reviews and work cross-functionally with Engineering teams on operational readiness

Qualifications

Your Experience
- BS or MS in Computer Science, a related field, or equivalent professional experience or equivalent military experience required
- Expertise in configuration management with a framework such as Terraform, Ansible, or Helm
- Strong experience with Kubernetes
- Strong Linux administration, internals, and network troubleshooting
- Expertise in Google Cloud Platform (GCP) and resource management/operations on its related services
- Proficiency with a programming language like Python and shell scripting to automate tasks
- Strong experience with CI/CD pipelines, GitHub, Jenkins, and Artifactory
- Strong experience with metrics and monitoring tools such as Grafana and Prometheus
- Ability to diagnose and troubleshoot complex distributed systems handling high-volume transactions
- Strong fundamentals in API gateways, including Nginx or Envoy
- Experience with cloud infrastructure and its performance and cost optimization
- Experience with AWS is a big plus
- Excellent interpersonal skills and the ability to work well in a team
- Passion for quickly learning, understanding, and dissecting new technology stacks independently
- Experience building and managing large relational database clusters (MySQL/Percona, etc.) is a plus

Additional Information

The Team
Our engineering team is at the core of our products - connected directly to the mission of preventing cyberattacks. We are constantly innovating - challenging the way we, and the industry, think about cybersecurity. Our engineers don't shy away from building products to solve problems no one has pursued before.

We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.

Compensation Disclosure
The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $147,000 - $225,000/YR. The offered compensation may also include restricted stock units and a bonus.
A description of our employee benefits may be found here.

Our Commitment
We're problem solvers that take risks and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.
Created: 2024-11-06