Site Reliability Engineer (Application SRE)
The Dignify Solutions LLC - malvern, PA
Apply NowJob Description
Demonstrable experience on Java (JDK 8+) and Microservices architecture Hand-on experience in AWS services (EC2, ECS, S3, Cloud Formation template, Aurora DB, Dynamo DB. Lambda, SQS, SNS, RDS, API Gateway, VPC, Route 53, Kinesis, Cloudwatch AWS SDK) Experience with monitoring and testing subsystems with Splunk, Honeycomb, Open Telemtry and Grafan Experience with UI development tools (Angular and Node JS) 5+ years of direct implementation of AWS Architecture solutions Programming experience of either Shell, PowerShell, Python, Java or Scal 4+ years of hands-on experience in cloud-native Architecture design, implementation of distributed fault tolerant enterprise application for cloud Preferably AWS Certified Solution Architect - Professional Responsibilities: Map an applications deployment architecture including cloud infrastructure and dependencies Experience with Chaos testing scenarios (using Gremlin preferably) bility to identify Failure Modes with end to end journeys (across UI, authentication layer, Application code, 3rd party systems, Databases, Data, Capacity , Infrastructure , Firewall and Network) Integrate SRE practices into Incident Management and Change Deployment process Implementation of SRE practices inline with AWS security best practices and Well Architected Frameworks Develop and Maintain SRE runbooks Understand and share resiliency architectures Strong understanding of SLO, SLI, Error Budgets and their implementation into SRE areas
Created: 2024-07-07