SRE (Software Engineer)
Apex Systems - Dearborn, MI
Job#: 2059290

Job Description:
Location: Hybrid to SE MI or Palo Alto, CA
Duration: 12+ month contract

Description:
We are seeking a talented Full Stack / Site Reliability Engineer to play a key role in developing a comprehensive Internal Developer Platform (IDP) that includes CI/CD pipelines, managed infrastructure, observability, and a developer portal. The Bedrock and Customer Success and SRE team is responsible for ensuring that our customers derive maximum value from our platform. This team acts as the primary point of contact for customers, helping them onboard, adopt, and optimize their use of our platform offerings. This team also works to ensure the stability of the platform that hosts the cloud applications that power our customers' connected vehicle experiences.

Responsibilities:
- Strong background in software development and systems administration, as well as excellent problem-solving and communication skills.
- Run a production environment by monitoring availability and taking a holistic view of system health.
- Develop, improve, and operate the deployment and orchestration of a complex distributed system
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
- Provide primary operational and engineering support for multiple large, distributed software applications
- Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation
- Collaborate with development teams to design, build, and operate scalable and resilient software systems
- Automate build, deployment, monitoring, and incident response processes
- Perform root cause analysis of production incidents and implement preventive measures
- Conduct performance analysis and optimization of the system
- Ensure compliance with security and regulatory standards
- Implement and maintain disaster recovery processes
- Provide technical guidance and mentorship to other team members
- Participate in an on-call rotation for incident response and support

Qualifications:
- 4-year college degree in Computer Science or equivalent experience
- 5-6 years' experience with Golang, Java, NoSQL/SQL Datastore, Spring Boot, Google Cloud Platform/AWS/Azure, and Docker/K8s in the maintenance and development of multi-tier applications
- Understanding of gRPC and RESTful APIs and microservices platforms
- 4-5 years of experience with APM and other monitoring tools such as Grafana Cloud, Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog, and PagerDuty
- Strong experience working with product and development teams to establish error budgets by identifying the right SLOs (service level objectives), SLIs (service level indicators), and KPIs (key performance indicators), and effectively driving the use of the budget to ensure maximum domain availability/uptime
- Regularly review key site technical metrics such as transaction errors, logging, response times, caching strategies, conversion/bounce rates, and capacity and resource utilization
- Proactively identify stability risks and work with engineering leadership to establish appropriate mitigation plans
- Experience solving complex architecture/design and business problems, working to simplify, optimize, and remove bottlenecks
- Architect, design, and develop automation to reduce toil and improve the recoverability, availability, latency, and scalability of supported applications, with an understanding of MTTD (Mean Time to Detection) and MTTR (Mean Time to Resolution)
- Maintain a knowledge repository that includes standard operating procedures, release checklists, and runbooks for incident recovery

EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at or .

Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning.
We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico.
Created: 2025-03-11