Operations Engineer
IRIS Consulting Corporation - Minneapolis, MN
Apply NowJob Description
IRIS Consulting Company is a trusted leader in providing IT staffing needs to our clients. With offices in the Minneapolis/St. Paul and Atlanta metro, we have built solid business relationships with our clients in the airline, manufacturing, insurance, healthcare, and tech industries. Full SDLC support means we get to know our clients and our candidates to find not just a match, but a true fit on both sides. With over 25 years of experience, we can truly deliver.Operations Support EngineerðŸ“Location: Minneapolis, MN (Hybrid - 2-3 days per week in-office)🚫Travel: 0% client travel required📞On-Call Rotation: One week per month (off-business hours support can be remote from home)Position SummaryThe Operations Support Engineer ensures the stability, reliability, and continuous improvement of our platform through effective incident management, triage, and change management processes. This role provides 24x7 on-call support, conducts Root Cause Analyses (RCAs), and collaborates with cross-functional teams to implement necessary changes and enhancements.The engineer will generate comprehensive reports to provide insights into platform operations and work closely with client operations teams, product architecture, and platform solutions for disaster recovery planning and execution. The ideal candidate is proactive, has excellent communication skills, and is passionate about ensuring seamless platform operations.Key ResponsibilitiesIncident Management & SupportProvide 24x7 on-call support (one week per month).Manage tier 1 operations, escalating critical issues as needed.Lead Root Cause Analyses (RCAs) and conduct retrospectives for Sev 1 & Sev 2 incidents.Resolve escalated tickets from client operations teams and escalate complex issues appropriately.Monitoring & ReportingMonitor platform health and proactively enhance stability.Develop dashboards, monitors, and alerts using Grafana, AppDynamics, Splunk, Elastic, CloudWatch.Generate monthly operational reports and SSL certificate expiration reports (bi-monthly).Provide insights to optimize platform performance and reliability.Collaboration & Process ImprovementWork with DevOps, client operations teams, and product architecture for documentation and process enhancements.Assist in change management to ensure smooth deployments and minimal disruptions.Manage OpsGenie configuration and escalation trees.Support SOC audit compliance and security reviews.Required Skills & Qualifications✅ Incident Management & Triage Experience (24x7 support, escalations)✅ Root Cause Analysis (RCA)& troubleshooting experience✅ Monitoring & Alerting Tools (Grafana, AppDynamics, Splunk, CloudWatch)✅ Basic Maintenance Coding skills✅ Strong problem-solving & collaboration abilities✅ Flexible & adaptable to new technologiesPreferred SkillsWindows & Linux server infrastructure experienceAWS Operations experienceCorporate License Management knowledgeAgile Kanban methodology experienceFamiliarity with Git, Jira, and ConfluenceEducation & Minimum Qualifications🎓Bachelor's degree in a relevant field or equivalent experience💡 Strong communication, problem-solving, and organizational skills📢Equal Opportunity Employer- We encourage applications from all backgrounds, including individuals with disabilities and veterans.
Created: 2025-02-25