Staff Reliability Engineer - Application Owner

  • The Hartford
  • Columbus, Ohio
  • Full Time

Staff Reliability Engineer - IE07KE

We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future.

We are seeking a highly motivated and technically skilled Reliability Engineer - Application Owner to join our Reliability Engineering team. This individual contributor role is responsible for the end-to-end reliability, performance, and lifecycle management of critical applications within our Claims and Operations IT ecosystem. You will work closely with engineering, infrastructure, and business teams to ensure our systems are resilient, observable, and continuously improving.

Key Responsibilities:

Application Ownership

  • Serve as the technical owner for one or more applications, ensuring their reliability, scalability, and performance.
  • Drive adoption of best practices in observability, automation, and incident prevention.
  • Ensure compliance with enterprise architecture, security, and regulatory standards.

Reliability Engineering

  • Design and implement automation to reduce manual toil and improve operational efficiency.
  • Build and maintain monitoring, alerting, and self-healing capabilities using tools like Dynatrace, Splunk, and CloudWatch.
  • Lead root cause analysis and implement long-term fixes for recurring issues.

DevSecOps & CI/CD

  • Collaborate with DevOps teams to enhance CI/CD pipelines for secure and efficient deployments.
  • Integrate security and compliance checks into the software delivery lifecycle.
  • Promote infrastructure-as-code (IaC) practices using tools like Terraform or CloudFormation.

Incident & Problem Management

  • Lead triage and resolution of high-severity incidents, minimizing business impact.
  • Improve incident response processes and reduce mean time to recovery (MTTR).
  • Maintain accurate documentation, runbooks, and operational metadata.

Collaboration & Influence

  • Partner with development, QA, and infrastructure teams to drive reliability initiatives.
  • Contribute to the Reliability Engineering Community of Practice.
  • Mentor junior engineers and promote a culture of continuous improvement.

Qualifications:

Technical Expertise

  • 7+ years of experience in software engineering, SRE, or application support.
  • Strong knowledge of AWS services (EC2, Lambda, S3, CloudWatch, IAM).
  • Proficiency in scripting (Python, NodeJS Bash, PowerShell) and automation.
  • Experience with observability tools (Dynatrace, Splunk, Prometheus, Grafana).
  • Familiarity with CI/CD tools (Jenkins, GitHub Actions, Azure DevOps).
  • Hands-on experience with containerization (Docker, Kubernetes, ECS/EKS).
  • Proficiency in infrastructure-as-code (Terraform, CloudFormation).

Operational Excellence

  • Proven ability to lead incident response and root cause analysis.
  • Experience implementing SLIs, SLOs, and SLAs.
  • Ability to design and implement runbooks, playbooks, and automated health checks.

Security & Compliance

  • Understanding of DevSecOps principles and secure software delivery.
  • Familiarity with compliance frameworks (SOC2, HIPAA, PCI-DSS).

Collaboration & Communication

  • Strong cross-functional collaboration and communication skills.
  • Ability to explain technical concepts to non-technical stakeholders.
  • Experience mentoring or leading technical discussions.

Preferred Qualifications

  • AWS Certified DevOps Engineer, CKA, or Google SRE certification.
  • Experience in financial services or insurance, especially contact center or claims operations.
  • Exposure to hybrid cloud environments and legacy modernization.

Why Join Us?

This role is part of a strategic transformation initiative to embed Reliability Engineering across Claims and Operations IT. You'll play a key role in modernizing our systems, improving customer experience, and driving operational excellence.

This role will have a Hybrid work schedule, with the expectation of working in an office (Columbus, OH, Chicago, IL, Hartford, CT or Charlotte, NC) 3 days a week (Tuesday through Thursday).

Candidates must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.

Compensation

The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:

$126,160 - $189,240

Equal Opportunity Employer/Sex/Race/Color/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age

About Us | Our Culture | What It's Like to Work Here | Perks & Benefits

Job ID: 487611822
Originally Posted on: 8/1/2025

Want to find more Quality Control opportunities?

Check out the 28,237 verified Quality Control jobs on iHireQualityControl