Share this Job
Apply now »

Monitoring and Reliability Engineer


We are Farmers!

Join a team of diverse professionals at Farmers to acquire skills on the job and apply your learned knowledge to future roles at Farmers. Farmers Insurance also offers extensive training opportunities through the award winning University of Farmers named by Training magazine amongst top 10 corporate training units in the world.

Job Summary

The Application Operations (AO) organization’s overall responsibility is to ensure the availability of over 100+ applications used by Farmers’ exclusive agents, independent agents, customers, and customer service advocates nationally.  Team members are responsible for proactively increasing system availability, process efficiency, and the overall quality of the end-user experience.  Team members will work with IT and Business Partners across the Farmers landscape to evaluate alerts, monitor system/application/network health and recommend action to resolve potential issues before they negatively impact our customers.

Essential Job Functions

  • Be a contributing member of the Detection and Resiliency team within Applications Operations, focusing on support activities for agent-, customer-, and employee-facing applications.  Manage the lifecycle of production incidents, restoring services as quickly as possible when interruptions occur.
  • Drive instrumentation, automation and process improvements.
  • Develop dashboards for IT and Business stakeholders.
  • Analyze and identify thresholds to establish meaningful alerts.
  • Help define rules to correlate alerts.  Use a wide perspective to correlate events from many sources to reduce the time to identify and repair the root cause of issues. 
  • Work with teams to understand critical business transactions and provide full traceability through monitoring to pinpoint when and where those transactions stop working optimally.
  • Partner with Incident Management teams to recommend a course of action that resolves potential issues before they negatively impact our customers.
  • Partner with development teams to help build an effective monitoring strategy post change implementation. 
  • Develop an understanding on how the current environment is architected and suggest resiliency improvements to improve uptime for critical services.
  • Participate in a team on-call rotation to help provide support to our 24x7 business operation
  • Work with the Change Management and Problem Management teams to proactively reduce issue occurrence/recurrence.

Physical Actions

Physical Environment

Education Requirements

Bachelor’s degree in computer science, engineering, mathematics, or related discipline OR commensurate experience preferred. High school diploma or equivalent required. 

Experience Requirements

Level II: 4 years of experience in application development / testing or business / system analysis role or 2 year as an Application SME Level I.

Special Skill Requirement

Problem-solving/analytical skills – must excel at performing detailed analysis and resolving user problems of varying magnitudes 

Technical expertise –must demonstrate the ability to understand large scale IT ecosystems and the ability to associate business processes to the IT systems supporting them.  Should have at least one strong area of technical expertise in areas like software development, infrastructure, telephony, or databases.

Performance monitoring tools – Experience with Splunk, AppDynamics, Moogsoft or similar tools required.  Other monitoring tool experience (SevOne, Cloudwatch, TeaLeaf) would be a plus.

Multitasking – experience handling technical issues from multiple customers at the same time and assessing/prioritizing the associated impact.

Communications – Excellent communication, both verbal and written, including emails, reports, presentations, etc.  Must have the ability to calmly focus a group of resources on the tasks required to recover business services when they are down.

Customer service – Must be able to communicate effectively with customers and co-workers at varying levels within an organization.

Process improvements – Experience identifying and driving process improvements resulting in efficiency gains and/or improved customer experience.

Additional Qualification

Job Location(s): US - MI - Caledonia, US - CA - Los Angeles, US - CA - Woodland Hills, US - OK - Oklahoma City

Want to learn more about our culture & opportunities? Check out farmers.com/careers and be sure to follow us on Instagram and LinkedIn!

Nearest Major Market: Grand Rapids

Apply now »