Site Reliability Engineer

JP Morgan

Location: Glasgow City

Job Type: Full time

Posted

Men
16%
Women
Show that gap who’s boss!
Women are 16% less likely than men to apply to a job once they’ve viewed it, but are 16% more likely to get hired after applying to a job.*
*LinkedIn Talent Solutions Gender Insights Report 2019

As a Site Reliability Engineer (SRE), you'll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems.

In the role of Site Reliability Engineer, you will work in a collaborative team of software professionals and be responsible for improving the health of the applications. You will be working with other SRE members as well as product development teams to build and support innovative technology solutions including user interfaces, middle-tier and server-side components, and will need to ensure adherence to architecture standards, risk management, and security policies.

Much of our support and software development focuses on optimizing existing systems, building infrastructure and reducing work through automation. You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. As an SRE you’ll be focused on running better production applications and systems.

Responsibilities

  • Troubleshoot priority incidents, facilitate blameless postmortems and ensure permanent closure of incidents
  • Collaborate across Application Development, Product and production management to establish and maintain Service Level Objective (SLO), Service Level Indicator (SLI) and Error Budget for key Production services.
  • Identify application patterns and analytics in support of better service level objectives
  • Implement required telemetry and observability to monitor and measure the quality of service in real-time against the established SLO.
  • Manage, track and validate all changes to the Production, Disaster Recovery environment
  • Manage priority incidents and leverage cross-functional teams to quickly eliminate impacts
  • Escalate issues/Risks effectively when necessary across supporting framework
  • Troubleshoot Key technical issues or escalate and work with appropriate technology teams to provide solutions.
  • Manage application and infrastructure to maximize stability and resiliency. Leverage and improve monitoring and alerting capabilities to ensure application SLAs are met.
  • Provide Level 3 production support to applications in production. Handle Code deployments for application releases.
  • Strong focus on automation and processes. Design, implement, improve and utilize key monitoring tools.
  • Automate software and product upgrades, change management, and release management solutions

Qualifications & Skills

  • 5+ years of experience in similar functions, tools & technologies.
  • Bachelor’s degree or equivalent experience in an software engineering discipline.
  • Hand-on experience in instrumentation, customization and usage of modern monitoring toolset such as Dynatrace, Grafana, Geneos etc
  • Exposure to Splunk, Elastic search/ Kibana would be a plus.
  • Exposure to Kubernetes / AWS would be a plus.
  • Hands-on in one technology stack (Java/J2EE) with coding, testing, and delivering software
  • Working knowledge in at least one of the relational database ( MS SQL Server, Oracle, Cassandra etc.)
  • Comfortable working in Agile mode and proficient in Continuous Integration and Continuous Delivery.
  • Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm.
  • Working knowledge of infrastructure components. (E.g. routers, load balancers, cloud products, container systems, compute, storage and networks).
  • Excellent debugging and trouble shooting skills.
  • Experience in performance monitoring and capacity management of large systems using various tools.
  • Solid analytical and problem solving skills.
  • Attention to detail and time-management skills.
You’ve got this!