Engineering Enablement Site Reliability Engineer
Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in Site Reliability Engineering.
As a Site Reliability Engineer at JPMorgan Chase within the Corporate Oversight and Governance (COG) Architecture & Engineering team, you will work collaboratively with stakeholders to define non-functional requirements (NFRs) and availability targets for the services in your application and product lines. You will ensure those NFRs are accounted for in your products’ design and test phases, that your service level indicators effectively measure customer experience, and that service level objectives are defined with stakeholders and implemented in production. You will solve complex problems in code with a quality-driven, product-centric approach.
Corporate Oversight and Governance Technology (COGT) is responsible for developing solutions that support the Compliance, Controls Management, Resiliency, Legal, Regulatory, and Audit lines of business. The solutions support 1st, 2nd, and 3rd line independent review, monitoring, and oversight of business operations, with a focus on legal and regulatory obligations related to the firm’s products and services.
Architecture and Engineering is a cross-functional group within Corporate Oversight & Governance Technology. It covers engineering practices, architectural governance, and data management by providing guidance, setting mandates, and delivering solutions.
Job responsibilities
- Contributes to creating high-quality designs, roadmaps, plans, standards, and program charters that are delivered by you, the team you are working in, or the wider COGT engineering community.
- Demonstrates site reliability culture, principles and practices every day and champions the adoption of site reliability.
- Collaborates with others to create and implement observability and reliability designs for complex systems that are robust, stable, and do not incur additional toil or technical debt.
- Collaborates in the design, creation, and advocacy of SRE products that can be used to scale the implementation of SRE best practices within COGT.
- Evolves and debugs critical components of applications and platforms.
- Contributes to JPMorgan Chase’s site reliability community via internal forums, communities of practice, guilds, and conferences.
- Participates in architecting, designing and building highly distributed systems and SRE products, solving complex problems in code.
- Maintains and promotes best practices in software engineering, leading by example.
Required qualifications, capabilities, and skills
- Demonstrable applied experience of SRE concepts, strategies, and culture.
- Knowledge of and experience in observability, including white-box and black-box monitoring, service level objectives, alerting, and telemetry collection using tools such as OpenTelemetry (OTel), Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
- Competency in at least one of the following programming languages: Java, Go, Python, TypeScript, JavaScript.
- Familiarity with software design patterns.
- Understanding of the software delivery life cycle and associated tooling, including branching and testing strategies.
- Experience developing containerised, serverless, and event-driven systems.
- An agile practitioner.
- Ability to anticipate, identify, and troubleshoot defects found during testing.