Applications for this job have closed. This page will redirect to the American Express employer page in 10 seconds.

Engineering Director

Greater London
Full time
employer logo
American Express
Banking, investment & finance
10,001+ employees
Compare top employers

You Lead the Way. We’ve Got Your Back.

With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.

At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague has the opportunity to share in the company’s success. Together, we’ll win as a team, striving to uphold our company values and powerful backing promise to provide the world’s best customer experience every day. And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.

Join Team Amex and let's lead the way together.

Are you interested in a career creating breakthrough software and making an impact on an audience of millions?

American Express has embarked on an exciting journey transforming our Site Reliability Engineering & Application Support (SRE & AS) Organization. This role will be responsible for working on platforms that are low latency, always available and highly resilient supporting operations that provide 24x7, 365 days a year coverage. The Customer Journey SRE & AS team is a cross functional, collaborative and innovative team responsible for the availability of the real-time authorization network and clearing and settlement platform.

You will be a key Engineering Leader that influences the future of SRE & AS at American Express.

How will you make an impact in this role?

  • Lead a team of engineers and grow the technical backbone of our organization.
  • Implement software development practices to build observability, alerting, tracing, automation and self-healing capabilities to maintain the highest levels of platform availability.
  • Performance tune and enhance the reliability of the infrastructure stack, for both public and private cloud.
  • Hands on contribution to enterprise solutions, tooling, and initiatives leveraging your technical experience.
  • Nurture an environment of innovation and continuous improvement, leading changes that drive efficiencies into existing engineering and delivery processes.
  • Lead experimentation and proof of concepts of new open-source technologies to solve observability, testing and resiliency challenges. Influence the technology adoption for the Customer Journey organization and broader company platforms.
  • Implement shift left automated testing to prevent defects from reaching production
  • Ensure all new critical subsystems, microservices, databases and external calls meet the 5 9's availability requirement.
  • Provide consultation for all significant functionality changes and peer review critical production hotfixes
  • Conduct technical code reviews and drive innovation across the organization to adopt industry best practices.
  • Be part of a global operations team that support a 24/7 model, willingness to work holidays and weekends.

Minimum Qualifications:

  • Experience coding with Java, python, node, or a similar language with a strong desire to learn new languages. Experience with ML is a plus.
  • Excellent coding and scripting skills (Terraform/Ansible), knowledge of CICD tools (Jenkins, Gitlab and Artifactory), experience with monitoring/alerting/logging solutions (Splunk, Datadog, AWS, GCP Stackdriver, etc.)
  • Experience as an SRE in a public cloud environment with experience in designing and building cloud-native applications. Must have hands on experience with Kubernetes.
  • High degree of technical knowledge, ranging across several technologies (e.g., platform enablers including Prometheus, Consul, Vault, ELK and Infrastructure platforms including Cloud, networking and storage)
  • Hands on experience in building and enhancing distributed micro-service systems. Must have hands on experience with ServiceMesh products such as Istio.
  • Awareness of the challenges of distributed systems and practices of building highly available platforms.
  • An affinity to connect with openness and transparency and a passion to learn new technologies and optimize them to their potential.
  • Bachelor's Degree in Computer Science, Computer Engineering, or equivalent work experience.
  • Experience in a start up is a plus.

We back our colleagues and their loved ones with benefits and programs that support their holistic well-being. That means we prioritize their physical, financial, and mental health through each stage of life. Benefits include:

  • Competitive base salaries
  • Bonus incentives
  • Support for financial-well-being and retirement
  • Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)
  • Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
  • Generous paid parental leave policies (depending on your location)
  • Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
  • Free and confidential counseling support through our Healthy Minds program
  • Career development and training opportunities

Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.