Senior Site Reliability Engineer, Enterprise Agility

Atlassian

Location: Mountain View, California

Job Type: Full time

Posted


Working at Atlassian

Atlassian can hire people in any country where we have a legal entity. Assuming you have eligible working rights and a sufficient time zone overlap with your team, you can choose to work remotely or return to an office as they reopen (unless it’s necessary for your role to be performed in the office). Interviews and onboarding are conducted virtually, a part of being a distributed-first company.

Love staying ahead of the cloud growth curve and experimenting with new software and environments? Get on board as an Atlassian Senior Site Reliability Engineer.

As an Engineer in the Enterprise Agility Site Reliability Engineering team, you'll build solutions to enhance availability, performance and stability for the hundreds of Atlassian enterprise customers in the Cloud as well as automating away repetitive work.

You will help secure the cloud architecture with penetration testing, vulnerability resolution and compliance audit responses.

You'll be working on non-production and production environments, monitoring, data collection and configuration management, as well as disaster recovery planning, capacity engineering, reliability improvement initiatives, and platform automation. You'll also respond to pings, pages and alerts to investigate issues in our products that you can really sink your teeth into. The best person for this role is someone that has a collaborative spirit - in our world, it's not about being a hero and having all the answers, it's about sometimes saying "I don't know" and working on finding solutions rather than starting with an assumption. The team needs someone who can ask questions, learn from others and turn chaos into order.

This role would be a great fit for someone with creative and innovative problem solving skills and a willingness to take responsibility for the code you write all the way to production. You will develop and implement solutions that operate at scale - seeing your own technology efforts directly improve the reliability of our products. Our teams are empowered and expected to improve our products to truly deliver a reliable experience to customers. You will own development efforts in each and every sprint from planning to delivery to realize this goal and collaborate with different team to review code.

One thing we promise: You'll never be bored.

On your first day, you will have experience in:

  • Required understanding of Windows systems
  • Network security, encryption technology, and roles permission mgmg
  • Previous experience in compliance industries SOC2, ISO27k, HIPAA
  • Software development in Python or Powershell
  • Hands on experience with cloud infrastructure AWS/Azure - minimum of 3 years
  • Experience monitoring distributed systems application architecture with tools like Splunk/SignalFX
  • Experience using Configuration Management tools like Terraform, Cloud Formation Template
  • Serious troubleshooting skills across different levels of the stack: Network, IAM roles, EC2, RDS, DynamoDB
  • Exposure to and maintenance of configuration management and orchestration tools at scale
  • Experience troubleshooting deployment/Orchestration Tools (Octopus Deploy, Jenkins)

We'd be super excited if you have:

  • Software development expertise in Golang or Java
  • Understanding of terminology like incident, problem, SLO, MTBF
  • Build systems (bitbucket, bamboo)
More about our team
Atlassian Site Reliability Engineering is a rapidly growing group within the organization. We are in the process of building our teams, tools and systems as part of Atlassian's mission to build the best SaaS services in the world. This is a truly exciting team to join - we are currently or are planning to be involved with every technical team across Atlassian.

We enable Atlassian to go fast by providing real time feedback on production systems. We work side by side with the product family and platform developers to maintain and improve services and performance.

We live the company values with a strong customer focus and possess a healthy sense of urgency. We are a heavily data driven team, utilizing a variety of data collection, enrichment, analytics and visualizations to learn about our complex systems.

We also live the 'Play, as a team' value by having a strong focus on sharing learning experiences from the front line with the development teams. So, the options for people in the team are vast. If you like mastering a domain and going deep, we need you. If you can juggle three tasks and coordinate multiple people in the heat of an incident, we need you. If you love the benefits of process and methodical improvement, you will love it here. If you want to keep your head down, headphones on and bash out code to support the team, we have a spot for you too.

Our perks & benefits

To support you at work and play, our perks and benefits include ample time off, an annual education budget, paid volunteer days, and so much more.

About Atlassian

The world’s best teams work better together with Atlassian. From medicine and space travel, to disaster response and pizza deliveries, Atlassian software products help teams all over the planet. At Atlassian, we're motivated by a common goal: to unleash the potential of every team.

We believe that the unique contributions of all Atlassians create our success. To ensure that our products and culture continue to incorporate everyone's perspectives and experience, we never discriminate based on race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines.

To learn more about our culture and hiring process, explore our Candidate Resource Hub.
You’ve got this!