Senior Site Reliability Engineer - AZURE Reliability - Cloud Engineering EMEA
Microsoft
Lisbon, Lisbon, Portugal
1 day ago

We are the Azure Reliability team, a multidisciplinary engineering organization tasked with leading reliability holistically across the Azure platform. Our goal is to make Azure the world’s safest and most reliable cloud.

For the most important Azure services and products, Azure Reliability adopts the Site Reliability Engineering (SRE) approach, where skilled teams of software engineers collaborate closely with product development teams to improve the availability, reliability, observability, and operability of our planet-scale distributed systems.

Azure SRE teams strive to improve reliability fundamentals via software engineering, preferring long-lasting platform improvements delivered as engineering projects over repetitive manual operations.

We contribute to the product fundamentals and architecture, share knowledge and code, and prefer reuse over reinvention, always looking for ways to make what we build useful to multiple teams and products.

We know that the SRE discipline is evolving; we learn from our peers in the industry and aim to contribute to this evolution by innovating on SRE within our group and sharing those innovations in public.

Our people have a wide variety of professional experiences, and we are interested in meeting both candidates with traditional engineering backgrounds and those without.

Some of us are industry veterans, while others joined quite recently. Together we form a varied and talented team, and we want to continue building our diversity with our new hires.

We strongly believe that diversity and an environment where everyone can feel safe to contribute their own insights are the key to making the best workplace possible.

We know that the best workplace makes the best products and services: not only is it the smart thing to do, but it is also the right thing.

We are not looking for people who know it all; we are looking for people who want to learn it all.

If you are excited by this type of challenge and you love to work in groups of people who are similarly excited, come join us!

We value the input of people who aren’t afraid to be learning all the time and embrace mistakes as they continuously improve both our services and themselves.

Today we are looking for people with an SRE mindset and experience who are also excited about large-scale Database as a Service solutions to join our Azure Data SRE team in Dublin, Ireland, and help us make Azure SQL DB an even better service for our customers.

Responsibilities

Billions of users across the world rely on our products, and to meet this demand we design and implement world-class distributed systems.

As a Software Engineer in our Azure Data SRE team, you will be responsible for improving the reliability of key Azure services, such as Azure SQL Database.

The Azure SRE key focus areas are:

  • Defining our systems’ reliability goals via Service Level Objectives (SLOs).
  • Improving our systems’ production posture via targeted observability and operability enhancements: telemetry, alerting, incident management (including on-call for the service), change management, and safe production changes.
  • Building reusable automation to empower multiple teams to achieve their reliability goals.
  • Influencing the product architecture and roadmap to make sure the customer-experienced reliability is always a key consideration when evolving the product.

Qualifications

We are looking for engineers passionate about the above areas who are also interested in:

  • Providing technical leadership for engineers across multiple teams within Azure.
  • Mentoring engineers on SRE principles, practices, and tools.
  • 5+ years of software development experience in the area of online services.
  • 5+ years of experience using programming languages such as C#, C++, Java, Python, or Go, and scripting languages such as PowerShell or Bash.
  • Experience working with (design, implementation and support) large-scale distributed systems (cloud computing providers, SaaS services, etc., ideally with millions or billions of users) or similarly complex environments.

Preferred:

  • A degree in Computer Engineering, Computer Science or related fields.

  • Experience working on large and unfamiliar codebases (millions of lines of code).
  • Experience as a technical lead or engineering manager, collaborating cross-team and cross-region.
  • .NET experience.
  • Awareness of, and ability to reason about, modern distributed software design patterns and cloud systems architecture, including microservices, containers, load-balancing, queuing, caching.
