Systems Reliability Engineer (SRE) - Edge
Cloudflare
Lisbon, Portugal
há 3 dias

About Us

At Cloudflare, we have our eyes set on an ambitious goal : to help build a better Internet. Today the company runs one of the world’s largest networks that powers approximately 25 million Internet properties, for customers ranging from individual bloggers to SMBs to Fortune 500 companies Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code.

Internet properties powered by Cloudflare have all web traffic routed through its intelligent global network, which gets smarter with every request.

As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that.

We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!

Systems Reliability Engineer (SRE) - Edge

An engineering role at Cloudflare provides an opportunity to address some big challenges, at scale. We believe that with our talented team, we can solve some of the biggest security, reliability and performance problems facing the Internet.

We are looking for talented Systems Reliability Engineers to build and operate the Edge platform running in more than 200 cities in over 100 countries which makes Cloudflare customers place their trust in us.

Our SREs come from a variety of technical backgrounds and have built up their knowledge working in different environments.

But the common factors across all of our reliability-focused engineers include a passion for automation, scalability, and operational excellence.

Our SRE teams monitor our network in a follow the sun model with offices in Singapore, London, Lisbon, Austin and San Francisco.

We are still a small team, well-funded, growing quickly and focused on building an extraordinary company. This is a superb opportunity to join a high-performing team and scale our high-growth network as Cloudflare’s business grows.

You will build tools to constantly improve availability, performance, uptime and response times. You will nurture a passion for an automate everything approach that makes systems failure-resistant and ready-to-scale.

Our SREs focus on the immediate state and functionality of the Cloudflare platform around the world, leveraging an array of monitoring, alerting and diagnostics tools while developing and enhancing the Cloudflare platform and its capabilities.

The Edge SRE is a devops team, responsible for reliability engineering across a wide portfolio of applications and services, leveraging developer and operator patterns.

The ideal SRE candidate has a passionate curiosity about how the Internet fundamentally works and has a strong knowledge of DNS, Linux and TLS along with strong coding ability in Bash, Python or Go.

Requisite Skills

  • Linux systems experience
  • 3 years experience in an SRE role or a role with similar functions
  • Intermediate level software development skills in Python, Go or Bash
  • Strong skills in network services, including DNS, TLS / SSL and HTTP
  • Network fundamentals DHCP, ARP, subnetting, routing, firewalls, IPv6
  • Examples of desirable skills, knowledge and experience

  • Experience with the Linux kernel and Linux software packaging
  • Performance analysis and debugging
  • Configuration management systems such as Saltstack, Chef, Puppet or Ansible
  • Load balancing and reverse proxies such as Nginx, Varnish, HAProxy, Apache
  • SQL databases
  • Time series databases (OpenTSDB, Graphite, Prometheus, Grafana)
  • Key / Value stores (Redis, KyotoTycoon, Cassandra, LevelDB)
  • Internetworking and BGP
  • Bonus Points

  • Experience with continuous / rapid release engineering
  • Strong tooling and automations development experience
  • Experience working in a 24 / 7 / 365 service environment
  • High-bandwidth transit Internetworking and routing experience
  • Experience working with large scale production distributed systems
  • Some tools that we use

  • Kubernetes
  • Reportar esta oferta de trabalho
    checkmark

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    Candidate-se
    Meu email
    Ao clicar em "Continue", autorizo a neuvoo a processar os meus dados e a enviar-me alertas de e-mail, conforme detalhado na Política de Privacidade da neuvoo . Posso retirar o meu consentimento ou cancelar a subscrição a qualquer momento.
    Continue
    Formulário de candidatura