Application Monitoring and Reliability Engineer
Global Shares
Porto, Porto District, PT
5 days ago
source : BidRecruit Ltd.

Who we are :

Global Shares is a leading provider of public and private global stock plan services. Our employee share ownership plan technology is award-winning, and our range of products and services make equity plan administration simple, secure, and globally compliant.

People - We care about our people. We treat our people with respect. We embrace diversity and inclusion. We build open and honest relationships, collaborating across boundaries to meet our clients’ needs.

Bravery - We dare to do things differently. We provide the best service and technology through innovation, creativity and high performance.

We challenge the norm, we challenge ourselves and we challenge complexity with simplicity.

Global - We are one global network. We are champions of employee ownership. We work together, incorporating clients and partners as an extension of our team.

We foster a global and diverse community, where our people are united through ambition, commitment and shared goals. We are in this journey together.

Integrity - We are committed to professional integrity. We conduct our business to the highest standards with skill, diligence and responsibility.

Professional trust, honesty and compliance are at the core of our culture.

Service - We are client focused. We strive to provide our best service and to drive a great client experience through teamwork and high performance.

Why we need you :

We specialize in the delivery of high-quality web-based applications for participants and administrators for equity-based incentives and share plans.

We are headquartered in Ireland and we also have a world-class software development team located in Lisbon.

This is a varied role in a fast-paced, agile / scrum environment. The role will be responsible for ensuring our production environments are as reliable, resilient, and performant as possible.

This includes reacting to issues and coordinating the appropriate response / action to ensure minimal disruption to our clients.

This is a hands-on position requiring solid technical skills, as well as excellent interpersonal and communication skills.

What you will do :

  • Configuration and management of application monitoring tools.
  • Evaluation of application monitoring results with emphasis on continuous improvement.
  • Understand log analytics monitoring using a tool such as Graylog and / or Splunk.
  • Alert management configuration to ensure critical alerts are dealt with appropriately while preventing alert fatigue.
  • Investigate production issues with a focus on fast resolution.
  • Perform and / or assist with root cause analysis to resolve long-term issues and prevent recurrence.
  • Proven ability to coordinate and manage production incidents, including communication to business.
  • Organize and contribute to incident postmortems. Drive to constantly improve processes.
  • Strong reporting skills, providing key data and metrics to management.
  • Triage production issues as part of the support team.
  • Understanding of high-availability, fault-tolerant, scalable, and distributed systems.
  • Work closely with DevOps to help drive high availability and fault tolerance within the infrastructure.
  • Work closely with Architecture and Delivery Teams to minimize issues (bugs, performance, etc.) within the software.
  • Initial investigation, logging, and tracking / reporting of errors and performance issues found via monitoring tools.
  • Experience working in a small team with minimal supervision and a can-do attitude.
  • Share domain and technical expertise, providing technical mentorship and cross-training to other peers and team members.
What you have :

  • A BSc in Computer Science or equivalent technical discipline.
  • Experience in one of the major cloud providers (AWS, Azure, or GCP).
  • Tools / Software :
      • Monitoring (Grafana, InfluxDB, Prometheus, New Relic, Pingdom)
      • Logging (Graylog, Splunk)
      • Incident & Alert Management (OpsGenie)
  • Excellent written and verbal communication.
  • Flexible, team player, get-it-done personality.
  • Ability to organize and plan work independently.
  • Ability to work in a rapidly changing environment.
  • Ability to multi-task and context-switch effectively between different activities and teams.
What we offer :

  • Opportunity to be part of something special: Global Shares is growing fast, and we want you to be part of our journey
  • Competitive salary
  • Employee Assistance Programme
  • Flexible working
  • Active Social Club with events throughout the year
  • Fresh fruit
  • Casual dress code
  • Fully subsidised CEP exams
  • Opportunity to travel and work in our global offices if desired
What our Interview Process is like :

    Step 1 - After you apply, a recruiter may reach out to you for an introductory call

    Step 2 - If your background is a match for the role, you may be required to complete a technical assessment (role dependent) and / or a phone interview with 1-2 people

    Step 3 - If you continue through the process, you will come onsite 1-2 times to interview

    We are committed to an inclusive and diverse Global Shares. Global Shares is an equal opportunity employer.

    Want to see more? Have a look at what life is like at Global Shares in the video linked on our careers page.
