Python (nice to have)
Amazon AWS (nice to have)
Kibana (nice to have)
Google Cloud (junior)
Elasticsearch (regular)
We are currently recruiting for a Site Reliability Engineer to work for a Global Banking client.
Key Accountabilities:
Analyze and understand the IT organization's requirements and processes, and use this knowledge to help achieve full adoption and utilization of visibility solutions to maximize ROI and business value
Work towards a detailed monitoring strategy to ensure high system uptime, improved operational efficiency, and reduced support and operational costs
Participate in product / vendor evaluation exercises, and act as a trusted advisor providing recommendations on tool procurement and adoption strategy
Present and articulate advanced product features, benefits, and overall product solutions
Perform hands-on deployment, configuration, troubleshooting, and demonstration of SRE / visibility solutions
Oversee the design and technical aspects of assigned projects and provide technical leadership and guidance to project resources and teams.
Work on automated incident avoidance, self-healing, and incident management solutions
Plan, design, and implement maintenance and health-check activities
Performance and capacity management
Requirements:
Experience with Elastic (the client currently uses Elasticsearch but is looking to expand in the future).
Experience in Performance Engineering across applications, network, and infrastructure.
Understanding of complex application deployments and of architecting a monitoring strategy across applications, logs, servers, serverless compute, integration components, LAN and WAN networks, and infrastructure
Thorough hands-on experience setting up the necessary visibility / alerting based on business requirements, as well as providing recommendations based on industry practices to avoid visibility gaps
Will work to build and expand SRE practices across the department, working with architecture teams elsewhere to ensure alignment with wider strategy and best practices
Thorough understanding of OS concepts from a performance-tuning perspective
Deep understanding of the internal workings of legacy and modern application development technologies.
Nice to have:
Good understanding of external cloud providers (e.g. AWS, GCP, Azure)
Knowledge of programming or scripting languages such as Python, Ruby, Perl, and PowerShell, and of RESTful APIs
Solid understanding of multitier application architectures (Thick / Thin client, Cloud, Hybrid, SOA, etc.)
Very good understanding of web application architecture and software
Hands-on experience with deployment and usage of multiple Application / Network / Infrastructure enterprise tools or similar.
Fully remote, long-term contract.