Site Reliability Engineer

Key Responsibilities:

Incident Management:

● Lead the management of incidents from detection to resolution, ensuring timely communication and minimal impact on customers.
● Coordinate with on-call teams (both DevOps and R&D) to address critical issues and provide rapid resolutions.
● Act as the incident manager during major events, providing updates and ensuring adherence to established incident protocols.

Root Cause Analysis:

● Perform post-incident root cause analysis (RCA) and create reports with actionable recommendations for improvement.
● Lead post-mortem discussions to identify system weaknesses and areas for enhancement.

Monitoring and Alerting:

● Improve and maintain real-time monitoring systems and alerts to ensure early detection of issues across our platforms.
● Work closely with development and DevOps teams to enhance observability and increase system visibility.

Automation:

● Identify repetitive tasks in the incident management process and automate them to reduce manual intervention and response times.
● Implement tools and processes that improve system resilience and reduce the frequency of incidents.

Qualifications:

● Proven experience in Site Reliability Engineering or a similar role with a strong focus on incident management.
● Strong understanding of incident response protocols, root cause analysis, and post-mortem processes.
● Experience with monitoring and alerting tools such as Prometheus, Grafana, Coralogix, or equivalent.
● Proficiency in cloud management (AWS) and a deep understanding of scaling and reliability practices.
● Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, Terraform, GitHub Actions).
● Excellent problem-solving skills with the ability to manage high-pressure situations.
● Strong communication skills, with the ability to clearly articulate technical issues to both technical and non-technical stakeholders.
● Experience working in an on-call rotation and leading incident response efforts.

You can contact us at info@apitree.cz or at +420 602 609 112

HR department direct contact:

ApiTree s.r.o.

Francouzská 75/4, Praha 2 Vinohrady, 120 00

ApiTree s.r.o. is registered in the Commercial Register at the Municipal Court in Prague, under file no. C 279944

ID: 06308643
VAT: CZ06308643
