Principal Site Reliability Engineer

Kharidle Online Enterprise

Municipality of Zaragoza, Spain

11 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Experience level

Senior

Compensation

€ 72K

Job location

Municipality of Zaragoza, Spain

Tech stack

Java

Amazon Web Services (AWS)

Azure

Backup Devices

Bash

Cloud Computing

Computer Networks

DevOps

Disaster Recovery

Distributed Systems

Python

Linux System Administration

Reliability Engineering

Software Engineering

Data Logging

Scripting (Bash/Python/Go/Ruby)

Google Cloud Platform

Reliability of Systems

Cloudformation

Infrastructure Automation Frameworks

Information Technology

Terraform

Docker

Programming Languages

Job description

As a Principal Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, performance, and security of mission-critical systems. You will provide technical leadership, establish reliability standards, and collaborate closely with engineering teams to build resilient services that support our business growth., * Lead the architecture, implementation, and optimization of highly available cloud infrastructure.

Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and reliability metrics.
Drive automation initiatives to improve operational efficiency and reduce manual intervention.
Design and maintain monitoring, logging, alerting, and incident response systems.
Lead root cause analysis and post-incident reviews to prevent recurring issues.
Collaborate with software engineering teams to improve system reliability throughout the development lifecycle.
Develop disaster recovery, backup, and business continuity strategies.
Mentor and guide engineers on reliability engineering best practices.
Evaluate and implement new technologies that improve platform stability and performance.

Requirements

Do you have experience in Disaster recovery?, Do you have a Bachelor's degree?, * Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field, or equivalent practical experience.

8+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or Infrastructure Engineering.
Strong experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
Expertise in Linux system administration and networking concepts.
Experience with Infrastructure as Code tools such as Terraform or CloudFormation.
Strong knowledge of containerization and orchestration technologies, including Docker and Kubernetes.
Experience with CI/CD pipelines and automation tools.
Proficiency in scripting or programming languages such as Python, Go, Bash, or Java.
Excellent troubleshooting, communication, and leadership skills., * Experience leading large-scale distributed systems.
Professional cloud certifications.
Experience with observability platforms and advanced monitoring solutions.
Knowledge of security best practices and compliance standards.

Benefits & conditions

Pulled from the full job description

Health insurance
Dental insurance
Life insurance, * Competitive salary of €72,000 per year.
Flexible working arrangements.
Professional development and training opportunities.
Collaborative and innovative work environment.
Paid vacation and public holidays.
Career growth opportunities within a growing organization.

Sueldo: 72.000,00€ al año

About the company

Kharidle Online Enterprise is a growing technology-driven company focused on delivering reliable, scalable, and high-performance online services. We are seeking an experienced Principal Site Reliability Engineer (SRE) to lead the design, operation, and continuous improvement of our infrastructure and platform reliability.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all