Principal Site Reliability Engineer

Kharidle Online Enterprise
Municipality of Zaragoza, Spain
11 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English
Experience level
Senior
Compensation
€72,000 per year

Job location

Municipality of Zaragoza, Spain

Tech stack

Java
Amazon Web Services (AWS)
Azure
Backup Devices
Bash
Cloud Computing
Computer Networks
DevOps
Disaster Recovery
Distributed Systems
Python
Linux System Administration
Reliability Engineering
Software Engineering
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Reliability of Systems
CloudFormation
Infrastructure Automation Frameworks
Information Technology
Terraform
Docker
Go
Programming Languages

Job description

As a Principal Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, performance, and security of mission-critical systems. You will provide technical leadership, establish reliability standards, and collaborate closely with engineering teams to build resilient services that support our business growth.

  • Lead the architecture, implementation, and optimization of highly available cloud infrastructure.
  • Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and reliability metrics.
  • Drive automation initiatives to improve operational efficiency and reduce manual intervention.
  • Design and maintain monitoring, logging, alerting, and incident response systems.
  • Lead root cause analysis and post-incident reviews to prevent recurring issues.
  • Collaborate with software engineering teams to improve system reliability throughout the development lifecycle.
  • Develop disaster recovery, backup, and business continuity strategies.
  • Mentor and guide engineers on reliability engineering best practices.
  • Evaluate and implement new technologies that improve platform stability and performance.

Requirements

  • Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field, or equivalent practical experience.
  • 8+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or Infrastructure Engineering.
  • Strong experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Expertise in Linux system administration and networking concepts.
  • Experience with Infrastructure as Code tools such as Terraform or CloudFormation.
  • Strong knowledge of containerization and orchestration technologies, including Docker and Kubernetes.
  • Experience with CI/CD pipelines and automation tools.
  • Proficiency in scripting or programming languages such as Python, Go, Bash, or Java.
  • Excellent troubleshooting, communication, and leadership skills.

  • Experience leading large-scale distributed systems.
  • Professional cloud certifications.
  • Experience with observability platforms and advanced monitoring solutions.
  • Knowledge of security best practices and compliance standards.

Benefits & conditions

  • Health insurance
  • Dental insurance
  • Life insurance
  • Competitive salary of €72,000 per year.
  • Flexible working arrangements.
  • Professional development and training opportunities.
  • Collaborative and innovative work environment.
  • Paid vacation and public holidays.
  • Career growth opportunities within a growing organization.

Salary: €72,000.00 per year

About the company

Kharidle Online Enterprise is a growing technology-driven company focused on delivering reliable, scalable, and high-performance online services. We are seeking an experienced Principal Site Reliability Engineer (SRE) to lead the design, operation, and continuous improvement of our infrastructure and platform reliability.
