Site Reliability Engineering

UST España
Pontevedra, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote
Pontevedra, Spain

Tech stack

Java
JavaScript
Amazon Web Services (AWS)
Azure
Cloud Engineering
DevOps
Distributed Systems
Fault Tolerance
Python
Nagios
Reliability Engineering
Data Logging
System Availability
Grafana
New Relic (SaaS)
AppDynamics
Docker
PagerDuty
Jenkins
ServiceNow

Job description

Role & Responsibilities

Work as part of a cross-functional engineering team applying Site Reliability Engineering (SRE) principles to in-house developed front-end applications.
Maintain clear oversight of system dependencies and continuously iterate on improvements to achieve high availability and fault tolerance.
Improve reliability, monitoring, and observability of services deployed on AWS within a complex, distributed environment.
Identify opportunities to reduce operational overhead through automation and observability.
Define, implement, and meet Service Level Objectives (SLOs).
Participate in system design, platform management, and service reviews.
Be willing to participate in a 24/7 on-call rotation (paid additionally).

Requirements

Experience required

Experience as a DevOps Engineer, Cloud Engineer, or similar role.
Strong experience in cloud architecture and adoption patterns (AWS preferred; Azure/GCP is a plus).
Proven experience with distributed systems, focusing on performance, scalability, and resiliency.

Technical Skills

Infrastructure as Code and CI/CD pipelines (Jenkins).
Containers and orchestration: Docker & Kubernetes.
Monitoring, logging, and alerting tools: New Relic, Grafana, AppDynamics, Logz.io.
Observability implementations with measurable results.
Experience with SLI/SLO metrics in critical systems.
Process automation.
Support platforms: ServiceNow, PagerDuty.
Ability to develop code in Java, JavaScript, and Python.

Benefits & conditions

Work schedule

Business hours. No intensive working days on Fridays or in summer.

What can we offer?

23 days of annual leave, plus the 24th and 31st of December as discretionary days.
Numerous benefits (health care plan, teleworking compensation, life and accident insurance).
"Retribución Flexible" (flexible remuneration) program: meals, kindergarten, transport, online English lessons, health care plan...
Free access to several training platforms.
Professional stability and career plans.
UST also compensates referrals, so you can benefit when you refer professionals.
The option to choose between 12 or 14 payments over the year.
Real work-life balance measures (flexibility, WFH or remote work policy, compacted hours during summertime...).
UST Club Platform discounts and gym access discounts.

If you would like to know more, don't hesitate to apply and we'll get in touch to fill you in on the details. We are waiting for you!

At UST we are committed to equal opportunities in our selection processes and do not discriminate based on race, gender, disability, age, religion, sexual orientation, or nationality. We have a special commitment to Disability & Inclusion, so we are interested in hiring people with a disability certificate.

About the company

UST is a multinational company based in North America, certified as a Top Employer, with over ****** employees worldwide and a presence in more than 30 countries. We are leaders in digital technology services, and we provide large-scale technology solutions to big companies.

What are we looking for?

We are looking for a Site Reliability Engineer to join our team and work with a highly digitalized, top-tier banking customer in the US & UK market.

Location: Fully remote, only in Spain
Languages: English and Spanish are a must.
