Site Reliability Engineer

Playson

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Compensation

€ 42K

Job location

Remote

Tech stack

Amazon Web Services (AWS)

Cloud Computing

Computer Networks

Elasticsearch

Monitoring of Systems

Python

Node.js

Octopus Deploy

Reliability Engineering

Logstash

Prometheus

Scripting (Bash/Python/Go/Ruby)

Grafana

GIT

Kubernetes

Infrastructure Automation Frameworks

Kibana

Terraform

Software Version Control

Docker

Pagerduty

ELK

Job description

events. Document issues and remediation steps. Proactively create monitors within the EKS/K8s ecosystem. Deploy to EKS/K8s cluster using Terraform and Helm/Flux. Enhance infrastructure health by implementing checks and scripts to address known issues. Maintain and develop deployment code. Implement/integrate new technologies into our Cloud Infrastructure. Collaborate with other teams to provide top-notch support and assistance. Prioritize customer focus in planning deployments/updates, ensuring minimal impact. Conduct RCA and take necessary corrective actions to prevent issue recurrence. Assign alert-related actions to the appropriate team after investigation. Handle support requests for environment-specific actions. To Succeed In This Role, You Will Need Proficiency in Kubernetes (deployment, scaling, troubleshooting). Experience with configuration management tools like Flux CD/Argo CD. Strong experience with issue processing (RCA, Postmortems). Familiarity with AWS, Terraform

Requirements

Docker, CI/CD. Experience with monitoring tools like Data Dog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS Cloud Watch. Strong understanding of networking concepts and protocols. Proficiency in at least one scripting language (e.g., Python, Node JS, Go). Proficiency in Git or other version control systems. Familiarity with incident response and management tools like Pager Duty, Opsgenie, or Victor Ops. Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform. What You Get In Return Competitive Salary and annual performance/salary reviews Realistic and transparent Bonus system (15-20%), paid quarterly Unlimited paid vacation leave & paid sick leave Flexible work schedule to accommodate your needs 100% Remote Financial Support for Life Events & Extended Parental Leave Paid professional development courses and trainings B2 B contracts The Recruitment Process Includes The Next Steps 1. HR Interview (30-45 min) 2. Meeting with a Product Owner (60 min) 3. Technical interview with live coding (90 min) 4. Final Interview with CTO & Software Architect (60 min) #J-18808-Ljbffr ", "employmentType": "FULL_TIME", "industry": "Site Reliability", "jobLocation" : { "@type": "Place", "address": { "@type": "PostalAddress", "streetAddress": "n/a", "addressLocality": "Spain", "addressRegion": "Spain", "addressCountry": "ES", "postalCode": "n/a" } }, "salaryCurrency": "EUR", "title": "Senior site reliability engineer (platform tribe) - playson", "hiringOrganization" : { "@type" : "Organization", "name" : "Playson" } }

About the company

{ "@context": "http://schema.org", "@type": "JobPosting", "baseSalary" : { "@type": "MonetaryAmount", "currency": "EUR", "value": { "@type": "QuantitativeValue", "value": 0.00, "unitText": "MONTH" } }, "datePosted": "2026-03-10", "validThrough" : "2026-06-28", "description": "Founded in 2012, Playson is a leading i Gaming supplier recognised worldwide. We provide our customers with a high-end micro-service-based platform as a service that aims to process billions of financial transactions per day. We provide a cross-regional setup and are chasing latency reduction down to zero. We highly invest in delivering the best game experience and smooth connection regardless of the internet coverage and bandwidth of the game clients. We are currently seeking an experienced Senior Site Reliability Engineer to join our dynamic Platform Tribe. What Will You Be Doing Manage day-to-day alerts, system checks, and issue escalation as necessary. Provide 24x7 on-call support for critical Saa S

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all