experiencedSenior SRE/Dev Opsto
Role details
Job location
Tech stack
Job description
for critical Saa S events.Document issues and remediation steps.Proactively create monitors within the EKS/K8s ecosystem.Deploy to EKS/K8s cluster using Terraform and Helm/Flux.Enhance infrastructure health by implementing checks and scripts to address known issues.Maintain and develop deployment code.Implement/integrate new technologies into our Cloud Infrastructure.Collaborate with other teams to provide top-notch support and assistance.Prioritise customer focus in planning deployments/updates, ensuring minimal impact.Conduct RCA and take necessary corrective actions to prevent issue recurrence.Assign alert-related actions to the appropriate team after investigation.Handle support requests for environment-specific actions.To succeed in this role, you will needProficiency in Kubernetes (deployment, scaling, troubleshooting).Experience with configuration management tools like Flux CD/Argo CD.Strong experience with issue processing (RCA, Postmortems).Familiarity with AWS, Terraform
Requirements
Docker, CI/CD.Experience with monitoring tools like Data Dog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS Cloud Watch.Strong understanding of networking concepts and protocols.Proficiency in at least one scripting language (e.g., Python, Node JS, Go).Proficiency in Git or other version control systems.Familiarity with incident response and management tools like Pager Duty, Opsgenie, or Victor Ops.Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform.What you get in returnCompetitive Salary and annual performance/salary reviewsRealistic and transparent Bonus system (15-20%), paid quarterlyUnlimited paid vacation leave & paid sick leaveFlexible work schedule to accommodate your needs100% RemoteMedical Insurance for you +1Financial Support for Life Events & Extended Parental LeavePaid professional development courses and trainingsB2 B contracts1. HR Interview (30-45 min)2. Meeting with a Product Owner (60 min)3. Technical interview (90 min)4. Final Interview with CTO & Software Architect (60 min)", "employmentType": "FULL_TIME", "industry": "Site Reliability", "jobLocation" : { "@type": "Place", "address": { "@type": "PostalAddress", "streetAddress": "Valencia", "addressLocality": "Valencia", "addressRegion": "Valencia", "addressCountry": "ES", "postalCode": "n/a" } }, "salaryCurrency": "EUR", "title": "Senior site reliability engineer (platform tribe)", "hiringOrganization" : { "@type" : "Organization", "name" : "Playson" } }