Junior+ Site Reliability Engineer

Gamingtec

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Experience level

Junior

Job location

Remote

Tech stack

Amazon Web Services (AWS)

Software Debugging

Linux

DevOps

DNS

Hypertext Transfer Protocols (HTTP)

Identity and Access Management

CURL

Log Analysis

Reliability Engineering

Ansible

TCP/IP

Data Logging

Scripting (Bash/Python/Go/Ruby)

Load Balancing

Amazon Web Services (AWS)

Kubernetes

Terraform

Docker

Job description

We are hiring a Junior+ Site Reliability Engineer to support the stability, monitoring, and incident response of high-availability distributed systems in a 24/7 iGaming production environment Remote London Limassol Tbilisi Yerevan

Are you ready to take your reliability engineering skills to the next level - in the high-stakes world of iGaming?

We are expanding the engineering team responsible for ensuring the stability and predictable behavior of our distributed services and platforms. This role involves hands-on production work, including monitoring, incident response, troubleshooting, and continuous improvements that increase platform reliability over time.

You will work as part of an SRE shift rotation covering late-evening and night hours, ensuring end-to-end ownership of incidents - from identifying user impact to post-incident follow-ups and preventive improvements., * Working in shift-based operations: monitoring, alert response, incident handling, escalation when needed;

Participating in incident handling: initial classification, technical investigation, coordination with engineering teams, and following-up improvements;
Developing and refining observability across platforms (metrics/alerts, dashboards, logs);
Reducing operational toil: small automation, runbooks, and repeatable processes (the "make it easier next time" mindset);
Collaborating with development teams to improve production readiness (basic reliability practices, cleaner incident follow-ups).

So, why Gamingtec?

If you are a person with passion, ideas, and a thirst to advance your career, you will love our corporate culture. We are an international team that treats each other with respect and moves towards the same goals. We believe in freedom and flexibility and trust our employees to do their jobs in a way that works for them. We have an ambitious and rewarding work environment, a flat organisational structure and almost zero bureaucracy. Our employees' ideas are what move the company forward. Everyone has equal opportunities in every aspect of work, learning and development!

Requirements

Good Linux skills in production environments (debugging basics, system services, logs, performance basics);
Solid understanding of networking fundamentals (TCP/IP, DNS, HTTP, load balancing basics, TLS fundamentals);
Experience with containers and image lifecycle basics (Docker or compatible runtimes);
Ability to troubleshoot across application, network, and infrastructure layers using logs/metrics and simple tools (curl, basic traffic/log analysis; scripting is a plus);
Basic familiarity with observability: metrics and alerting, dashboards, logging (any modern stack is fine).

SRE fundamentals (basic understanding):

You understand the difference between "just running infra" and SRE as a discipline: reliability targets, fast detection, clear escalation, and consistent follow-up;
You're familiar with SLI/SLO and can explain them in simple words (high-level understanding is enough).

Experience:

1+ year in a production-focused role (Ops / Support L2+ / DevOps / Junior SRE - what matters is real production exposure);
Participation in production incidents (triage, investigation, escalation, basic follow-ups);
Availability to cover late-evening and night shifts, in rotation.

Also, it will be great if you have:

Familiarity with Kubernetes (we don't require deep production ownership yet);
Exposure to AWS services such as EC2, ALB/NLB, RDS, S3, and IAM basics;
Exposure to Terraform and/or Ansible (small changes, basic understanding of principles);
Experience working in high-availability environments where downtime actually matters.

Benefits & conditions

Competitive salaries. We want only the top performers, so we offer the appropriate remuneration for their experience and knowledge;
Fully remote work. If you are in one of the areas where one of our offices is located, you will also have the option to go to the office;
Paid vacation and sick leave days. We believe that everyone should have a good work-life balance and no one should burn out;
Constant career development & learning opportunities!
Enjoy the corporate atmosphere with awesome parties and team-building events throughout the year;
Refer your friends and get rewarded with a bonus after they pass their probation period;
Find the right private medical insurance that works for you and receive compensation for it. Compensation (full/partial) depends on the cost;
Flexible Benefits plan. Decide which of your activities/expenses you want the company to compensate you for. For example: gym subscription, language courses, Netflix subscription, a spa day, etc;
Education foundation for learning something new. Be part of our biannual raffle that gives you the chance to learn something new, unrelated to your job.

And this is how our interview process goes:

A 30-minute interview with a member of our HR team to get to know you and your experience;
1st stage of technical interview (1 h) to assess your theoretical skills;
2nd stage of technical interview (1 h) to assess your hard skills;
A final interview to gauge your fit with our culture and working style.