Job location
Municipality of Madrid, Spain
Tech stack
Bash
Configuration Management
Databases
Software Debugging
Linux
DevOps
File Systems
Distributed Systems
DNS
Fault Tolerance
Monitoring of Systems
Node.js
Ansible
TCP/IP
Zabbix
Grafana
Kubernetes
Bare Metal
Puppet
Job description
Group-IB is hiring a Senior DevOps Engineer who understands infrastructure beyond abstractions - Linux, networking, and Kubernetes at a deep level. You'll be working on high-load, high-availability infrastructure that supports production systems at scale.
This role requires real depth:
You understand how systems work under the hood
You can debug issues beyond dashboards
You've worked with infrastructure, not just abstractions
Your work will directly impact system reliability, performance under load, and incident response and recovery. This is real infrastructure engineering - not internal tooling or purely cloud-managed environments.
What You'll Do
Operate and improve core infrastructure (bare-metal, LXC, VMs, cloud)
Run and debug Kubernetes clusters in production
Design fault-tolerant, high-performance systems
Work deeply with networking (TCP/IP, routing, connectivity issues)
Build and maintain Infrastructure as Code
Own incidents end-to-end
Collaborate with engineers to solve problems - not escalate them
Requirements
Must-Have
4+ years in Linux / DevOps / SRE
Strong Linux fundamentals (processes, memory, networking, filesystem)
Solid understanding of networking (TCP/IP, DNS, debugging connectivity)
Real Kubernetes experience in production - not just usage, but understanding how it works internally
Experience with:
Bash scripting
Configuration management (Ansible / Puppet)
Monitoring systems (Grafana, Zabbix, etc.)
Experience with distributed systems / databases
We Value A Lot
Experience with bare-metal infrastructure
Kubernetes at node / networking / runtime level
Debugging real production incidents
High-load systems experience
Clear, structured communication
Strong ownership mindset
If you're someone who enjoys solving real infrastructure problems - we'd love to hear from you.
About the company
Founded in 2003 and headquartered in Singapore, Group-IB is a leading creator of cybersecurity technologies to investigate, prevent, and fight digital crime. Combating cybercrime is in the company's DNA, shaping its technological capabilities to defend businesses, citizens, and support law enforcement operations.