Infrastructure Engineer

JetBrains

4 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

€ 70K

Job location

Tech stack

Amazon Web Services (AWS)

Azure

Cloud Computing

Continuous Integration

Software Debugging

DevOps

Disaster Recovery

Distributed Systems

GitHub

Identity and Access Management

Key Management

PostgreSQL

Linux kernel

Network Segmentation

Performance Tuning

Reliability Engineering

Site Reliability Engineering Practices

Prometheus

Data Logging

Google Cloud Platform

Cloud Platform System

Istio

Grafana

Caching

Infrastructure as Code (IaC)

Data Layers

Containerization

Kubernetes

Infrastructure Automation Frameworks

Deployment Automation

TeamCity

Terraform

Vulnerability Analysis

Job description

Terraform, GitHub Actions, Prometheus, GitOps tools (Flux)

Design, implement, and manage the core infrastructure powering Kineto's platform on Google Cloud Platform (GCP), including networking, security, and identity management.

Build and operate resilient, highly available distributed systems using Kubernetes (GKE), Knative, Istio, and related cloud-native technologies.
Automate the entire infrastructure life cycle (IaC) using Terraform and Terragrunt, ensuring secure, reproducible, and auditable environments.
Implement and maintain CI/CD pipelines (e.g. GitHub Actions and TeamCity) and deployment tools like Flux and Helm for GitOps-driven application delivery.
Optimize and manage the multi-tenant data layer on Postgres and Neon, focusing on robust tenant isolation, performance, backups, and safe schema management.

Operational Excellence and Reliability

Drive site reliability engineering (SRE) practices, including monitoring, alerting (Prometheus, Grafana), logging (Loki), and incident response.
Solve complex operational challenges, such as optimizing scale-to-zero for cost efficiency, minimizing cold starts, enhancing autoscaling behavior, and managing queue backpressure.
Implement platform-wide performance tuning (e.g. container resource limits, distributed locks, caching strategies, and GC configurations).
Ensure platform security and compliance by implementing best practices for secrets management, network segmentation, and vulnerability scanning.

Technical Leadership

Own major infrastructure roadmap items, including multi-region deployments, disaster recovery planning, advanced tenancy separation, and ephemeral preview environments.
Champion DevOps and SRE principles across the engineering team, mentoring engineers on cloud-native best practices, operational readiness, and debugging complex distributed systems.
Collaborate with product and engineering teams to define the long-term vision for the platform's architecture and operational model.

Requirements

5+ years of experience in building large-scale cloud-native infrastructure.
Deep expertise with GCP and Kubernetes, especially GKE.
Proficient with IaC tools like Terraform.

Design and manage infrastructure on Google Cloud Platform.
Build resilient systems using Kubernetes and related technologies.
Automate the infrastructure life cycle using Terraform.

Skills

Cloud-native infrastructure design, DevOps/SRE practices, Kubernetes administration, Infrastructure as Code (IaC), Linux internals, networking expertise

Have five or more years of experience building and operating large-scale, commercial cloud-native infrastructure, with a strong focus on DevOps/SRE practices.

Possess deep, hands-on expertise with GCP (or AWS/Azure) and Kubernetes administration and operations (GKE experience is a strong plus).
Are proficient with infrastructure-as-code (IaC) tools, particularly Terraform, for managing complex environments.
Have a solid understanding of Linux internals, networking (CNI and service mesh), security, and distributed system design.
Are familiar with CI/CD tools, GitOps (e.g. Flux), monitoring stacks (Prometheus/Grafana), and logging systems.
Thrive in cross-functional teams and excel at communicating complex infrastructure ideas clearly.

About the company

JetBrains is seeking an experienced Infrastructure Engineer to design and maintain our AI-powered platform in Madrid. The role involves managing cloud infrastructure on Google Cloud, implementing SRE practices, and leveraging Kubernetes. Ideal candidates will have over five years of cloud-native infrastructure experience, strong knowledge of IaC tools, and a robust understanding of distributed systems. Join us to foster an inclusive workplace while shaping the future of web applications.
