Infrastructure Engineer

JetBrains
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
€ 70K

Job location

Madrid

Tech stack

Amazon Web Services (AWS)
Azure
Cloud Computing
Continuous Integration
Software Debugging
DevOps
Disaster Recovery
Distributed Systems
GitHub
Identity and Access Management
Key Management
PostgreSQL
Linux kernel
Network Segmentation
Performance Tuning
Reliability Engineering
Site Reliability Engineering Practices
Prometheus
Data Logging
Google Cloud Platform
Cloud Platform System
Istio
Grafana
Caching
Infrastructure as Code (IaC)
Data Layers
Containerization
Kubernetes
Infrastructure Automation Frameworks
Deployment Automation
TeamCity
Terraform
Vulnerability Analysis

Job description

  • Design, implement, and manage the core infrastructure powering Kineto's platform on Google Cloud Platform (GCP), including networking, security, and identity management.
  • Build and operate resilient, highly available distributed systems using Kubernetes (GKE), Knative, Istio, and related cloud-native technologies.
  • Automate the entire infrastructure life cycle (IaC) using Terraform and Terragrunt, ensuring secure, reproducible, and auditable environments.
  • Implement and maintain CI/CD pipelines (e.g. GitHub Actions and TeamCity) and deployment tools like Flux and Helm for GitOps-driven application delivery.
  • Optimize and manage the multi-tenant data layer on Postgres and Neon, focusing on robust tenant isolation, performance, backups, and safe schema management.

Operational Excellence and Reliability

  • Drive site reliability engineering (SRE) practices, including monitoring, alerting (Prometheus, Grafana), logging (Loki), and incident response.
  • Solve complex operational challenges, such as optimizing scale-to-zero for cost efficiency, minimizing cold starts, enhancing autoscaling behavior, and managing queue backpressure.
  • Implement platform-wide performance tuning (e.g. container resource limits, distributed locks, caching strategies, and GC configurations).
  • Ensure platform security and compliance by implementing best practices for secrets management, network segmentation, and vulnerability scanning.

Technical Leadership

  • Own major infrastructure roadmap items, including multi-region deployments, disaster recovery planning, advanced tenancy separation, and ephemeral preview environments.
  • Champion DevOps and SRE principles across the engineering team, mentoring engineers on cloud-native best practices, operational readiness, and debugging complex distributed systems.
  • Collaborate with product and engineering teams to define the long-term vision for the platform's architecture and operational model.

Requirements

  • 5+ years of experience in building large-scale cloud-native infrastructure.
  • Deep expertise with GCP and Kubernetes, especially GKE.
  • Proficient with IaC tools like Terraform.

  • Design and manage infrastructure on Google Cloud Platform.
  • Build resilient systems using Kubernetes and related technologies.
  • Automate the infrastructure life cycle using Terraform.

Skills

Cloud-native infrastructure design
DevOps/SRE practices
Kubernetes administration
Infrastructure as Code (IaC)
Linux internals
Networking expertise

  • Have five or more years of experience building and operating large-scale, commercial cloud-native infrastructure, with a strong focus on DevOps/SRE practices.
  • Possess deep, hands-on expertise with GCP (or AWS/Azure) and Kubernetes administration and operations (GKE experience is a strong plus).
  • Are proficient with infrastructure-as-code (IaC) tools, particularly Terraform, for managing complex environments.
  • Have a solid understanding of Linux internals, networking (CNI and service mesh), security, and distributed system design.
  • Are familiar with CI/CD tools, GitOps (e.g. Flux), monitoring stacks (Prometheus/Grafana), and logging systems.
  • Thrive in cross-functional teams and excel at communicating complex infrastructure ideas clearly.

About the company

JetBrains is seeking an experienced Infrastructure Engineer to design and maintain our AI-powered platform in Madrid. The role involves managing cloud infrastructure on Google Cloud, implementing SRE practices, and leveraging Kubernetes. Ideal candidates will have over five years of cloud-native infrastructure experience, strong knowledge of IaC tools, and a robust understanding of distributed systems. Join us to foster an inclusive workplace while shaping the future of web applications.
