Platform Infrastructure Engineer (Containers)

Menlo Security
Argençola, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
€ 70K

Job location

Argençola, Spain

Tech stack

Amazon Web Services (AWS)
Bash
Cloud Computing
Configuration Management
Collaborative Software
Communications Protocols
Computer Programming
Continuous Integration
Disaster Recovery
DNS
Network Topologies
Identity and Access Management
Python
Network Protocols
Software Architecture
Role-Based Access Control
Prometheus
Software Engineering
TCP/IP
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Spring Cloud
Istio
Large Language Models
Grafana
Amazon Web Services (AWS)
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Terraform
Go
Programming Languages

Job description

Menlo Security is hiring for a Platform Infrastructure Engineering position in Catalunya, Spain. You will be responsible for the design and maintenance of infrastructure on GCP and AWS, with a strong focus on Kubernetes and Terraform for resource management. Our ideal candidate has a Bachelor's degree in Computer Science, proficiency in modern programming languages like Python, and a solid understanding of network protocols. Join us to enhance our cloud infrastructure and enable client security without compromise., * Design, deploy, and maintain VM and Kubernetes infrastructure on GCP and AWS.

  • Build and maintain Infrastructure as Code (IaC) using Terraform.

  • Implement workflows with multi-layer configuration management.

  • Build observability solutions using Grafana Cloud and Prometheus.

  • Manage certificate lifecycle, DNS automation, and service mesh networking.

  • Collaborate with teams for architectural decisions and capacity planning.

  • Automate tasks to improve operational efficiency.

  • Participate in a 24x7 on-call rotation., Platform Infrastructure Engineering is responsible for building and operating Menlo Security's Infrastructure Platform. Together with the rest of our engineering teams, we enable our customers to connect to the Internet without compromise. Our environment provides services globally. We expect failure, build security in by design, create evolvable systems, and enable multi-tenancy across the infrastructure. Automation is an absolute for us. We are committed to getting it done properly, the first time. Responsibilities

  • Design, deploy, and maintain VM and Kubernetes infrastructure on GCP and AWS across dozens of clusters spanning development, staging, and production environments in multiple regions.

  • Coordinate with your peers in your direct team as well as across teams to ensure that the tasks you're working on are going to solve the problems that we need them to solve.

  • Build and maintain Infrastructure as Code (IaC) using Terraform modules, managing resources through Spacelift or equivalent Terraform Automation and Collaboration Software (TACOS). Provision cloud infrastructure including networking, compute, storage, and security components primarily on GCP, with secondary AWS support.

  • Implement and manage workflows with sophisticated multi-layer configuration management.

  • Build and maintain comprehensive observability solutions using Grafana Cloud, Prometheus/Mimir, and OTel collectors. Design Grafana dashboards, configure alerting rules, and ensure visibility across all platform components.

  • Manage certificate lifecycle, DNS automation, ingress controllers, and service mesh networking with Cilium.

  • Partner with Engineering, Product, Compliance, and Security teams to design resilient, scalable systems. Consult on capacity planning, disaster recovery, and architectural decisions for cloud-native applications.

  • Identify and eliminate toil through automation. Write scripts, develop tools, and build CI/CD pipelines to improve operational efficiency and reduce manual work.

  • Participate in a 24x7 on-call rotation as part of a globally distributed team, responding to incidents and driving post-incident reviews.

Requirements

  • Bachelor's degree in Computer Science, similar technical field, or equivalent.
  • Proficient in common programming & scripting languages such as Python, Bash, Go.
  • Strong understanding of network topologies and communication protocols., Proficiency in programming and scripting languages (Python, Bash, Go) Understanding of network protocols (TCP/IP, HTTP/S, UDP, TLS) Kubernetes expertise Experience with Terraform Knowledge of Google Cloud Platform services Experience with GitOps methodologies Automation and CI/CD skills

Formação académica

Bachelor's degree in Computer Science or equivalent, * Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience.

  • Proficiency in common programming & scripting languages. We use a lot of python, bash and go.
  • Understanding of network topologies, communication protocols (ie. TCP/IP, HTTP/S, UDP, TLS) and enterprise grade connectivity solutions.
  • Kubernetes expertise including cluster administration, RBAC, networking, workload management, and troubleshooting across production environments.
  • Proven experience with Terraform for infrastructure provisioning and management.
  • Knowledge of Google Cloud Platform services including GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.
  • Experience with GitOps methodologies and tools.
  • Clear understanding of how to use LLM code assist tools to effectively build software.

Apply for this position