Kubernetes Engineer

OpenKyber LLC
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Tech stack

Distributed Data Store
Linux System Administration
Prometheus
Ceph
Delivery Pipeline
Grafana
Kubernetes
Deployment Automation

Job description

  • Manage and support 6+ on-prem Kubernetes clusters across multiple sites.
  • Maintain platform reliability, availability, and operational consistency across all environments.
  • Monitor and troubleshoot issues involving cluster health, workloads, nodes, networking, ingress, storage, and supporting platform services.
  • Support cluster lifecycle activities including provisioning, upgrades, patching, scaling, and general maintenance
  • Improve observability, monitoring, and alerting using tools such as Prometheus and Grafana.
  • Support GitOps and deployment workflows using ArgoCD.
  • Help manage and support container image workflows and registry integrations through Harbor.
  • Work with MAAS, Juju, and Charmed Kubernetes to support cluster and infrastructure lifecycle management in an on-prem environment.
  • Support and improve persistent storage capabilities leveraging Ceph and related storage integrations.
  • Drive platform enhancements that improve resiliency, scalability, security, automation, and supportability across all sites
  • Standardize operational processes, cluster configurations, and support models where possible.
  • Participate in incident response, root cause analysis, and long-term reliability improvements.
  • Create and maintain documentation for architecture, operational procedures, and best practices.

Requirements

  • Strong experience administering Kubernetes in production environments.
  • Experience supporting multiple Kubernetes clusters across distributed or multi-site environments.
  • Strong Linux systems administration and infrastructure troubleshooting skills.
  • Experience with Charmed Kubernetes, Juju, and MAAS in an on-prem environment.
  • Experience with ArgoCD or similar GitOps deployment tools.
  • Experience with Prometheus and Grafana for monitoring, alerting, and platform observability.
  • Experience with Harbor or similar container registries.
  • Experience with Ceph or similar distributed storage platforms supporting Kubernetes workloads.
  • Solid understanding of Kubernetes networking, ingress, storage, and cluster operations.
  • Experience improving reliability, automation, and operational maturity in production platforms.
  • Strong collaboration skills across infrastructure, platform, and application teams.

Apply for this position