Kubernetes Engineer
OpenKyber LLC
yesterday
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Job location
Tech stack
Distributed Data Store
Linux System Administration
Prometheus
Ceph
Delivery Pipeline
Grafana
Kubernetes
Deployment Automation
Job description
- Manage and support 6+ on-prem Kubernetes clusters across multiple sites.
- Maintain platform reliability, availability, and operational consistency across all environments.
- Monitor and troubleshoot issues involving cluster health, workloads, nodes, networking, ingress, storage, and supporting platform services.
- Support cluster lifecycle activities, including provisioning, upgrades, patching, scaling, and general maintenance.
- Improve observability, monitoring, and alerting using tools such as Prometheus and Grafana.
- Support GitOps and deployment workflows using ArgoCD.
- Help manage and support container image workflows and registry integrations through Harbor.
- Work with MAAS, Juju, and Charmed Kubernetes to support cluster and infrastructure lifecycle management in an on-prem environment.
- Support and improve persistent storage capabilities leveraging Ceph and related storage integrations.
- Drive platform enhancements that improve resiliency, scalability, security, automation, and supportability across all sites.
- Standardize operational processes, cluster configurations, and support models where possible.
- Participate in incident response, root cause analysis, and long-term reliability improvements.
- Create and maintain documentation for architecture, operational procedures, and best practices.
Requirements
- Strong experience administering Kubernetes in production environments.
- Experience supporting multiple Kubernetes clusters across distributed or multi-site environments.
- Strong Linux systems administration and infrastructure troubleshooting skills.
- Experience with Charmed Kubernetes, Juju, and MAAS in an on-prem environment.
- Experience with ArgoCD or similar GitOps deployment tools.
- Experience with Prometheus and Grafana for monitoring, alerting, and platform observability.
- Experience with Harbor or similar container registries.
- Experience with Ceph or similar distributed storage platforms supporting Kubernetes workloads.
- Solid understanding of Kubernetes networking, ingress, storage, and cluster operations.
- Experience improving reliability, automation, and operational maturity in production platforms.
- Strong collaboration skills across infrastructure, platform, and application teams.