DevOps Lead

Valsoft Corporation
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Tech stack

Java
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Azure
Bash
Cloud Computing
Configuration Management
Computer Security
Computer Programming
Continuous Integration
DevOps
Distributed Systems
Python
Node.js
Ansible
Prometheus
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Grafana
Reliability of Systems
Cloudformation
Gitlab-ci
Kubernetes
Nintex
Machine Learning Operations
Terraform
Devsecops
Docker
Jenkins
Go

Job description

The DevOps Lead is responsible for defining the strategy, governance, and execution of all DevOps practices within the organization. This role combines technical leadership, operational excellence, and strategic vision, ensuring scalability, automation, and reliability across the entire software delivery lifecycle. The DevOps Lead will lead a multidisciplinary team and collaborate closely with Development, Operations, and AI Engineering groups to drive modernization, automation, and innovation across cloud and on-premise environments., Leadership & Strategy

  • Define and implement the DevOps strategy, ensuring alignment with business and technology goals.
  • Lead, mentor, and grow the DevOps engineering team, promoting a culture of automation, continuous improvement, and operational excellence.
  • Define best practices and governance for CI/CD, cloud infrastructure, observability, and security.
  • Coordinate with senior management and project stakeholders to ensure DevOps initiatives contribute to the broader digital transformation strategy.

Automation & Delivery

  • Oversee the design and optimization of CI/CD pipelines for continuous integration, testing, and deployment across multiple products and platforms.
  • Implement Infrastructure-as-Code for standardized, reproducible environments.
  • Drive automation across configuration management, testing, and monitoring.

Cloud & Platform Engineering

  • Supervise cloud operations ensuring reliability, scalability, and cost efficiency.
  • Guide the transition toward cloud-native architectures and container orchestration (Kubernetes, Docker).
  • Ensure the integration of AI/ML workloads (MLOps) within the company's infrastructure, supporting model deployment and lifecycle management., * Define and maintain SLAs/SLOs, ensuring optimal system reliability and uptime.
  • Implement monitoring, logging, and observability frameworks (Prometheus, Grafana, ELK) in collaboration with Operations.
  • Lead incident management and root-cause analysis, ensuring post-mortem documentation and continuous improvement.

Security & Compliance

  • Enforce DevSecOps principles, integrating security checks within CI/CD and IaC pipelines.
  • Collaborate with IT Security to ensure compliance with industry standards (ISO 27001, SOCx, GDPR, etc.).

Requirements

Do you have experience in Terraform?, Do you have a Bachelor's degree?, * Proven experience (5+ years) in DevOps or Site Reliability Engineering roles, including leadership or senior responsibilities.

  • Strong technical background in (or similar):
  • Automation tools: Jenkins, GitLab CI/CD, Ansible, n8n, Airflow.
  • Containers & Orchestration: Docker, Kubernetes, Nomad, Consul.
  • Cloud platforms: AWS, Azure, GCP (at least one required).
  • IaC: Terraform, CloudFormation.
  • Monitoring & Logging: Prometheus, Grafana, ELK.
  • Programming/Scripting: Python, Bash, Go, Java or Node.js.
  • Solid understanding of networking, system design, and distributed architectures., * Strong leadership, team-building, and mentoring abilities.
  • Excellent communication and collaboration across technical and non-technical teams.
  • Analytical mindset and problem-solving skills in complex environments.
  • Strategic thinking and decision-making capability.
  • Proactive attitude and passion for innovation in cloud, automation, and AI., * Experience in multi-cloud strategies or hybrid environments.
  • Knowledge of AI/ML platform operations.

Apply for this position