Cloud Services-Platform Engineering-Public Cloud
Role details
Job location
Tech stack
Job description
This role is responsible for overseeing complex Azure-based operations, driving improvements in cloud service delivery, and ensuring alignment with client service level agreements. The individual leads technical teams, implements automation, and fosters innovation in cloud operations to enhance organizational performance and reliability. (1.) Key Responsibilities
- Lead Azure IaaS/PaaS operations by architecting, deploying, and managing virtual machines, networking, and storage solutions to ensure high availability and performance.
- Implement and optimize Infrastructure as Code using Terraform and Azure Resource Manager templates to automate provisioning, configuration, and lifecycle management of cloud resources.
- Oversee Azure Kubernetes Service (AKS) clusters, ensuring secure deployment, scaling, and monitoring of containerized applications in production environments.
- Drive continuous improvement in operational processes by analyzing incident trends, implementing automation, and refining monitoring with Azure Monitor and Log Analytics.
- Mentor and empower the support team, providing expert technical leadership in troubleshooting complex production issues and ensuring adherence to best practices.
- Ensure compliance with security and governance standards by applying Azure Policy, RBAC, and cost management tools to maintain regulatory and budgetary requirements.
- Collaborate with stakeholders to translate business needs into robust cloud solutions, ensuring alignment with SLAs and client expectations.
- Foster innovation by evaluating and integrating new Azure services and operational tools to enhance reliability, scalability, and efficiency., The role focuses on ensuring the reliability, security, and performance of Microsoft-based server environments within a dynamic IT infrastructure. The position involves both day-to-day operational management and participation in strategic projects, such as migrations and deployments. The successful candidate will be responsible for maintaining Windows Server systems, managing virtualization platforms, and safeguarding data integrity through robust backup solutions. Collaboration with various IT teams is essential to deliver consistent, high-quality services and to adapt to the evolving technology landscape., Install, configure, and administrate Windows Server operating systems and associated services (AD, DNS, DHCP, GPO, etc.).
Monitor and maintain the operational condition of physical and virtual servers (VMware, Hyper-V, or equivalents).
Manage accounts and access rights via Active Directory and other directory services.
Implement and monitor security policies (patch management, antivirus, system hardening, backups).
Proactively monitor system performance and availability (using monitoring tools).
Handle level 2/3 incidents and support requests, in collaboration with local support and other technical teams.
Participate in infrastructure evolution projects, including migrations, new solution deployments, and automation (PowerShell scripts, deployment tools).
Document procedures, configurations, and interventions to ensure knowledge retention.
Collaborate with network, security, and application teams to ensure consistency and quality across the IT system.
Contribute to technology watch to anticipate changes in the Microsoft ecosystem and propose improvements.
Requirements
- Advanced Proficiency In Infrastructure As Code Using Terraform And Arm Templates For Azure Provisioning.
- Excellent Skills In Azure Kubernetes Service (Aks) Deployment, Scaling, And Management.
- Strong Expertise In Azure Monitoring, Logging, And Diagnostics Tools.
- Advanced Understanding Of Azure Security Controls, Rbac, And Policy Management.
- Excellent Troubleshooting And Incident Management Abilities In Complex, MultiTenant Azure Envi, Windows Server (recent versions + migrations)
Active Directory (users, GPO, authentication, trusts)
Related network services (DNS, DHCP, DFS, WSUS)
Virtualization & Cloud/Hybrid Environments:
Experience with VMware, Hyper-V
Ideally, knowledge of Microsoft Azure (IaaS/PaaS)
Security & Backup Management:
Patch management, antivirus/EDR, system hardening strategies
Backup/restore tools (Veeam, Commvault, etc.)
Appreciated Skills
Scripting & Automation:
PowerShell
Deployment tools (SCCM/Intune)
Monitoring & Operations:
Experience with monitoring tools
Ability to analyze and resolve performance issues
Support & Incident Management:
Level 2/3 ticket resolution
Advanced diagnostics (logs, BSOD, perfmon, event viewer)
Documentation & Procedures:
Experience documenting configurations, operational procedures, and support guidelines
Cross-Team Collaboration:
Working with network, security, and application teams
Experience in ITIL environments (incident and change management)
Technology Watch & Evolution:
Participation in migration/modernization projects
Understanding of Agile team methodologies