DevOps Engineer
Role details
Job location
Tech stack
Job description
We are seeking a high-level DevOps Platform Engineer to lead the evolution of our Multi-Cloud Platform. This role is dedicated to supporting Global Solutions Consultants and Enablement teams by leveraging Artificial Intelligence (AI) and Public Cloud Service Providers (CSPs), focusing on creating a cloud-native, intelligent, and hyper-scalable ecosystem-primarily centered on Google Cloud Platform (GCP)-that eliminates manual overhead and utilizes AIOps to maintain a world-class training environment. This role offers an exciting opportunity for professional development and career advancement as you enhance the team's understanding of cloud platform features and best practices., * Architectural AI Integration: Design and implement AI-driven workflows using Google Vertex AI and LLMs to automate complex environment staging, documentation generation, and user support.
- Multi-Cloud Ecosystem Leadership: Drive the strategy and management of production environments across GCP, AWS, and Azure, ensuring architectural consistency and cross-cloud resilience.
- AIOps & Predictive Maintenance: Build self-healing infrastructure that utilizes machine learning to analyze telemetry data, predicting and remediating failures before they impact the user experience.
- Advanced CI/CD & GitOps: Develop sophisticated pipelines that treat infrastructure as a living software product, incorporating automated security gates and AI-assisted code reviews.
- Cloud-Native Governance: Oversee multi-tenant cloud environments with a focus on Zero Trust IAM, global security policy enforcement, and AI-optimized cost management.
Requirements
Your Experience
- Solid understanding of LLMOps and AI automation pipelines. You have a track record of integrating artificial intelligence APIs like Google Vertex AI or OpenAI directly into production DevOps workflows, managing complex prompt structures, and assisting with model adjustments.
- High-level scripting capability for custom tools. You possess a background utilizing Python or Go to construct specialized automation agents, intelligent command-line interfaces, and custom operational tools.
- Solid understanding of data science principles and analytics. You leverage cloud analytics frameworks like BigQuery to collect, structure, and refine infrastructure telemetry data for machine learning models.
- Solid understanding of cloud administration across public providers. You bring high-level experience managing environments within Google Cloud Platform, specifically with GKE, Cloud Run, and VPC Service Controls, as well as managing enterprise workloads across AWS and Azure.
- High-level networking and infrastructure design skills. Your experience covers a strong grasp of global load balancing configurations, Cloud Armor, cloud interconnects, and cross-cloud VPN architectures to ensure platform stability and security.
- Solid understanding of Infrastructure as Code frameworks. You are proficient in leveraging automation tools such as Terraform or Ansible to build, maintain, and manage scalable cloud infrastructure setups.
- High-level diagnostic and structural problem-solving abilities. You bring a strong capacity for deep-stack troubleshooting across complex environments to identify systemic platform issues and rapidly establish operational guardrails.
- Solid communication and cross-functional collaboration skills. You are experienced at translating technical platform metrics into strategic value for leadership, leading formal root-cause analyses, and documenting designs into clear Standard Operating Procedures, alongside an understanding of industry-standard project management frameworks to utilize tools like Jira and Confluence for tracking technical tasks and prioritizing platform development effectively.
Preferred Skills:
- Plus factors for this role include experience integrating advanced progressive delivery models, such as metrics-driven canary deployments, natively within container orchestration clusters.
- Plus factors for this role include a background in developing policy-as-code frameworks to implement zero-trust compliance standards without introducing development friction.
- Plus factors for this role include relevant industry cloud certifications across GCP, AWS, or Azure, or specialized automation designations.
Benefits & conditions
The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/com-missioned roles) is expected to be the annual range listed below. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here (https://benefits.paloaltonetworks.com/) .
$160,000.00 - $220,000.00/yr
Our Commitment
We're trailblazers that dream big, take risks, and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating, together.