Principal Cloud Operations Engineer

Extreme Networks
San Jose, United States of America
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 225K

Job location

San Jose, United States of America

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Computing
Cloud Computing Security
Cloud Engineering
Linux
DevOps
Elasticsearch
Middleware
Monitoring of Systems
PostgreSQL
Nagios
Networking Basics
NoSQL
RabbitMQ
Redis
Cloud Services
Prometheus
SQL Databases
Google Cloud Platform
Grafana
Reliability of Systems
Kubernetes
Information Technology
Apache Flink
Deployment Automation
Kafka
Terraform
Docker
Microservices

Job description

Extreme's Cloud Operations team is a group of talented engineers passionate about building highly reliable, scalable and secure solutions in public/private cloud environments. We are looking to hire highly motivated Cloud Operations engineers with strong work experience in production operation, as well as cloud infrastructure design and implementation. Together, we will design, develop and implement the best public / private / local cloud solutions for our customers. This position is responsible for leading cloud infrastructure implementation initiatives and providing technical leadership in cloud architecture, operational excellence, and cost optimization. The role stays current with industry trends and best practices and leverages AI and cloud service provider platforms (AWS, Google Cloud, and Azure) to enhance operational efficiency, reliability, security, and scalability. Extreme Networks is the right place to be and now is the right time to join us and be part of our spectacular, * Provide technical leadership in cloud architecture, operational excellence, reliability, and cost optimization across large-scale production environments.

  • Stay current with industry trends and best practices, and leverage AI technologies and cloud service provider platforms (AWS, Google Cloud, and Azure) to improve operational efficiency, scalability, security, and resiliency.
  • Design and ensure secure, reliable, and high-performance communication across multiple regions and cloud service providers.
  • Configure, tune, and operate middleware services, including SQL and NoSQL databases, messaging and streaming platforms, and related infrastructure components.
  • Evaluate, recommend, and lead the adoption of CloudOps and DevOps tools, platforms, and automation solutions.
  • Troubleshoot complex production infrastructure and application issues, providing deep technical expertise and hands-on support when required.
  • Drive root cause analysis (RCA), implement corrective actions, and establish preventive measures to avoid recurrence.
  • Collaborate closely with engineering cloud architects in system design discussions, architecture reviews, and whiteboard sessions.
  • Partner with Development, QA, SRE, and external service providers or carriers to resolve issues and improve system reliability.
  • Design, implement, and evolve deployment automation platforms for Kubernetes-based microservices.
  • Improve service availability, performance, and scalability through automation, tooling, capacity planning, and process improvements.
  • Analyze system and service performance, identify bottlenecks, and deliver actionable recommendations to improve efficiency and resilience.

Requirements

growth and success. We're looking for the best and the brightest 'A' players who want to make a difference doing a job they love., * BS level technical degree required; Computer Science or Engineering background preferred.

  • 8+ years of experience in a CloudOps / DevOps role.
  • Hands on experience with AWS or any public cloud (Azure, Google Cloud Platform etc.).
  • Knowledge of Linux, security and networking fundamentals.
  • Working knowledge of container-based architecture and deployment (Docker, Kubernetes.)
  • Working knowledge of deployment automation development (Terraform, Helm, ArgoCD).
  • Experience in diagnosing and resolving complex application problems.
  • Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Flink, Kafka, and RabbitMQ.
  • Experience with monitoring tools (Nagios, Grafana, Prometheus)
  • Experience with cloud security and compliance implementation is a plus.
  • Strong follow-through and initiative to stay with issues until they are resolved.
  • Comfortable working within a distributed team located in multiple time zones.

Benefits & conditions

Compensation: Salary based on region, qualifications and experience up to $180,000 - 225,000

About the company

Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions. They rely on our top-rated services and support to accelerate their digital transformation efforts and deliver unprecedented progress. With double-digit growth year over year, no provider is better positioned to deliver scalable outcomes than Extreme. Inclusion is one of our core values and in our DNA. We are committed to fostering an inclusive workplace that embraces our differences and creates an atmosphere where all our employees thrive because of their differences, not in spite of them. Become part of Something big with Extreme! As a global networking leader, learn why there's no better time to join the Extreme team., Extreme Networks, Inc. (EXTR) creates effortless networking experiences that enable all of us to advance. We push the boundaries of technology leveraging the powers of machine learning, artificial intelligence, analytics, and automation. Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions and rely on our top-rated services and support to accelerate their digital transformation efforts and deliver progress like never before. For more information, visit Extreme's website or follow us on Twitter, LinkedIn, and Facebook.

Apply for this position