PaaS SRE Engineer (North America)

IntelliPro Group Inc.
Palo Alto, United States of America
26 days ago

Role details

Contract type: Temporary to permanent
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Compensation: $187K

Job location

Palo Alto, United States of America

Tech stack

Java
API
Amazon Web Services (AWS)
Azure
Cloud Computing
Computer Programming
Databases
Continuous Integration
Linux
Distributed Systems
Elasticsearch
Python
MySQL
Nginx
Platform as a Service (PaaS)
Performance Tuning
Redis
SQL Databases
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Cloud Platform System
Break Fix
Reliability of Systems
Kubernetes
Information Technology
Data Analytics
Kafka

Job description

We are hiring a PaaS SRE Engineer to support cloud platform services across the North America region. "PaaS" (Platform-as-a-Service) refers to internal cloud platform products such as cloud consoles, APIs, and platform-level services that enable developers and enterprises to build and manage applications efficiently. This role focuses heavily on database reliability, optimization, and large-scale enterprise operations, with higher expectations for database expertise than other SRE tracks (e.g., compute or security).

  • Monitor, maintain, and ensure the stability and reliability of PaaS products in the North America region
  • Troubleshoot production issues and customer-reported problems, ensuring timely resolution and service continuity
  • Support product deployment, upgrades, and regional rollout using CI/CD and automation tools
  • Deeply understand cloud platform architecture (console, APIs, platform services) and drive system optimization
  • Improve operational efficiency through automation, intelligent tooling, and data-driven operations
  • Redesign and document operational processes and best practices to improve service quality and delivery
  • Collaborate with cross-functional teams to ensure enterprise-level service reliability and performance

Requirements

  • Bachelor's degree or above in Computer Science or related field
  • 5+ years of experience in system operations, SRE, or infrastructure engineering
  • Hands-on experience operating services on public cloud platforms (AWS, Azure, GCP, etc.)
  • Strong understanding of distributed systems and Kubernetes-based environments

Core Technical Requirements

  • Strong database expertise (high priority for this role):
      • Solid hands-on experience with MySQL (usage + performance optimization required)
      • No need for source code-level understanding, but must be able to troubleshoot and tune performance
      • Experience with other SQL databases is acceptable, with the ability to quickly learn and adapt
  • Strong knowledge of:
      • Linux systems and troubleshooting
      • TCP/IP networking fundamentals
      • Common cloud infrastructure components (Nginx, Redis, Kafka, Elasticsearch, etc.)
  • Proficiency in at least one programming/scripting language:
      • Python, Go, Shell, or Java

Preferred Qualifications

  • Experience supporting enterprise-level production systems (B2B environments)
  • Strong hands-on troubleshooting and incident response experience (on-call awareness preferred)
  • Familiarity with CI/CD pipelines, automation, and operational efficiency improvements
  • Experience improving system reliability through monitoring, alerting, and performance tuning

Soft Skills

  • Strong ownership and service-oriented mindset
  • Ability to work in fast-paced, production-critical environments
  • Excellent communication and cross-team collaboration skills
  • Strong documentation and knowledge-sharing ability
