Senior DevOps & Site Reliability Engineer - Americas

Appspace Inc.
Tampa, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$100K

Job location

Remote
Tampa, United States of America

Tech stack

Microsoft Windows
Artificial Intelligence
Confluence
JIRA
Azure
Code Review
Databases
Continuous Integration
DevOps
Middleware
GitHub
Infrastructure as a Service (IaaS)
Python
Linux Servers
Log Analysis
Microsoft SQL Server
MongoDB
MySQL
Platform as a Service (PaaS)
PowerShell
RabbitMQ
Reliability Engineering
Google Cloud Platform
Cloud Platform System
Database Optimization
Kubernetes
Atlassian Tools
Bicep
Cosmos DB
Bitbucket
Data Management
Terraform
Bamboo

Job description

In this role, you will be the technical anchor for a global platform footprint that includes a mix of Azure IaaS/PaaS, Google Cloud Platform (GCP), Kubernetes, and various data platforms. Your day will consist of:

  • Intelligent Automation & DevOps: Identifying manual "toil" and replacing it with automated workflows for monitoring, change management, and routine administration of large-scale VM environments to ensure a positive ROI.
  • AI-Enhanced Operations: Leading the integration of AI tools for automated code reviews, development frameworks, and predictive log analysis to drive departmental velocity and efficiency.
  • Scalable CI/CD & Provisioning: Designing and maintaining "self-service" deployment frameworks and CI/CD pipelines (GitHub Actions, Bamboo) using Infrastructure as Code (Bicep, Terraform).
  • Strategic ROI Projects: Evaluating platform components to determine the most cost-effective path: automating the current state or migrating features to modern, shared architectures.
  • Unified Observability: Designing and maintaining a comprehensive observability stack across Azure and GCP (metrics, logs, traces) to identify performance bottlenecks and proactively address system defects.
  • Cross-Functional Collaboration: Partnering with engineering, security, and operations teams to ensure new features are "born" with reliability, security, and automated delivery in mind, and ensuring adherence to security best practices and compliance standards (SOC2, HIPAA, ISO 27001) while maintaining operational excellence and cost efficiency.
  • Root Cause Analysis & Forensics: Investigating complex performance defects by following log trails across web, application, and database tiers (SQL Server, MongoDB, MySQL).
  • Governance & Security: Ensuring all platforms meet security standards (SOC2, HIPAA, ISO 27001) through automated policy enforcement across Azure and GCP.

Requirements

  • Must have a passion for lifelong learning.
  • 6+ years in DevOps or SRE roles, with a proven track record of bridging development and operations in complex cloud environments.
  • Extensive experience with Microsoft Azure (IaaS, PaaS, App Services, Networking) and/or Google Cloud Platform (GCP).
  • Expert-level PowerShell and Python skills. Hands-on experience with Bicep or Terraform is required.
  • Strong background in Windows/Linux Server OS, Kubernetes (AKS/GKE), Helm, and container orchestration.
  • Familiarity with various middleware and PaaS technologies (e.g. Event Hub, Service Bus, Cosmos DB, RabbitMQ, MongoDB).
  • Expert-level troubleshooting and the ability to reason through complex process workflows to identify faults in large-scale platform environments.

Nice to Haves:

  • Experience with Atlassian suite (Jira, Confluence, Bitbucket).
  • Experience with AI-driven log analysis or automated incident remediation.
  • Knowledge of database tuning (SQL Server, MySQL, MongoDB).
  • Familiarity with compliance standards (SOC2, HIPAA, GDPR).

Benefits & conditions

For all our US-based team members, we offer a variety of benefits, including competitive salaries, medical, dental, and vision coverage, disability coverage, employer-paid life insurance, mental health resources, a 401(k) plan, and a fully paid parental leave program.

Additional perks include:

  • Generous PTO
  • Flexible work schedules
  • Remote work opportunities
  • Paid company holidays
  • Appspace Quiet Fridays (No non-essential internal meetings scheduled)
  • A casual dress work environment

Disclaimer:

Appspace is committed to equitable compensation practices and complies with all applicable local, state, and federal regulations. For jurisdictions that require pay scale disclosure, a general compensation range may be provided during the initial stages of the interview process. Final compensation will be based on multiple factors including experience, skills, certifications, and overall fit for the role.
