Site Reliability Engineer
Job description
We are seeking a skilled Senior Site Reliability Engineer (SRE) with strong experience in Snowflake to support, optimize, and maintain our data platform. The ideal candidate will ensure the reliability, performance, scalability, and operational efficiency of Snowflake-based data systems while partnering closely with Data Engineering, DevOps, and Security teams.
A Typical Day
Snowflake Platform Operations
- Manage and administer Snowflake environments (warehouses, databases, schemas, roles).
- Monitor system performance, query efficiency, and warehouse utilization.
- Optimize storage and compute costs through proactive analysis and tuning.
- Maintain and support Snowflake tasks, streams, pipes, and Snowpipe ingestion processes.
Reliability & Monitoring
- Implement monitoring, alerting, and dashboards for Snowflake performance and health.
- Troubleshoot data platform issues, latency problems, and ingestion failures.
- Ensure high availability, reliability, and SLA adherence for Snowflake workloads.
Automation & DevOps
- Automate Snowflake deployments using Terraform, dbt, or similar tools.
- Build and maintain CI/CD pipelines for data platform changes using Kubernetes, ArgoCD, Docker, and Helm.
- Create automation scripts (Python/SQL/Bash) for operational tasks and process improvements.
Data Pipeline Reliability
- Support ETL/ELT pipelines from an infrastructure perspective (Airflow, Kubernetes, ArgoCD).
- Ensure reliable data ingestion from cloud storage and streaming sources (Airflow, Kafka).
- Partner closely with Data Engineering teams to maintain production-grade pipelines.
Cloud Infrastructure & Security
- Work with cloud platforms such as AWS for Snowflake integration.
- Manage IAM roles, access policies, and Snowflake RBAC for secure data access.
- Ensure compliance with data governance, auditing, and security standards.
Incident & Change Management
- Participate in on-call rotations to support data platform incidents.
- Perform root cause analysis (RCA) and implement long-term corrective actions.
- Maintain documentation, runbooks, and SOPs for Snowflake operations.
Bring Your Passion, Do What You Love. Here's What We're Looking For
Requirements
- Bachelor's degree in a related field or equivalent professional experience.
- 5+ years of experience as an SRE, DevOps Engineer, or Data Platform Engineer.
- Hands-on experience with Snowflake administration, performance tuning, and security.
- Strong SQL skills and solid database fundamentals.
- Proficiency in Python scripting for automation.
- Experience with Kubernetes or containerized environments.
- Experience with Infrastructure as Code tools such as Terraform, and with CI/CD pipelines.
- Familiarity with Airflow or similar orchestration tools.
- Experience working with AWS or Azure cloud platforms.
- Hands-on experience deploying Helm charts to Kubernetes, preferably using GitOps tools like ArgoCD.
- Strong understanding of logging, monitoring, and alerting tools such as Datadog, Splunk, or CloudWatch.
Preferred Qualifications
- Snowflake SnowPro certification (Core or Advanced).
- Experience with dbt for data transformations.
- Experience optimizing costs for large-scale Snowflake workloads.