Site Reliability Engineer I

Backblaze, Inc.

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Experience level

Intermediate

Compensation

$ 88K

Job location

Remote

Tech stack

Linux

Network Topologies

Networking Cables

Ansible

Server Farms

Sysadmin

Job description

We are looking for a Site Reliability Engineer I to help support the stability, health, and day-to-day operations of Backblaze's infrastructure. This role serves as a first line of response for customer-affecting issues and production alerts, helping drive timely incident resolution, maintain service reliability, and support operational readiness across our environments. You will work closely with TechOps, Data Center Technicians, and other cross-functional teams to troubleshoot issues, monitor system health, support deployments and migrations, and improve day-to-day operational processes through documentation and automation. The ideal candidate is technically curious, calm under pressure, eager to learn, and excited to grow in a hands-on infrastructure and reliability role.

What You'll Do:

Act as first point of contact for all customer affecting issues
Be a Key Driver for managing the resolution of technical problems
Ensure that incident management processes are following and that incident post-mortems are completed to capture process deviations and areas for improvement
Deliver consistent communication to Management
Respond to zabbix alerts/regular monitoring of zabbix, either by taking direct action on alerts or escalating. Acknowledge every alert if direct action taken, or with escalation point of contact.
Make sure escalations are handed off successfully.
Ensure health of pods across all sites (define pod alerts on zabbix).
Work through daily filesystem checks for pods.
Troubleshoot technical issues for DC Techs -> advanced pod questions, deployment questions, migration troubleshooting, and ansible playbook issues.
Identification and escalating any potential issues regarding the network.
Vault pre-deployment configuration and testing.
Start Vault Migrations, monitor migration pods, handle applicable migration pod health checks.
Document/Work on automating Daily Items.
Document/Provide Network IP's for upcoming deployments.
Monitor Releases/Updates to the Server Farm, escalate issues as they arise.
Engaging in on-call rotation shifts.
Assist fellow TechOps team members in handling tasks.
Making recommendations for improvements in organizational productivity.
Be able to work outside of normal business hours(weekend shift, holidays & evenings) as needed

Requirements

Must be located in Bangalore.
2 - 4 years of relevant experience.
Knowledge of Sysadmin and Linux skills.
Desire to learn and develop all necessary technical skills.
Strong analytical thinking.
Strong skills in working with different teams and communication.
Knowledge of network cabling, network classification, and network topology.

Benefits & conditions

Annual Company bonus plan
Healthcare for family, including dental and vision
401K
ESPP program
Flexible vacation policy
Maternity & paternity leave
MacBook Pro for work plus a generous stipend to personalize your workstation
Childcare bonus (human children only)
Fertility treatment and support
Learning & development program
Commuter benefits
A culture that supports a healthy work-life balance

To provide greater transparency to candidates, we share base pay ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar-stage growth companies. Final offer amounts are determined by multiple factors, including candidate location, skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The expected salary range for this role is $66,000 - $88,000.

About the company

About Backblaze Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we're helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the full power of the open cloud in their hands. Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we did a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $100m in revenue and is the leading specialized storage cloud - managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals. But while there is a lot to celebrate in our past, there is almost as much opportunity ahead of us. We are seeking a Site Reliability Engineer I to join our team!, At Backblaze, we value being fair and good to our customers, partners, and employees. That's why diversity, equity, and inclusion are at the core of our values. We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, and education. We believe that our dedication to cultivating a diverse workspace not only allows us to better serve our customers in over 175 countries but further reinforces our commitment to doing the right thing. We are proud to be an Equal Opportunity Employer.