Senior Site Reliability Engineer
Role details
Job location
Tech stack
Job description
As the Senior Site Reliability Engineer (SRE), you will be critical in ensuring the reliability, scalability, and security of LoadUp's next-generation platform-a modern, cloud-native microservices architecture built for enterprise scale. You will work closely with the CTO, Director of Architecture, and engineering teams to build, optimize, and maintain a world-class AWS infrastructure to support our mission-critical, multi-tenant enterprise systems., * Infrastructure Management: Design, provision, and manage a scalable AWS infrastructure supporting our cloud-native microservices platform.
- Reliability & Observability: Build and maintain frameworks for logging, metrics, tracing, and dashboards to ensure deep visibility into system health.
- Incident Leadership: Lead incident management and post-mortem processes, driving root cause analysis and long-term remediation.
- Security & Compliance: Enforce infrastructure security best practices (IAM policies, secrets management, network segmentation) and support SOC 2 requirements.
- Infrastructure as Code (IaC): Develop and maintain IaC using tools such as Terraform or AWS CloudFormation to ensure consistent, repeatable provisioning.
- Database Optimization: Manage and optimize relational databases (PostgreSQL) at scale, including replication, failover, and performance tuning.
- Capacity & Efficiency: Guide architectural reviews for resilience and fault tolerance while driving capacity planning and cloud cost optimization.
Requirements
Do you have experience in Tooling?, * Experience: Bachelor's degree in CS/Engineering OR 8+ years of practical SRE/DevOps experience provisioning large-scale, multi-tenant SaaS applications.
- AWS Mastery: Extensive hands-on experience with services including EC2, ECS/EKS, RDS, S3, CloudWatch, IAM, Route 53, and API Gateway.
- IaC & Automation: Deep, proven experience with Infrastructure as Code (IaC) tools like Terraform.
- Container Orchestration: Hands-on experience with orchestration platforms supporting backend microservices (Java/Spring Boot, Node.js).
- Database Reliability: Strong knowledge of PostgreSQL optimization, connection pooling, and query performance tuning.
- Production Support: Proven experience handling production support, on-call rotations, and running blameless post-mortems., * An Ownership-Driven Engineer who values long-term system reliability and architectural resilience over short-term hotfixes.
- An Elite Communicator capable of translating complex backend infrastructure architecture into clear concepts for non-technical stakeholders.
- A Security Champion who naturally designs with least-privilege access and data isolation patterns in mind.
Benefits & conditions
Pulled from the full job description
- Referral program
- Health insurance
- 401(k) matching
- Paid time off
- Vision insurance
- 401(k) 5% Match
- Health savings account, * Competitive Compensation - Salary that rewards your technical expertise and infrastructure ownership.
- Health & Wellness Coverage - We've got you covered with Medical, Dental, Vision, and Life Insurance.
- Flexible Spending & Savings - Choose between an FSA or HSA to manage your health expenses your way.
- Generous Paid Time Off - Take the time you need to rest, recharge, or explore life outside of work.
- 401(k) with 5% Company Match - Start building your future now-we'll help you save for it.
- The "LoadUp Extras" - Employee Recognition Program, Monthly Lifestyle Stipends, and Referral Bonuses.