SRE-Intl- LATAM
Role details
Job location
Tech stack
Job description
Insight Global is seeking a Site Reliability Engineer on the Microservices Framework team, you will help design, build, and operate the foundational services that power DocuSign's global, cloud-scale platform. You'll focus on developing highly scalable, reliable, and resilient distributed systems that support rapid business growth while maintaining world-class availability and performance.
This role blends software engineering, infrastructure automation, and reliability engineering, with a strong emphasis on building reusable microservices frameworks, cloud-native data platforms, and automated delivery pipelines.
What You'll Do:
-Engineer for Scale & Reliability: Design and build highly available, fault-tolerant architectures for large-scale distributed systems.
-Build Microservices Frameworks: Develop and maintain shared frameworks and platform services that enable teams to build, deploy, and operate microservices efficiently.
-Write Production-Quality Code: Deliver testable, high-quality, and ship-ready code with strong test coverage.
-Architecture & Design: Partner with Product Management and engineering teams to translate requirements into scalable technical designs and architectures.
CI/CD Automation: Design, build, and maintain CI/CD pipelines to automate application builds, testing, and deployments.
-Infrastructure as Code: Use Terraform to provision, deploy, and maintain cloud infrastructure and data platform components.
-Cloud Optimization: Implement best practices for cloud security, performance, reliability, and cost optimization.
Cross-Functional Collaboration: Work closely with engineers, platform teams, and stakeholders across regions to define and execute cloud strategies.
-Operational Ownership: Participate in on-call rotations, contribute to incident response, and drive continuous reliability improvements.
-Data Platform Engineering: Design, build, and maintain robust, cloud-native data platform solutions using modern tools.
PR: $28-35/hr
Requirements
5+ years of hands-on software development experience in object-oriented languages such as C#, Java, or C+-5+ years of experience deploying and operating applications in cloud environments, using scripting and configuration tools
-Strong experience with system design, API development, and distributed systems
-Proven experience designing and operating CI/CD pipelines for automated build, test, and deployment workflows
-Hands-on experience managing infrastructure and data platforms using Terraform (Infrastructure as Code)
-Strong understanding of reliability engineering concepts (availability, latency, fault tolerance, observability) -Experience building or operating microservices platforms or shared service frameworks
-Strong background in SRE practices, including monitoring, alerting, incident response, and postmortems
-Experience with cloud-native data platforms and distributed storage systems
-Familiarity with DevOps and Agile development practices
-Experience working in global, cross-site engineering teams
-Knowledge of cloud cost optimization strategies and performance tuning
-Ability to influence platform standards and mentor other engineers