Site Reliability Enigneer
Role details
Job location
Tech stack
Job description
We're hiring a dedicated SRE to take real ownership of operational excellence across Cloud Infrastructure. Today, too much critical operational knowledge - vendor relationships, cost management, and incident response - lives with one or two people. Your mission is to take genuine ownership of those domains, make them resilient to any single person, and raise the bar on how reliably we run. This is not simply a ticket-queue or keep-the-lights-on role. You'll own domains end to end: understand them deeply, operate them well, and build the automation and tooling that make them boring. We deliberately pair operational and engineering work so the role grows rather than narrows.
What you'll own
- Incident management & operational excellence - take custody of the incident process: on-call quality, response, post-mortems, and driving down incident count, time-to-detect, and time-to-resolve.
- Automation & reliability engineering - automate low-frequency, high-consequence operations (the certificate-renewal class of problem - rare, easy to forget, outage-causing when missed), not just the high-frequency toil. You decide what to automate based on risk and blast radius, not just time saved.
- A platform domain - over time, deep ownership of a domain such as Temporal, observability, or Kubernetes operations, partnering with the engineers building in it.
- Vendor & third-party management - own key external relationships and integrations (e.g. LLM API providers, third-party services), today managed manually and informally. Bring structure, automation, and bus-factor resilience.
- FinOps - own cloud and platform cost visibility and efficiency, and the mechanics of how usage maps to billing.
What success looks like (first 12 months)
- Critical operational knowledge is documented and shared - no single point of failure for vendor, cost, or incident response.
- Measurable reliability gains: fewer SEV1-SEV3 incidents per quarter, faster customer-impact resolution, and a much higher share of incidents caught by monitoring before customers feel them.
- High-risk manual processes are automated and self-documenting., Provide CTI-wide delivery enablement and project management support, coach teams on best practices, improve use of tools (Wrike/JIRA), and support Product & Pricing Salesforce delivery including planning, testing, and coordination. Top Skills: CpqJIRASalesforceWrike Enverus
Owner Relations Agent - 26237
3 Hours Ago In-Office or Remote United States Mid level Mid level Big Data * Information Technology * Software * Analytics * Energy Answer owner relations calls about revenue, land, division orders, JIB, A/R, and A&P. Log and track inquiries in a case system, follow up on unresolved issues, build client relationships, handle difficult interactions professionally, and cross-train to expand skills. Top Skills: MS Office MetLife, Manage large group insurance client relationships with a focus on reporting and metrics. Serve as primary liaison, deliver client reports and insights, lead projects and implementations, drive strategic initiatives, mentor junior staff, and ensure accurate system data and documentation. Top Skills: ExcelMS OfficeMicrosoft Powerpoint
What you need to know about the Colorado Tech Scene
With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.
Key Facts About Colorado Tech
- Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
- Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
- Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
- Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
- Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute
Requirements
- Strong production operations experience on AWS and Kubernetes; comfortable with MongoDB and scripting/automation in Python.
- An operations-and-reliability mindset - you take pride in systems that run quietly - paired with the instinct to engineer the problem away rather than absorb it manually.
- Sound judgement on incidents and risk; calm and clear under pressure.
- Influences through relationships and evidence, not escalation; comfortable owning a domain and partnering across teams.
- Bonus: vendor/cost management exposure, Temporal, observability tooling.