Principal Software Engineer (Ruby/Java - Production Engineering)

CBTS Technology Solutions LLC
Cincinnati, United States of America
2 days ago

Role details

Contract type
Temporary contract
Employment type
Part-time (≤ 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 166K

Job location

Remote
Cincinnati, United States of America

Tech stack

Java
API
Amazon Web Services (AWS)
Cloud Computing
Databases
Software Debugging
Distributed Systems
Performance Tuning
Ruby on Rails
Reliability Engineering
Ruby
Runbook
Software Engineering
SQL Databases
Datadog
System Availability
Grafana
Reliability of Systems
Splunk
New Relic (SaaS)
Service Stack
Microservices

Job description

We are seeking an experienced Principal Software Engineer to join a high-impact production engineering team focused on improving platform reliability, scalability, and operational excellence. This is a hands-on engineering role where you'll work closely with Product, SRE, Platform, and Operations teams to troubleshoot complex production issues and implement long-term solutions., * Lead production incident triage across APIs, distributed services, payment workflows, infrastructure, and databases.

  • Investigate and resolve complex production issues across application code, cloud infrastructure, data, and third-party integrations.
  • Partner with engineering teams to identify root causes and implement permanent fixes.
  • Improve system reliability through automation, code enhancements, monitoring, and observability.
  • Develop and enhance:
  • Monitoring and alerting strategies
  • Operational tooling and automation
  • Runbooks and diagnostic workflows
  • Work across a technology stack that includes Ruby on Rails, Java, AWS, APIs, microservices, and SQL databases.
  • Design highly observable, resilient, and maintainable systems.
  • Mentor engineers and promote engineering best practices across development, SRE, and operations teams.

Requirements

  • AWS
  • Ruby (Ruby on Rails) and Java
  • Strong experience debugging production issues end-to-end (code * infrastructure * data * dependencies), * 8+ years of experience in Software Engineering, Production Engineering, Site Reliability Engineering (SRE), or distributed systems.
  • Strong hands-on experience troubleshooting production issues end-to-end.
  • Experience with:
  • Ruby on Rails
  • Java
  • AWS cloud environments
  • APIs and microservices
  • SQL and database investigations
  • Observability tools such as Splunk, Datadog, New Relic, or similar
  • Strong understanding of:
  • Distributed systems
  • Fault isolation
  • Performance tuning
  • Reliability and resiliency engineering
  • Ability to perform effectively during production incidents and critical escalations.
  • Excellent communication and collaboration skills.

Apply for this position