Software Engineer (AWS)
Job description
We're looking for a Software Engineer to help build and operate the data infrastructure that powers our next generation of products and AI-driven capabilities. In this role, you'll take ownership of observability, monitoring, and reliability across our platform, ensuring every feature is measurable, performant, and resilient from day one.
Alongside infrastructure and operations work, you'll contribute directly to our software codebase, developing data-driven services and AI-integrated features. This is a hands-on engineering role for someone who enjoys automation, problem solving, and building scalable systems while growing into data and AI technologies.
This is a hybrid position requiring 3 days per week in our Newcastle office.
First 90 Days
- 30 days: Get familiar with our platform architecture, data pipelines, monitoring tools, and cloud environment. Understand existing observability practices, alerting systems, and software development workflows.
- 60 days: Begin actively improving logging, performance tracking, and cost monitoring across services. Contribute code to internal products and data-driven applications while tuning alerting systems for clarity and reliability.
- 90 days: Own observability standards for new features and pipelines. Lead root-cause investigations using monitoring data and implement long-term fixes through automation and code improvements. Drive continuous improvements in reliability, performance, and cost efficiency.
Meet the Team
You'll work closely with software engineers, data engineers, product teams, and AI specialists in a collaborative, fast-moving environment. The team values automation, operational excellence, and building systems that scale reliably while enabling rapid product innovation.
How Success Will Be Measured
- Consistent implementation of observability across all new services and features
- Improved system reliability, performance, and actionable alerting
- Early detection and resolution of cost anomalies in cloud usage
- Quality and maintainability of code contributions to data and AI-enabled products
- Reduction in recurring incidents through strong root-cause analysis and preventative solutions
Responsibilities
- You'll integrate logging, metrics, and performance tracking into new services and pipelines.
- You'll monitor platform health, investigate anomalies, and fine-tune alerting systems to remain accurate and actionable.
- You'll track cloud usage and identify cost spikes, working with the team to optimise infrastructure.
- You'll contribute code to internal tools, data platforms, and AI-integrated features.
- You'll lead root-cause analysis for incidents and implement long-term fixes through automation and system improvements.
Working at Sage means you're supporting millions of small and medium-sized businesses globally with technology to work faster and smarter. We leverage the future of AI, meaning business owners spend less time doing routine tasks, like entering invoices and generating reports, and more time pursuing their ambitions.
Requirements
- Deep expertise in observability, monitoring, and cloud cost management
- Hands-on experience building data-driven and AI-enabled software products
- Strong exposure to automation and reliability engineering practices
- Experience working across infrastructure, software development, and operations
- Growth into AI/ML-adjacent systems and data platform engineering
- Experience as a Software Engineer working on cloud-based or data-driven systems
- Strong programming skills with a focus on clean, maintainable code
- Experience implementing monitoring, logging, and observability solutions
- Familiarity with cloud platforms and cost management concepts
- Understanding of software development lifecycles and operational best practices
- Strong problem-solving mindset with a focus on reliability and automation
- Ability to work effectively in cross-functional teams
- Excellent communication skills in English
Nice-to-Have Skills
- Experience with data pipelines, analytics platforms, or AI-integrated systems
- Familiarity with incident response, SRE, or reliability engineering practices
- Exposure to automation tools, infrastructure as code, or DevOps workflows
- Interest or experience in AI/ML systems and data infrastructure
Benefits & conditions
At Sage, we offer you an environment where you can grow professionally without compromising your personal well-being. Our benefits package is designed to provide stability, flexibility, and balance.
Benefits video: https://youtu.be/TCMtTYUUiuU
- Comprehensive health, dental and vision coverage
- Work away scheme for up to 10 weeks a year
- Ongoing training and professional development
- 5 paid days per year to volunteer through our Sage Foundation
- Flexible work patterns and hybrid working