Platform Engineer
Role details
Job location
Tech stack
Job description
This role is crucial for ensuring our services are scalable, reliable, secure, and highly performant.
- Infrastructure as Code: Design, deploy, and manage our AWS cloud infrastructure using Pulumi (Python) to ensure a consistent, repeatable, and version-controlled environment.
- CI/CD Pipeline Ownership: Build and optimise our continuous integration and delivery pipelines using CircleCI to facilitate rapid and reliable deployment of our Next.js (Web), Django (Backend), and React Native/Expo (Mobile) services.
- Observability & Monitoring: Implement and manage comprehensive monitoring, alerting, and logging solutions using Datadog and Sentry to provide deep visibility into application performance, system health, and user experience.
- Automation & Tooling: Develop and maintain internal tooling and automation, primarily using Python, to improve engineering efficiency and streamline operational tasks.
- Scalability & Reliability: Collaborate with product and development teams to architect scalable services, troubleshoot production issues, and ensure high availability across the platform.
- Security & Compliance: Embed security best practices into our infrastructure and deployment processes, ensuring adherence to internal standards and industry compliance requirements.
Requirements
We are looking for an experienced and motivated Senior Platform Engineer to join our growing Platform & Enablement team. In this role, you will design, build, and maintain the cloud infrastructure and tooling that powers our Web, Backend, and Mobile applications. You will champion infrastructure-as-code principles, CI/CD best practices, and robust observability. Your goal is to help complete the migration of our infrastructure to IaC and help teams ship features easily, faster, and with confidence., * 7+ years of proven experience in a Platform Engineering, DevOps, or SRE role.
- Deep expertise with a broad range of AWS services, including solid MySQL RDS
- Strong hands-on experience with Infrastructure as Code, specifically Pulumi with Python.
- Expertise in Python as a general-purpose programming language for tooling and automation.
- Experience with containerisation technologies such as Docker, Kubernetes or ECS.
- Hands-on experience configuring and maintaining CI/CD pipelines using CircleCI.
- Demonstrated experience with monitoring and observability tools, particularly Datadog.
- A product-minded engineering approach able to abstract complex operational problems into elegant, reusable platform solutions.
- Excellent communication skills and confidence collaborating cross-functionally with both technical and non-technical stakeholders.
- Comfortable driving initiatives in ambiguous problem spaces, aligning business and technical perspectives.
Nice to haves
- Proven experience mentoring engineers on best practices for infrastructure and reliability.
- Familiarity with the operational concerns of deploying and running diverse technology stacks, including Next.js/TypeScript, Django/Python and React Native/Expo.
- Experience with data store infrastructure such as OpenSearch, Redis, and ElastiCache.
- Experience delivering infrastructure projects, driving consensus, and providing guidance to peers.
- Experience integrating AI/ML-powered tools or APIs to enhance platform capabilities.
- Experience working in a fast-paced, product-led growth environment.
- A mindset geared toward continuous improvement and operational excellence.
If you're ready to join us on our mission to become the most loved pet travel solution, we'd love to hear from you! In your application, please feel free to note which pronouns you use (For example- she/her/hers, he/him/his, they/them/theirs, etc).