Staff/Principal Platform Engineer - Infrastructure
Role details
Job location
Tech stack
Job description
This is not a protocol-specific role. You will work across the entire organization, building and scaling the internal developer platform that enables 60+ blockchain network teams to deploy, monitor, and operate their services reliably and securely. You will be a technical leader who sets direction, drives standards, and mentors engineers-while still being deeply hands-on., * Own the architecture and evolution of P2P.org's internal developer platform-Kubernetes, monitoring, secrets management, and delivery infrastructure. You are responsible for how these systems scale, interconnect, and evolve across 60+ blockchain networks.
- Design and build scalable, fault-tolerant platform components-including capacity planning, multi-tenancy, networking topology, and storage architecture. Make and document architectural trade-offs clearly.
- Write production code in Go and/or Python - platform tooling, services, operators, and APIs. This is a hands-on engineering role, not just configuration.
- Build self-service infrastructure that teams consume with minimal friction - Terraform modules, Helm charts, reusable pipeline templates, and platform APIs. Champion our versioned module library and automated security propagation via Renovate.
- Scale our Kubernetes platform across bare-metal and cloud, and evolve the observability stack (Victoria Metrics, Prometheus, Grafana, Loki) and delivery platform (ArgoCD, CI/CD pipelines).
Security & Compliance
- Own the security posture of the platform - secrets management, access controls, identity federation, network policies, and encryption at rest and in transit.
- Maintain SOC 2 Type II compliance across platform infrastructure - implement controls as code, maintain audit readiness, and work with compliance teams to close gaps proactively.
- Enforce container and supply chain security - image scanning, signed artifacts, base image management, and vulnerability remediation. Apply zero-trust principles: least-privilege roles, short-lived credentials, auditable access. Harden Kubernetes workloads through pod security standards, network policies, RBAC, and runtime monitoring.
AI & Engineering Productivity
- Use AI tooling as a core part of how you work and champion its adoption across the infrastructure team and wider engineering organization-in code generation, code review, incident response, documentation, and operational automation.
- Build or integrate AI-assisted tooling into platform workflows where it delivers genuine improvement. Help define how we measure the impact of AI adoption across engineering.
Leadership & Collaboration
- Lead the technical direction for the infrastructure team-set architectural standards, drive best practices, and reduce technical debt.
- Collaborate cross-functionally with SRE, blockchain, engineering, data, and security teams. Translate their needs into platform capabilities.
- Mentor and grow engineers through code reviews, architectural guidance, design reviews, and knowledge sharing.
- Contribute to incident response and post-incident reviews, driving platform reliability improvements.
Requirements
- 8+ years of experience in infrastructure, SRE, or platform engineering, with at least 3 years in a senior or staff-level role owning system architecture and technical direction.
- Strong software engineering skills in Go and/or Python - you write production-quality code, not just scripts. You design, build, test, and ship platform services and tooling.
- Proven ability to architect large-scale distributed systems-capacity planning, fault-tolerance patterns, system decomposition, and the ability to articulate trade-offs clearly.
- Deep Kubernetes expertise - managed (GKE, EKS) and bare-metal/hybrid clusters, including networking, storage, multi-tenancy, and cluster lifecycle management.
- Strong Terraform experience - reusable module libraries, state management at scale, and infrastructure lifecycle automation.
- Experience with at least one major cloud provider (GCP, OCI, AWS, or Azure) plus bare-metal infrastructure.
- Hands-on experience with observability (Prometheus, VictoriaMetrics, Grafana, Loki, OpenTelemetry, or equivalent) and delivery tooling (ArgoCD, Helm, CI/CD pipelines).
- Strong Linux systems and networking fundamentals (DNS, load balancing, firewalls, VPN, BGP).
- Infrastructure security experience - container security, supply chain security, network policies, zero-trust access patterns, and compliance (SOC 2 Type II or similar).
- Active use of AI tooling in your engineering workflow, with experience driving adoption across a team.
- A track record of building self-service platform capabilities that other teams actually adopt.
- Strong written and verbal communication - technical proposals, architectural decisions, and cross-team collaboration across distributed teams. English B2 minimum.
Benefits & conditions
At P2P.org we have a team of experts with their own unique approach and ownership culture. Together we gain experience and make dreams come true!
- Fully remote
- Full-time contractor (Indefinite-term Consultancy Agreement)
- Competitive salary level in $ (we can also pay in crypto)
- Paid vacation and sick leave
- Well-being program
- Mental Health care program
- Compensation for education, including foreign language & professional growth courses
- Equipment & co-working reimbursement program
- Overseas conferences, community immersion