Staff-level engineers
Role details
Job location
Tech stack
Job description
Vollzeit
- Typ Festanstellung
Gewünschte Fähigkeiten & Kenntnisse
Kubernetes Design SQL Azure Edge Software-Engineering Cloud across Übersetzungssoftware Open Source AWS Python Solid Fehler-Ursachen-Analyse Grafana Go Mentoring Support Continuous Integration SaaS Engineering Automatisierung Microservices Flexibilität, As a Senior / Staff Software Engineer -- Platform and Reliability, you'll be a key contributor to the design and operation of our cloud-native infrastructure. You will build and scale a reliable, cost-efficient platform that enables fast iteration and secure delivery of our vector database solutions to customers around the world.
This role sits at the intersection of software engineering and infrastructure. You'll write production-grade Go code, manage Kubernetes at scale, and design internal tooling and systems that enable developer autonomy and operational excellence.
We are open to hiring at Senior or Staff level, depending on experience, scope of impact, and technical leadership. Staff-level engineers are expected to operate with broader ownership, drive technical direction, and influence multiple areas of the platform.
Tasks
- Design, implement, and operate our cloud-native platform architecture
- Build and maintain Kubernetes clusters and develop custom Kubernetes operators
- Write production-grade Go code for platform services and automation tooling
- Optimize cloud infrastructure (AWS/GCP/Azure) for performance, cost, and reliability
- Improve observability across the platform through monitoring, logging, and alerting
- Automate workflows and integrations across systems and tools
- Collaborate with engineering, operations, and data teams to understand and support their infrastructure needs
- Contribute to incident response, root cause analysis, and system hardening
- Continuously improve performance, developer experience, and reliability at scale, * Amazon Web Services, Architektur, Automatisierung, Cloud Computing, Coaching und Mentoring, Continuous Integration, Daten- / Datensatzprotokollierung, Datenbanken, Distributed Computing, Grafana, Incident Response, Infrastruktur, Istio, Kubernetes, Microservices, Microsoft Azure, Montage und Demontage, Open Source, Operational Excellence, Produktlinienentwicklung (Software), Prometheus, Python, Saas, Technische Leitung, Technische Überwachung, Telemetrie, Ursachenanalyse, Workflows
Persönliche Fähigkeiten
- Eigenmotivation, Kommunikation, Teamarbeit, Zuverlässigkeit
Requirements
- 5--7+ years of experience in platform engineering or SRE roles
- Strong proficiency in Go and Python, or deep expertise in one with willingness to work with both
- Deep experience developing Kubernetes operators
- Solid understanding of distributed systems and microservices architecture
- Hands-on experience with cloud providers (AWS, GCP, or Azure)
- Experience with CI/CD, infrastructure-as-code, and automation best practices
- Comfort participating in on-call rotations and managing production incidents
- Proactive, ownership-driven mindset with strong communication skills
Nice to have
- Experience in a SaaS, database, or systems-level product company
- Experience with Prometheus, Grafana, and service meshes (e.g. Istio, Linkerd)
- Familiarity with observability standards like OpenTelemetry
- Contributions to open-source projects
Staff-level expectations (if applicable)
- Ability to drive technical direction across systems or domains
- Experience mentoring engineers and raising the technical bar
- Strong system-level thinking and long-term ownership mindset
Benefits & conditions
- Competitive salary and benefits package.
- Flexible work hours and full remote setup.
- Opportunity to work on cutting-edge technology in a fast-growing industry.
- Professional development opportunities and career growth.
- Collaborative and inclusive work environment.