Principal Engineer, Cloud Platforms

AI Savants Incorporated
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Tech stack

API
Amazon Web Services (AWS)
Business Logic
Azure
Cloud Computing
Computer Security
Computer Programming
Continuous Integration
Data Sharing
Relational Databases
Disaster Recovery
Distributed Systems
Fault Tolerance
Python
PostgreSQL
Enterprise Messaging Systems
MySQL
Platform as a Service (PaaS)
Queueing Systems
Reliability Engineering
Prometheus
Datadog
Google Cloud Platform
Cloud Platform System
Data Classification
Istio
Delivery Pipeline
Grafana
Multi-Cloud
Event Driven Architecture
Build Management
GitLab CI
Kubernetes
Information Technology
Deployment Automation
Kafka
REST
ELK
Go

Job description

In this pivotal role, you will be instrumental in designing, building, and maintaining the shared infrastructure services and platforms that our product and application teams depend on.

You will focus on creating reusable, reliable, and scalable solutions that abstract away complexity, enabling other teams to focus on their core business logic and deliver features faster in a multi-cloud environment.

Design and build core platform components and shared infrastructure services that other development teams will integrate with and leverage to deploy and operate their applications

Architect, implement, and manage highly available and scalable Kubernetes platforms as a service for internal consumers

Develop robust, internal-facing tools and automation for infrastructure provisioning and management primarily using Go (Golang)

Architect and optimize foundational solutions within Cloud environments (AWS, Azure, etc.), focusing on creating reusable patterns and modules for other teams

Design and implement shared Event-Driven Architecture components and messaging platforms using technologies like Kafka or Google Pub/Sub that product teams can easily utilize

Develop and maintain robust CI/CD pipelines (e.g., GitLab CI and ArgoCD) as a service, providing standardized and automated deployment workflows for various development teams

Design and build resilient Distributed Systems components that serve as building blocks for other applications, focusing on reliability, fault tolerance, and performance

Manage and optimize our shared infrastructure across Multi-Region Cloud Environments, ensuring that platform services are globally available and performant for all consumers

Establish and enhance centralized Observability and Monitoring platforms and tools that provide self-service insights for consuming teams

Define and implement clear, well-documented RESTful API designs for the infrastructure services you build, ensuring ease of integration for internal clients

Implement and manage Service Mesh (e.g., Envoy, Istio) capabilities, providing traffic management, security, and policy enforcement as a shared platform for services

Design, implement, and optimize highly available Relational Database services or shared data platforms for broad organizational use

Collaborate closely with product development teams to understand their infrastructure needs and pain points, providing technical guidance and support

Participate in on-call rotations to support the critical shared infrastructure you build

Complete security & privacy literacy and awareness training during onboarding and annually thereafter

Review (initially and annually thereafter), understand, and adhere to Information Security/Privacy Policies and Procedures such as (but not limited to):

Data Classification, Retention & Handling Policy

Incident Response Policy/Procedures

Business Continuity/Disaster Recovery Policy/Procedures

Mobile Device Policy

Account Management Policy

Access Control Policy

Personnel Security Policy

Privacy Policy

Saviynt is an amazing place to work. We are a high-growth, Platform as a Service company focused on Identity Authority to power and protect the world at work. You will experience tremendous growth and learning opportunities through challenging yet rewarding work that directly impacts our customers, all within a welcoming and positive work environment. If you're resilient and enjoy working in a dynamic environment, you belong with us!

Requirements

9+ years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a strong focus on building tools and services for other engineers

Deep expertise with Kubernetes in production environments, particularly in providing it as a platform (i.e., single-tenant and multi-tenant deployment architectures)

Strong programming skills in Go (Golang) and Python, with experience building robust, maintainable backend services and automation

Extensive hands-on experience with at least one major Cloud Provider (AWS, Google Cloud Platform, or Azure); multi-cloud experience is a strong plus, especially in building abstractions over them

Proven experience designing and implementing Event-Driven Architecture and message queuing systems (e.g., Kafka, RMQ, NATS) as shared services

Solid understanding of and practical experience with CI/CD pipeline tools (especially GitLab CI), and experience establishing automated delivery processes for other teams

Demonstrable experience designing and operating Distributed Systems, with an understanding of patterns for creating reliable, shared components

Familiarity with Multi-Region Cloud Environments and strategies for building globally distributed and highly available platforms

Proficiency in establishing and utilizing comprehensive Observability and Monitoring platforms (e.g., Prometheus, Grafana, ELK stack, Datadog) for shared infrastructure

Strong experience with RESTful API design principles and building well-documented, consumable APIs

Knowledge of Service Mesh concepts and practical experience with solutions like Istio in a platform context

Hands-on experience with Relational Databases (e.g., MySQL, PostgreSQL), ideally in managing them as a service

Excellent communication skills and the ability to clearly articulate complex technical concepts to both technical and non-technical audiences

A strong customer-centric mindset, treating internal development teams as your primary customers

Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical or military experience, required

Benefits & conditions

Competitive compensation, benefits, and growth opportunities

About the company

Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely accelerate their deployment and usage of AI. Saviynt is recognized as the leader in identity security, with solutions that protect and empower the world's leading brands, Fortune 500 companies and government institutions. For more information, please visit the Saviynt website.
