IT Leader - Platform Architecture, Scalability & AI Automation

Resolution Technologies, Inc.

15 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Tech stack

Artificial Intelligence

Amazon Web Services (AWS)

Computing Platforms

Azure

Cloud Computing

Cloud Engineering

Continuous Integration

Distributed Data Store

Distributed Systems

Machine Learning

Performance Tuning

Site Reliability Engineering Practices

Data Streaming

Data Logging

Transaction Processing (Computing)

Google Cloud Platform

Real Time Systems

Caching

CloudFormation

Event Driven Architecture

Containerization

Kubernetes

Low Latency

Data Analytics

Kafka

Terraform

Data Pipelines

Legacy Systems

Microservices

Job description

We are seeking a senior Technology Leader to own the architecture, scalability, and intelligent automation of our core FinTech platforms. This role is responsible for designing large-scale, cloud-native, distributed systems while leveraging AI-driven automation to improve platform efficiency, reliability, and operational scale. The ideal candidate combines deep expertise in platform architecture and distributed systems with a strong point of view on using AI to automate infrastructure operations, optimize performance, and enable predictive, self-healing platforms. This is a highly technical leadership role with material influence over how the platform scales as the business grows.

Platform Architecture & Technical Strategy

Own the end-to-end platform architecture supporting core FinTech products and transaction flows
Define architectural standards for scalability, performance, resiliency, and system composability
Lead evolution from tightly coupled or monolithic systems toward distributed, service-oriented platforms
Establish clear system boundaries, ownership models, and architectural governance
Define and execute a multi-year platform roadmap aligned with growth, transaction scale, and product velocity

Scalability & Distributed Systems

Design platforms capable of handling high transaction volumes, burst traffic, and sustained throughput
Guide horizontal scaling strategies across compute, storage, data, and messaging layers
Lead architectural decisions around sharding, partitioning, caching, asynchronous processing, and concurrency
Continuously improve latency, throughput, and resource efficiency across the platform
Enable multi-region and multi-environment scalability where required

Cloud & Infrastructure Architecture

Architect cloud platforms (AWS, Azure, or Google Cloud Platform) optimized for scale, availability, and operational efficiency
Define reference architectures for containerized workloads, microservices, and distributed runtimes
Lead Kubernetes and container platform adoption and standardization
Mature Infrastructure as Code (Terraform, CloudFormation, etc.) for consistent, scalable environments
Own capacity modeling, growth forecasting, and infrastructure lifecycle planning

AI-Driven Automation & Intelligent Platforms

Apply AI and machine learning techniques to automate platform operations and decision-making
Use AI for:

Capacity forecasting and demand prediction
Anomaly detection in platform performance and system behavior
Automated root-cause analysis and incident correlation
Predictive scaling and infrastructure optimization

Drive adoption of self-healing platform patterns where systems can respond automatically to failure or degradation
Enable data pipelines, feature stores, and runtime environments required to support AI-enabled platform services
Partner with data and engineering teams to productionize AI capabilities within core platform workflows

Platform Engineering & Developer Enablement

Build shared platform capabilities that abstract complexity and enable product teams to scale independently
Provide self-service infrastructure, golden paths, and opinionated platform tooling
Standardize CI/CD, runtime environments, observability, and deployment patterns
Reduce friction and cognitive load for application teams through strong platform design
Measure and improve developer experience as a platform outcome

Reliability, Performance & Intelligent Operations

Lead SRE practices focused on scalability, automation, and operational maturity
Define and track SLIs/SLOs centered on throughput, latency, availability, and platform health
Establish advanced observability (metrics, tracing, logging) as inputs to AI-driven insights
Lead analysis of scaling failures, performance bottlenecks, and systemic inefficiencies
Drive continuous improvement toward predictable, automated, and resilient operations

Days 1-30

Develop a deep understanding of current platform architecture and scaling limits
Review system topology, transaction paths, and performance characteristics
Identify opportunities for automation, AI-driven optimization, and architectural simplification
Build strong relationships across engineering, data, and product leadership

Days 31-60 - Architect & Automate

Define target-state platform architecture with explicit scalability patterns
Prioritize architectural improvements with the highest scale and automation leverage
Introduce AI-enabled insights into observability, capacity, or incident analysis
Establish platform standards, reference architectures, and design principles

Days 61-90 - Scale & Industrialize

Deliver measurable improvements in throughput, latency, and platform stability
Advance automation toward self-service, self-scaling, and self-healing capabilities
Roll out platform-level AI automation for operations and performance optimization
Finalize a multi-year platform and AI-automation roadmap
Establish a culture of building intelligent systems designed to scale by default

Requirements

10+ years of experience designing and operating large-scale distributed systems
5+ years in senior technical leadership roles (Director, Principal, VP, or equivalent)
Deep expertise in platform architecture, cloud-native design, and system scalability
Strong hands-on experience with AWS, Azure, or Google Cloud Platform
Proven experience with microservices, event-driven architectures, and distributed data systems
Solid background in Infrastructure as Code and automation-first platform design
Experience applying AI/ML concepts to operational or platform use cases

Experience with high-volume transaction processing or real-time systems
Strong Kubernetes and container platform experience
Experience with event streaming platforms (Kafka or equivalent)
Background modernizing legacy platforms at scale
Experience with AI-assisted operations, AIOps, or intelligent monitoring platforms

Key Competencies

Systems-level architectural thinking with a strong scalability mindset
Ability to blend platform engineering and AI automation into practical solutions
Technical credibility with senior engineers, architects, and leadership
Pragmatic decision-maker who balances ideal architecture with real-world constraints
Strong communicator who can translate technical strategy into business impact
