Senior Platform Engineer

Scotiabank Group
Dallas, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Dallas, United States of America

Tech stack

Audit Trail
Automation of Tests
Azure
Bash
Cloud Computing
Code Review
Computer Security
Databases
Continuous Integration
Data Governance
DevOps
GitHub
Identity and Access Management
IP Routing
Subnetting
Python
Key Management
Log Analysis
SQL Azure
Platform as a Service (PaaS)
Performance Tuning
PowerShell
Systems Development Life Cycle
Role-Based Access Control
Reliability Engineering
Ansible
Zero Trust Network Access
SQL Databases
Data Logging
Scripting (Bash/Python/Go/Ruby)
Grafana
MTTR
Firewalls (Computer Science)
Containerization
Data Lake
AI Platforms
Information Technology
Deployment Automation
Bicep
HashiCorp
Cosmos DB
Bitbucket
Data Management
Terraform
Software Version Control
Docker
Databricks
Vulnerability Analysis

Job description

The Senior Platform Engineer is responsible for building, tuning, and managing infrastructure; DevOps; platform site reliability; monitoring; troubleshooting; and enhancing and enabling new features on the Data & AI platform(s), in line with the bank's Data & AI strategy. This involves working with cross-functional teams such as IAM, Network, Cloud Ops, Security, and client partners on integration, process automation, platform enhancement, and delivery of new projects.

What You'll Do

  • Guidance and Direction: Provide clear direction to the team, set goals, and keep the team accountable for their deliverables. Align team goals with the overall direction of the Azure & Databricks Platform roadmap and enterprise standards.

  • Technical Oversight: Own the technical direction across Azure and Databricks: Azure networking and security architecture (VNets, Private Endpoints, NSGs, route tables, Azure Firewall), Azure Identity & Access Management (RBAC, PIM), and Databricks platform governance (Unity Catalog, workspace configuration, cluster policies). Ensure best practices for reliability, cost, and security are consistently applied.

  • Quality Assurance: Ensure high-quality support delivery for platform users and adhere to platform SLAs and SLOs.

  • Process Improvements: Continually improve platform processes and SOPs for efficiency and automation. Design and develop reusable Terraform modules for Azure native resources and Databricks (clusters, SQL warehouses, Unity Catalog objects), enabling consistent, scalable, and automated deployments via Terraform Cloud/Enterprise and CI/CD.

  • Customer Relations: Build strong relationships with data engineers, analysts, and platform users. Communicate proactively with stakeholders and cross-functional teams (Platform, Security, Cloud Ops, Networking, Data Governance) to align priorities, manage expectations, and drive adoption of platform standards.

  • Advanced Monitoring and Troubleshooting: Troubleshoot and resolve performance issues across Databricks jobs, clusters, SQL warehouses, and Azure dependencies. Implement observability based on Azure Monitor and Log Analytics, with custom dashboards for cluster/job health, driver/executor metrics, and cost insights. Establish proactive alerting and early issue detection via logs and metrics for Databricks and Azure services.

  • Site Reliability: Analyze, triage, and resolve platform issues promptly to achieve SLOs and platform reliability objectives. Drive error-budget-aware practices, post-incident reviews, and resilience engineering (e.g., autoscaling, retry/backoff strategies, policy guardrails).

  • Incident Management: Provide support during major incidents, including after-hours support. Lead incident response, communications to users and stakeholders, and root cause analysis with clear action items and follow-through.

  • Observability Tools Development: Design, build, and deploy logging/monitoring solutions for early detection and actionable insights. Standardize ingestion to Log Analytics from Databricks (audit logs, cluster events, job runs) and key Azure resources; build dashboards and alert rules to reduce MTTR.

  • Release Control Management: Maintain and enhance the Infrastructure & Platform release pipeline using Terraform, Terraform Cloud, Azure DevOps and/or GitHub Actions, with source control in GitHub/Bitbucket and artifact promotion via ACR/Artifacts. Enforce approvals, change windows, and automated checks to ensure safe, repeatable releases.

  • Client Pipeline Management: Implement CI/CD for infrastructure and analytics workloads using Terraform, Docker, Azure DevOps/GitHub Actions, and Artifact/Container registries. Automate Terraform plan/apply, Databricks Bundle releases, policy validation, and security scanning to streamline delivery and ensure compliance.

  • Credential Security: Set up Azure Key Vault and HashiCorp Vault for secret management; integrate with Databricks secret scopes and workload identities. Enforce least-privilege access via Azure RBAC and rotate credentials per policy.

  • Vendor and Technical Support Interaction: Partner with Microsoft and Databricks support and product teams to fine-tune and troubleshoot components, plan upgrades, and adopt new capabilities aligned to the roadmap and enterprise controls.

  • Mentorship: Mentor junior engineers in best practices for building, deploying, testing, and supporting services on Azure and Databricks. Promote a culture of automation, documentation, and continuous learning.

Requirements

  • 15+ years of IT experience in large organizations operating across multiple geographies and regulatory environments.

  • 5+ years of hands-on experience with Microsoft Azure (networking, security, identity, storage, compute, PaaS).

  • 5+ years with Databricks on Azure (workspaces, jobs/workflows, clusters/SQL warehouses, Unity Catalog governance).

  • 5+ years using Infrastructure as Code (Terraform modules, Terraform Cloud/Enterprise; working knowledge of ARM/Bicep a plus).

  • 5+ years with CI/CD (Azure DevOps, GitHub Actions), including automated testing, security scanning, and policy gates.

  • 5+ years with development/scripting languages for automation and platform tooling (Python, plus Bash/PowerShell; Go optional).

  • 5+ years with container technologies (Docker, orchestration on AKS or containerized jobs on Databricks/Functions).

  • Strong understanding of Azure networking (VNets, subnets, Private Endpoints, NSGs, UDRs, Azure Firewall), RBAC/PIM, and zero-trust principles.

  • In-depth knowledge of databases and data platforms: Azure SQL, Cosmos DB, Databricks Lakehouse (Delta Lake, SQL Warehouses), and data integration patterns (Event Hubs, ADLS Gen2).

  • Comprehensive understanding of SDLC and GitOps (branching, environments, code review, release promotion).

  • Experience with configuration management and automation (Ansible, Bash/PowerShell) and governance via cluster policies and IaC standards.

  • Bachelor's degree in Computer Science, Engineering, Mathematics, Management, or a related field.

Benefits & conditions

If your experience is closely related but doesn't align perfectly with every qualification, we do encourage you to apply - you might be the right candidate for this or other roles at Scotiabank!

At Scotiabank, every employee is empowered to reach their fullest potential, respected for who they are, and embraced for their differences. That's why we work to grow and diversify talent and engage employees in a performance-oriented culture.

What's in it for you?

Scotiabank wants you to be able to bring your best self to work, and to life, every day. With a focus on holistic well-being, our many flexible benefit programs are designed to help support your unique family, financial, physical, mental, and social health needs.

About the company

Global Banking and Markets (GBM) is a leading Canadian Capital Markets and Investment Banking business with a growing platform in the US and Latin America, operating globally for over 100 years. Scotiabank's strong U.S. presence provides our clients an important bridge to this key global market for trade and investment flows across the Americas and the world. Global Banking and Markets provides a full range of investment banking, credit, and risk management products and services relevant to the financing and strategic development needs of our clients. Our products include debt and equity financing, mergers & acquisitions, corporate banking, institutional equity sales, trading and research, fixed income products, derivatives, energy, foreign exchange, and precious metals. We also cross-sell the full range of wholesale products and services offered by the Scotiabank Group. Be part of an innovative, global Capital Markets and Investment Banking business with a unique geographic footprint that puts capital to work for our clients across industries! We work together to drive ambition for every future!

Scotiabank is a leading bank in the Americas. Guided by our purpose, "for every future", we help our customers, their families, and their communities achieve success through a broad range of advice, products, and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.
