Principal Data Engineer
Black Duck Software, Inc.
Belfast, United Kingdom
2 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Shift work
Languages: English
Job location: Belfast, United Kingdom
Tech stack
API
Artificial Intelligence
Amazon Web Services (AWS)
Customer Data Management
Database Queries
Infrastructure as a Service (IaaS)
Identity and Access Management
Python
Online Analytical Processing
Operational Databases
Performance Tuning
Management of Software Versions
Google Cloud Platform
Cloud Platform System
Delivery Pipeline
Low Latency
Operational Systems
Data Management
Machine Learning Operations
Stream Processing
Job description
- Lead the design and build-out of cross-product data services for multiple product lines from one governed data plane.
- Define the "customer data plane" model: canonical customer identifiers, shared dimensions, and consistent facts used across products.
- Build and operationalize ingestion patterns for batch, streaming, and event data, with repeatable onboarding for new sources.
- Own the operational playbook for data reliability: data contracts, quality checks, lineage, monitoring, and incident response.
- Implement and run access methods that make data usable: curated datasets, secure query interfaces, and product-ready data APIs where needed.
- Productize customer-facing data products (datasets, metrics, exports, and feeds) with versioning, documentation, and clear ownership.
- Design data models that fit both operational systems (RDS) and analytics stores (columnar/OLAP), including performance and cost tuning.
- Ensure data products also power ML workflows: trusted training datasets, feature-ready outputs, and consistent definitions for decision-making.
- Enable AI automation by delivering reliable, low-latency, governed data products that can be used safely in automated workflows.
- Partner closely with product, engineering, and security stakeholders to align data products to roadmap priorities and customer outcomes.
- Raise the technical bar through architecture reviews, standards, and mentoring, while staying hands-on in key systems.
Requirements
- Significant experience building and operating production data platforms at scale, including on-call and operational ownership.
- Strong SQL and Python skills, applied to building pipelines, services, and automation.
- Hands-on experience running cloud systems on AWS and Google Cloud (IaaS level: compute, storage, networking, IAM).
- Practical experience with both operational databases (RDS-style) and analytics stores (columnar/OLAP), including performance tuning.
- Strong data modeling ability, including schema evolution, conformed dimensions, and "one source of truth" metric definitions.
- Track record of delivering data products that other teams or customers depend on, with clear contracts and reliability expectations.
- Ability to make sound engineering tradeoffs across latency, accuracy, cost, and security without creating brittle complexity.
- Experience with lakehouse patterns and open table formats (or similar), including governance and table maintenance.
- Experience with orchestration and streaming systems used in production (batch + real-time), and managing backfills safely.
- Familiarity with ML data needs (training/serving splits, feature-ready datasets, evaluation datasets) and AI-adjacent workflows.
Preferred
- Experience building self-service data platforms (catalog, discoverability, access controls) used by multiple teams.
- Experience in regulated or security-sensitive environments, including retention, auditing, and data access controls.
Work model, location & travel
About the company
Black Duck Software, Inc. helps organizations build secure, high-quality software, minimizing risks while maximizing speed and productivity. Black Duck, a recognized pioneer in application security, provides SAST, SCA, and DAST solutions that enable teams to quickly find and fix vulnerabilities and defects in proprietary code, open source components, and application behavior. With a combination of industry-leading tools, services, and expertise, only Black Duck helps organizations maximize security and quality in DevSecOps and throughout the software development life cycle.