Data Quality Lead - Active SC Clearance

Hays plc
Kilsby, United Kingdom
4 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
£ 143K

Job location

Kilsby, United Kingdom

Tech stack

Amazon Web Services (AWS)
Data analysis
Software Documentation
Data Deduplication
Data Dictionary
Data Files
Data Governance
Data Integrity
Data Visualization
Data Logging
Data Ingestion
SC Clearance
Data Pipelines
Databricks

Job description

We are seeking a strategic and technically proficient Data Quality Lead to oversee the ingestion, evaluation, and quality assurance of critical datasets supporting the Global Supply Chains Intelligence Programme (GSCIP). This role will be pivotal in ensuring data integrity, enhancing operational efficiency, and supporting high-impact analysis across government and commercial domains.

Data Ingestion & Source Management

  • Oversee ingestion pipelines for key data sources including Sayari, S&P, and Altana.
  • Liaise with internal and external stakeholders (Microdata team, HMG equivalents and external suppliers) to ensure procurement tracking and data quality.
  • Monitor and resolve backlog issues in data quality logs.

Dataset Evaluation & Maintenance

  • Lead the data quality assessment of all GSCIP data, including exploratory data analysis (EDA) of new datasets and pushing suppliers to rectify issues.
  • Track and log existing data quality issues in coordination with GSCIP data scientists and licence users, and prioritise issues based on their feedback.
  • Raise issues with suppliers, suggest solutions (based on data scientist feedback), communicate updates to GSCIP users, and understand contractual agreements in order to hold suppliers to account.
  • Write pipelines to ensure that ingested data meets the expected format and contractual criteria.
  • Proactively conduct exploratory data analysis and anomaly detection on data sources to identify unknown issues. Produce dashboards and reports that communicate features of interest in the graph to GSCIP data scientists and users, e.g. where the largest outliers are and where coverage is best or worst.
  • Actively work on enhancements to data quality, e.g. developing reproducible pipeline functions for gscip_utils/dashboards that handle data quality features such as transaction deduplication and imputation in a standardised way.
  • Investigate the relative value of each data source to feed into future commercial procurement strategy. Track the strengths and weaknesses of each data supplier.
  • Scope out new data sources to fill existing data gaps.
  • Lead annual evaluations of datasets to determine retention and relevance.
  • Conduct IRAP assessments for new datasets.
  • Maintain comprehensive documentation including data dictionaries, caveats, and usage guides.

Cyber Monitoring & Pipeline Development

  • Collaborate with engineers and data scientists, supporting engineers in building robust, high-quality data pipelines.
  • Explore new data sources, particularly those related to risk and company intelligence.
  • Proactively investigate data anomalies and ensure datasets meet commercial requirements.

Governance of Unowned or Informally Managed Datasets

  • Formalise ownership and tracking for datasets such as HMRC, ComTrade, TDM, and Shipping Instructions.
  • Coordinate with microdata teams to align procurement and sharing protocols.

Learning & Development

  • Design and deliver learning modules to support GSCIP data literacy.
  • Encourage new team members to engage in QA activities to build familiarity with datasets.

Requirements

  • Proven experience in quality assurance of data and working with data engineers.
  • Strong understanding of commercial and government data sources.
  • Ability to manage cross-functional relationships and coordinate across departments.
  • Experience with documentation standards and data governance frameworks.
  • Strategic thinker with a proactive approach to data quality and risk mitigation.
  • Excellent communicator with the ability to translate technical concepts for non-technical audiences.
  • Comfortable working in a fast-paced, multi-stakeholder environment.
  • Familiarity with systems including Databricks, Amazon Neptune, and AWS.

About the company

Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and as an employment business for the supply of temporary workers. By applying for this job you accept the T&Cs, Privacy Policy and Disclaimers, which can be found on our website.
