Quantitative Research Analyst (Data Modeling & Imputation)

Value Tech, Inc.
Boston, United States of America
14 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Boston, United States of America

Tech stack

Big Data
Data Transformation
Data Normalization
Python
NumPy
Raw Data
Standard Sql
SQL Databases
Unstructured Data
Data Processing
Spark
Pandas
Data Pipelines
Databricks

Job description

Value Technology focuses on building high-quality, research-ready datasets from incomplete, inconsistent, and fragmented financial data. You will be responsible for designing and implementing imputation frameworks, data transformations, and modeling pipelines that convert raw inputs into reliable signals.

This is not a pure modeling role. The majority of the value comes from data construction, validation, and methodological rigor upstream of the model.

The Impact

You will directly influence the quality and credibility of investment research by:

  • Constructing datasets where ground truth is partial or noisy
  • Designing imputation methodologies that are economically and statistically sound
  • Ensuring outputs are stable, explainable, and usable in production research
  • Your work underpins signal generation, backtesting, and ultimately client-facing insights.

Core Responsibilities

  • Build and maintain end-to-end data pipelines across structured and unstructured datasets
  • Develop imputation frameworks for missing or sparsely reported financial data (e.g., segment-level estimates, coverage gaps, timing mismatches)
  • Design and implement data normalization and reconciliation logic across overlapping hierarchies (e.g., segments, geographies, entities)
  • Perform data quality diagnostics, including coverage analysis, bias detection, and stability testing
  • Partner with researchers to translate raw data into model-ready features
  • Write efficient, reproducible code in Python and SQL for large-scale data processing
  • Document methodologies clearly to ensure transparency and repeatability

Requirements

  • 5-7 years of experience in quantitative research, data science, or financial data engineering
  • Strong expertise in data wrangling and transformation at scale (this is the core skill)
  • Proven experience with missing data techniques and imputation methods (e.g., cross-sectional inference, time-series interpolation, model-based approaches)
  • Advanced Python skills (pandas, numpy); strong SQL required
  • Experience working with messy, real-world datasets (not just clean academic data)
  • Solid grounding in statistics and econometrics
  • Familiarity with equity markets and financial statements preferred

What We're Actually Looking For

  • You are an expert data wrangler - you know that 80% of the work is getting the data right
  • You are skeptical of inputs and instinctively test for edge cases, leakage, and bias
  • You understand that imputation is modeling, not a preprocessing afterthought
  • You can balance statistical rigor with practical constraints (coverage vs. precision trade-offs)
  • You write code that others can read, audit, and reuse

Nice to Have (Not Required)

  • Experience with NLP or unstructured data pipelines
  • Exposure to alternative datasets (supply chain, transcripts, etc.)
  • Familiarity with distributed compute environments (Databricks, Spark)

Apply for this position