Data Scientist III - Lead Data Architect

Astreya Partners
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$108K - $180K

Job location

Remote

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Data analysis
Azure
Google BigQuery
Data Architecture
Data Cleansing
Data Governance
Data Transformation
Data Structures
Data Warehousing
Dimensional Modeling
Supervisory Control and Data Acquisition (SCADA)
JSON
Machine Learning
Performance Tuning
SQL Databases
Data Streaming
Parquet
Google Cloud Platform
Feature Engineering
Delivery Pipeline
Large Language Models
Spark
Generative AI
Data Lake
Semi-structured Data
Kubernetes
Data Lineage
Data Analytics
Star Schema
Kafka
Spark Streaming
Machine Learning Operations
Smart Grid
Data Pipelines

Job description

  • Design AI-ready data models to support machine learning, advanced analytics, and real-time decisioning
  • Build and maintain feature-ready datasets for data science teams (feature engineering support)
  • Develop semantic and analytical data layers for BI, AI, and self-service analytics
  • Collaborate with data scientists to translate ML use cases into scalable data structures
  • Model and integrate high-volume time-series and IoT data (e.g., smart meters, sensors, grid telemetry)
  • Enable real-time / near-real-time data pipelines for AI-driven insights
  • Ensure data models support MLOps frameworks (model training, validation, deployment pipelines)
  • Implement data lineage, observability, and quality frameworks to support trusted AI outcomes
  • Optimize data structures for lakehouse architectures and distributed compute environments
  • Align with data governance, privacy, and regulatory compliance requirements

AI/Analytics Use Case Alignment

  • Predictive Maintenance: Asset failure prediction using sensor and maintenance data
  • Wildfire Risk Modeling: Environmental and grid data modeling for risk forecasting
  • Load Forecasting: Time-series modeling for energy demand prediction
  • Customer 360 Analytics: Behavioral segmentation and usage insights
  • Grid Intelligence: AI-driven outage prediction and response optimization
  • Generative AI Enablement: Structuring enterprise data for LLM-based insights and copilots

Requirements

  • 8+ years in data modeling, data architecture, or analytics engineering
  • 3+ years of utility/energy domain experience (smart grid, AMI, SCADA systems) supporting electric, gas, and/or water utilities
  • Strong expertise in:
      • Dimensional modeling for analytics (Star/Snowflake schemas)
      • Data modeling for machine learning pipelines
      • SQL and data transformation frameworks (dbt preferred)
  • Experience designing data models for:
      • Data lakes / lakehouse architectures (Delta Lake, Iceberg, etc.)
      • Structured + semi-structured data (JSON, Parquet)
  • Proven experience supporting AI/ML workloads in production environments
  • Experience with cloud AI ecosystems:
      • AWS (SageMaker, Redshift)
      • Azure (Synapse, Azure ML)
      • Google Cloud Platform (BigQuery, Vertex AI)
  • Familiarity with time-series and streaming platforms (Kafka, Spark Streaming)
  • Knowledge of feature stores (Feast, Tecton)
  • Experience with MLOps tools (MLflow, Kubeflow)
  • Understanding of LLM data preparation, vector databases, and embeddings

Key Skills

  • AI/ML Data Modeling & Feature Engineering
  • Lakehouse & Modern Data Stack (dbt, Spark, Delta Lake)
  • Time-Series & Streaming Data Modeling
  • Data Governance for AI (quality, lineage, bias mitigation)
  • Performance Optimization for Analytics Workloads
  • Cross-functional collaboration (Data Science, Engineering, Business)

Benefits & conditions

$108,000.00 - $180,000.00 USD (Salary)

  • Please note that the salary information provided herein is base pay only (gross). It does not include other forms of compensation that may or may not apply to this specific position, such as performance-based bonuses, benefits-related payments, or other general incentives. None of these are guaranteed; they may be subject to specific eligibility requirements and are paid wholly at the discretion of Astreya.
  • Further, the salary information noted above is a range that consists of a minimum and maximum rate of pay for this specific position. Where an applicant or employee is placed on this range will depend on objective, documented work-related considerations such as education, experience, certifications, licenses, and preferred qualifications, among other factors.

Astreya offers comprehensive benefits to all Regular, Full-Time Employees, including:

  • Medical provided through UHC (PPO, HSA, Surest options) / Medical provided through Kaiser (HMO option only) for California employees only

  • Dental provided through UHC

  • Nationwide Vision provided by UHC

  • Flexible Spending Account for Health & Dependent Care

  • Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific)

  • Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera

  • Corporate Wellness Program provided by Goomi Group

  • Employee Assistance Program

  • Wellness Days

  • 401k Plan

  • Basic and Supplemental Life Insurance

  • Short Term & Long Term Disability

  • Critical Illness, Critical Hospital, and Voluntary Accident Insurance

  • Tuition Reimbursement (available 6 months after start date, capped)

  • Paid Time Off (accrued and prorated, maximum of 120 hours annually)

  • Paid Holidays

  • Any other statutory leaves, paid time, or other ancillary benefits required under state and federal law
