Data Scientist III - Lead Data Architect
Role details
Job location
Tech stack
Job description
- Design AI-ready data models to support machine learning, advanced analytics, and real-time decisioning
- Build and maintain feature-ready datasets for data science teams (feature engineering support)
- Develop semantic and analytical data layers for BI, AI, and self-service analytics
- Collaborate with data scientists to translate ML use cases into scalable data structures
- Model and integrate high-volume time-series and IoT data (e.g., smart meters, sensors, grid telemetry)
- Enable real-time / near-real-time data pipelines for AI-driven insights
- Ensure data models support MLOps frameworks (model training, validation, deployment pipelines)
- Implement data lineage, observability, and quality frameworks to support trusted AI outcomes
- Optimize data structures for lakehouse architectures and distributed compute environments
- Align with data governance, privacy, and regulatory compliance requirements
AI/Analytics Use Case Alignment
- Predictive Maintenance: Asset failure prediction using sensor and maintenance data
- Wildfire Risk Modeling: Environmental and grid data modeling for risk forecasting
- Load Forecasting: Time-series modeling for energy demand prediction
- Customer 360 Analytics: Behavioral segmentation and usage insights
- Grid Intelligence: AI-driven outage prediction and response optimization
- Generative AI Enablement: Structuring enterprise data for LLM-based insights and copilots
Requirements
- 8+ years in data modeling, data architecture, or analytics engineering
- 3+ years of Utility/energy domain experience (smart grid, AMI, SCADA systems) supporting electric, gas, and/or water utilities.
- Strong expertise in:
- Dimensional modeling for analytics (Star/Snowflake schemas)
- Data modeling for machine learning pipelines
- SQL and data transformation frameworks (dbt preferred)
- Experience designing data models for:
- Data lakes / lakehouse architectures (Delta Lake, Iceberg, etc.)
- Structured + semi-structured data (JSON, Parquet)
- Proven experience supporting AI/ML workloads in production environments, * Experience with cloud AI ecosystems:
- AWS (SageMaker, Redshift)
- Azure (Synapse, Azure ML)
- Google Cloud Platform (BigQuery, Vertex AI)
- Familiarity with time-series and streaming platforms (Kafka, Spark Streaming)
- Knowledge of feature stores (Feast, Tecton)
- Experience with MLOps tools (MLflow, Kubeflow)
- Understanding of LLM data preparation, vector databases, and embeddings
Key Skills
- AI/ML Data Modeling & Feature Engineering
- Lakehouse & Modern Data Stack (dbt, Spark, Delta Lake)
- Time-Series & Streaming Data Modeling
- Data Governance for AI (quality, lineage, bias mitigation)
- Performance Optimization for Analytics Workloads
- Cross-functional collaboration (Data Science, Engineering, Business)
Benefits & conditions
$108,000.00 - $180,000.00 USD (Salary)
- Please note that the salary information provided herein is base pay only (gross); it does not include other forms of compensation which may or may not apply to this specific position, namely, performance-based bonuses, benefits-related payments, or other general incentives - none of which are guaranteed, may be subject to specific eligibility requirements, and are wholly within the discretion of Astreya to remit.
- Further, the salary information noted above is a range that consists of a minimum and maximum rate of pay for this specific position. Where an applicant or employee is placed on this range will depend and be contingent on objective, documented work-related considerations like education, experience, certifications, licenses, preferred qualifications, among other factors.
Astreya offers comprehensive benefits to all Regular, Full-Time Employees, including:
-
Medical provided through UHC (PPO, HSA, Surest options) / Medical provided through Kaiser (HMO option only) for California employees only
-
Dental provided through UHC
-
Nationwide Vision provided by UHC
-
Flexible Spending Account for Health & Dependent Care
-
Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific)
-
Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera
-
Corporate Wellness Program provided by Goomi Group
-
Employee Assistance Program
-
Wellness Days 401k Plan
-
Basic and Supplemental Life Insurance
-
Short Term & Long Term Disability
-
Critical Illness, Critical Hospital, and Voluntary Accident Insurance
-
Tuition Reimbursement (available 6 months after start date, capped)
-
Paid Time Off (accrued and prorated, maximum of 120 hours annually)
-
Paid Holidays
-
Any other statutory leaves, paid time, or other ancillary benefits required under state and federal law