Data Scientist

ORNI 39 LLC
Remote, United States of America
11 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Remote
Remote, United States of America

Tech stack

Microsoft Excel
API
Artificial Intelligence
Data analysis
Data Validation
Data Cleansing
Information Engineering
Data Transformation
Data Mining
Data Systems
Python
Machine Learning
NumPy
Power BI
SQL Databases
Data Streaming
Tableau
Management of Software Versions
Data Processing
Model Validation
Pandas
Scikit Learn
Information Technology
Performance Monitor

Job description

The Data Scientist makes predictive insights and builds decision-support tools from operational and performance data, combining hands-on data science (statistical modeling and machine learning) with ownership of industrial analytics stack. The Data Scientist serves as a PI System Administrator, Seeq Developer, and builder of AI-enabled tools that help engineers and operators act faster and with confidence.

The Data Scientist will work with industrial data systems-especially the PI System (OSIsoft PI / AVEVA PI)-to ensure reliable data availability and governed context and will develop Seeq analyses and Python-based pipelines for time-series modeling, anomaly detection, forecasting, and performance monitoring.With curiosity, rigor and comfort with large time-series datasets the Data Scientist is able to take solutions end-to-end-from PI tag and context configuration, to Seeq development, to Python/AI tool deployment-while collaborating with cross-functional technical teams.

Essential Functions

  • Data Science, Modeling, and Insights
    • Build statistical and machine-learning models to detect anomalies, forecast performance, and identify optimization opportunities
    • Design and evaluate experiments and model validation approaches; translate results into clear recommendations for engineering and operations
    • Develop dashboards, reports, and model performance metrics to communicate insights and drive data-informed decisions
  • PI System Administration & Time-Series Data Engineering
  • Administer and support the PI System (OSIsoft PI / AVEVA PI), including tag strategy, data quality monitoring, and user support
  • Build and maintain PI AF structure (assets, templates, attributes) and documentation to provide governed context for analytics and reporting
  • Support PI interfaces/data flows and collaborate with OT/IT and engineers to validate sensors/tags, troubleshoot gaps, and improve reliability and performance
  • Create curated datasets, features, and labels from PI data (with clear definitions and lineage) to support Seeq analyses and ML modeling
  • Seeq Development & AI-Enabled Tools
  • Develop and maintain Seeq Workbooks/Analyses for performance monitoring, anomaly detection, and root-cause investigations
  • Create reusable Seeq templates, calculation standards, and best practices; enable users through documentation and training
  • Build AI-enabled tools (e.g., copilots, guided diagnostics, automated summaries) that leverage governed PI/Seeq context to accelerate engineering workflows
  • Evaluate, monitor, and improve AI tool quality (accuracy, drift, user feedback), and implement practical guardrails for safe, reliable use
  • Python, Analytics Engineering & Deployment
  • Develop and maintain Python-based pipelines for data extraction, preprocessing, modeling, and automation
  • Prototype and productionize analytical applications that support performance monitoring, anomaly detection, and forecasting
  • Automate recurring model runs, evaluations, and reporting workflows with attention to reproducibility and reliability
  • Improve existing analytics codebases; contribute to model monitoring, documentation, and maintainable data science practices
  • Project & Engineering Partnership
  • Collaborate with engineers and subject matter experts to frame operational problems into measurable data science objectives
  • Provide analytical support for initiatives including data validation, statistical analysis, modeling, and performance reporting
  • Help standardize modeling approaches, feature definitions, and evaluation metrics across projects
  • Data Quality, Governance & Monitoring
  • Ensure accuracy and reliability of datasets used for analysis and modeling (validation checks, outlier handling, sensor sanity checks)
  • Perform data cleaning, validation, and documentation, including assumptions, feature definitions, and dataset lineage
  • Maintain organized analytical workflows and pipelines to support repeatable modeling and ongoing monitoring

Other Responsibilities

  • Other duties and projects as assigned by management

Requirements

  • Bachelor's degree in Data Science, Computer Science, Engineering, Statistics, Applied Math, or related field

  • 2-5 years of experience in data science, applied analytics, or technical modeling roles

  • Strong Python skills for data science (e.g., pandas, numpy, scikit-learn; visualization libraries)

  • Strong skills in SQL and Excel for analysis, validation, and stakeholder-ready outputs

  • Experience with data visualization and reporting tools (Power BI, Tableau, or similar)

  • Strong statistical reasoning, analytical problem-solving skills, and attention to data quality

  • Ability to communicate technical findings clearly to both technical and non-technical stakeholders

  • Demonstrated experience using applied statistics, machine learning, and time-series modeling

  • Ability to use Python for data science and AI tooling (data wrangling, modeling, visualization; building assistants/automation)

  • Understand and apply PI System administration fundamentals and data engineering for high-frequency time-series (tags, quality checks, contextualization)

  • Seeq development (shared analyses, calculations, templates) and stakeholder-ready data storytelling

  • Ability to partner cross-functionally and tool and team enablement (requirements, training, documentation, adoption)

  • Lead and support deployment-minded practices (reproducibility, versioning, testing, monitoring) for analytics, models, and AI tools

  • Must have strong verbal and written communication skills

  • Must be able to read, write and speak English at a level which will permit the employee to accurately understand and communicate information to safely and efficiently perform the job duties

  • Ability to prioritize and plan work activities so time is used efficiently and effectively

  • Must demonstrate accuracy and thoroughness to ensure quality performance

  • Ability to identify and resolve problems in a timely manner, + Experience working with OSIsoft PI / AVEVA PI System (PI Data Archive, PI AF) and industrial time-series data

  • Experience developing in Seeq (Workbench/Organizer), including building shared analyses and calculations
  • Experience with operational/engineering datasets (e.g., power generation, rotating equipment, process systems)
  • Familiarity with time-series methods (e.g., resampling, lag features, seasonality, change-point detection)
  • Experience developing reusable analytics packages, APIs, or scheduled jobs for model execution
  • Knowledge of predictive modeling (forecasting, classification/regression), anomaly detection, and model evaluation

Physical Requirements

  • The ability to work in an office environment and to work at a computer, and computer monitor, and use repetitive motion for long periods of time

  • The ability to periodically lift up to 15 lbs

Apply for this position