Data Architect, Data Platform

Goto, Inc.
Boston, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Boston, United States of America

Tech stack

Java
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Azure
Big Data
Google BigQuery
Cloud Storage
Data Architecture
Data Governance
Data Infrastructure
Data Integration
ETL
Data Systems
Decision Support Systems
Software Design Patterns
Dimensional Modeling
Distributed Data Store
Electronic Data Interchange (EDI)
Github
Python
Memcached
NoSQL
Performance Tuning
Query Optimization
Redis
Software Engineering
SQL Databases
Data Streaming
Technical Data Management Systems
CircleCI
Pulumi
Data Storage Management
Real Time Systems
Fast Healthcare Interoperability Resources
Snowflake
Database Optimization
Technical Debt
Electronic Medical Records
Data Strategy
GIT
Containerization
Data Lake
Kubernetes
Data Analytics
Star Schema
Health Level Seven International
Kafka
REST
Terraform
Stream Processing
Software Version Control
Api Management
Docker
Databricks
Microservices

Job description

As a Data Architect for healthcare applications, you are responsible for innovating, designing and managing scalable, secure, and interoperable data systems that support clinical, operational, and financial workflows. This role focuses on structuring complex healthcare data from electronic health records (EHRs) to healthcare financial data into cohesive architectures that enable accurate reporting, analytics, and patient care insights. This role will ensure compliance with healthcare regulations such as HIPAA, implement industry standards like HL7 and FHIR for seamless data exchange, and establish strong data governance, quality, and security practices such as HITRUST. By aligning data strategy with organizational goals, this role plays a critical part in improving data accessibility, reliability, and ultimately patient outcomes. Data Modeling & Design

You need to be fluent in conceptual, logical, and physical data modeling. That includes understanding normalization vs. denormalization, dimensional modeling (star/snowflake schemas), and designing for scalability and performance. Database & Storage Expertise

Deep knowledge of both relational and non-relational systems is critical. This also means familiarity with data lakes, lakehouses, and distributed storage systems/warehouses (eg, S3, Delta Lake, BigQuery). Data Integration

Designing pipelines that move and transform data reliably. This includes experience with ETL/ELT tools (DBT), streaming systems (Kafka, Kinesis), and orchestration frameworks (Airflow, etc.) with the ability to understand batch vs. Real Time tradeoffs. Performance Optimization

Indexing strategies, partitioning, query tuning, and workload. The ability to architect for scale, resiliency and business continuity. Strategic Thinking

  • Define what the future data architecture should look like
  • Determine how and where to reduce technical debt
  • Identify how to enable analytics insights, incorporate AI, and drive self-service

Artificial Intelligence (AI)

  • Define and evolve data architectures that support AI/ML workloads, including curated training datasets, feature stores, and scalable pipelines for batch and Real Time inference
  • Define and evolve data architectures that leverage AI to drive greater operational efficiency, reduce system complexity, and accelerate the ingestion and processing of healthcare data across platforms
  • Design scalable pipelines and platforms (eg, lakehouse, streaming, feature stores) that enable faster data availability for AI-driven insights and Real Time decision support

Requirements

  • 12+ years of software engineering experience, including hands-on technical experience building, maintaining and scaling data systems.
  • 5+ years of experience as a tech lead who successfully converts business/product requirements into well architecture designs.
  • Extensive experience in building and scaling large data pipelines including Real Time processing and/or 100+ GB transformation in Java, Python, DBT, and SQL.
  • Extensive experience in building and driving large business outcomes by leveraging a combination of existing and new technologies.
  • A deep knowledge of common data technology stacks such as GCP BigQuery, Snowflake, Databricks, DBT, Datalake architecture on AWS S3 or GCP Cloud storage.
  • A deep knowledge in cloud platforms such as AWS, GCP, or Azure, and cloud-native API solutions.
  • Deep knowledge of data modeling and data governance control
  • Strong RESTful API design principle, microservices architecture, distributed asynchronous system and good design patterns
  • Strong knowledge with CI/CD pipelines (CircleCI, Github Action), containerization (Docker, Kubernetes), and version control (Git), infrastructure as code (Pulumi, Terraform), relational and NoSQL databases, caching mechanisms (Redis, Memcached), and performance optimization techniques.
  • Strong leadership and communication skills, with the ability to influence cross-functional teams and communicate complex technical details to upper management and non-technical stakeholders.
  • Proven problem-solving ability with a focus on delivering solutions
  • Positive attitude: Maintaining a constructive approach to work challenges.

Language Skills

Ability to read, analyze and interpret general business periodicals, professional journals, technical procedures or governmental regulations. Ability to write reports, business correspondence and procedure manuals. Ability to effectively present information and respond to questions from a variety of both internal and external sources. Physical Capabilities

The physical capabilities described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. While performing the duties of this job, the employee is regularly required to sit; use hands to finger, handle, or feel; reach with hands and arms; and talk or hear. The employee is occasionally required to stand and walk. The employee must occasionally lift and/or move up to 10 pounds. Specific vision abilities required by this job include close vision, distance vision, color vision, peripheral vision, depth perception, and ability to adjust focus. EEO Statement

Apply for this position