Principal Data Engineer

PRIME RECRUITMENT & SERVICES AGENCY, LLC
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Software as a Service
Encodings
Continuous Integration
Data as a Service
Data Architecture
Information Engineering
Data Governance
Data Infrastructure
Data Transformation
Data Structures
Data Systems
Data Warehousing
Relational Databases
Distributed Systems
Graph Database
Machine Learning
Metadata Management
MongoDB
Neo4j
Operational Data Store
Data Processing
Data Ingestion
SQL Optimization
Snowflake
Spark
Data Strategy
Build Management
Infrastructure Automation Frameworks
Apache Flink
Data Analytics
Stream Processing
Data Pipelines
Microservices

Job description

Serv, a global executive recruitment partner, is hiring a Principal Data Engineer on behalf of our client, Virtual Intros.

Join the Team

Virtual Intros is building the intelligence layer for meaningful, privacy-first connection across people, companies, and communities, turning real-time engagement into measurable business value. The platform is supported by modern architecture including microservices, Infrastructure as Code, and distributed systems, and is now evolving to include a secure, multi-tenant data foundation that powers analytics, reporting, and AI-enabled capabilities at enterprise scale.

Position Responsibilities

This role owns the design and delivery of the company's data platform, setting architectural direction while remaining hands-on in building the systems that power analytics, reporting, and AI capabilities. You will define data standards, ensure tenant-safe design, and build a scalable, trustworthy data foundation used across the business. This role includes:

  • Define and own the foundational data architecture including data models, relationships, and source-of-truth systems
  • Establish data contracts and guide the evolution from operational data to analytical and AI-ready systems
  • Design and build end-to-end data pipelines from ingestion through serving layers
  • Implement scalable ingestion from relational databases, object storage, event streams, and SaaS APIs
  • Own transformation layer architecture including modeling, testing, documentation, and CI/CD integration
  • Establish data quality, observability, schema management, and pipeline reliability standards
  • Design multi-tenant data models, storage layouts, and access patterns
  • Implement tenant-aware security including row-level and column-level controls
  • Define data lifecycle management including retention, archival, and deletion strategies
  • Establish data governance standards including classification, lineage, cataloging, and auditability
  • Partner with security and compliance teams to ensure privacy and regulatory alignment
  • Design serving layers for analytics, reporting, and internal business use
  • Prepare data systems to support AI-enabled capabilities including embeddings and advanced data structures
  • Evaluate and implement graph-based data models where appropriate
  • Mentor engineers and provide architectural oversight across internal and external contributors
  • Lead vendor evaluation and manage external data partners to ensure alignment with standards
  • Drive data literacy and data-informed decision making across the organization

Requirements

  • 10+ years of experience in data engineering with principal-level ownership

  • Proven experience owning foundational data architecture in production environments

  • Strong experience with modern data pipelines across batch and streaming systems

  • Deep expertise in data modeling for both operational and analytical use cases

  • Proven experience designing multi-tenant data systems and isolation strategies

  • Strong understanding of data governance, privacy, and sensitive data handling

  • Advanced SQL skills and experience with modern processing frameworks such as Spark, Flink, or equivalent

  • Experience with AWS data services including S3, Glue, Kinesis, RDS, or similar

  • Experience ingesting and modeling data from document-based systems such as MongoDB

  • Experience working with graph data models and graph databases such as Neo4j or Amazon Neptune

  • Experience with modern cloud data warehouses such as Snowflake or Redshift

  • Experience with data transformation tools such as dbt including testing and CI/CD

  • Experience building ingestion pipelines from CDC, object storage, and streaming systems

  • Experience managing or governing external data vendors and partners

  • Experience communicating data strategy and tradeoffs to executive stakeholders

Preferred Experience

  • Experience supporting AI, machine learning, or embedding-based data use cases

  • Experience with data cataloging, lineage, and governance tools

  • Background in enterprise or regulated data environments

  • Experience with real-time and event-driven data architectures

  • Experience in high-growth or startup environments scaling toward enterprise

Our Ideal Teammate Is Someone Who:

  • Is proactive and accountable

  • Thinks strategically and executes consistently

  • Communicates with clarity and confidence

  • Brings a positive, growth-oriented mindset

  • Is highly organized and dependable

  • Embraces learning and feedback

  • Cares deeply about doing great work

Location: Remote
