Databricks Data Engineer/Architect

Techclub, Inc
Seattle, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Seattle, United States of America

Tech stack

API
Amazon Web Services (AWS)
Business Analytics Applications
Azure
Big Data
Cloud Computing
Cloud Database
Cloud Storage
Databases
Continuous Integration
Data as a Services
Data Architecture
Information Engineering
Data Governance
ETL
Data Systems
Data Warehousing
DevOps
Python
Performance Tuning
Standard Sql
Software Deployment
Data Streaming
Management of Software Versions
Google Cloud Platform
Cloud Platform System
Azure
Spark
Data Lake
PySpark
Amazon Web Services (AWS)
Data Lakehouse
Data Pipelines
Databricks

Job description

  1. Design and develop scalable data pipelines using Databricks and Apache Spark.
  2. Build and maintain ETL/ELT processes for ingesting, transforming, and loading large volumes of data.
  3. Architect modern data platforms using the Databricks Lakehouse architecture.
  4. Implement data modeling solutions for data lakes, data warehouses, and analytics platforms.
  5. Optimize Spark jobs, clusters, and workloads for performance and cost efficiency.
  6. Integrate data from multiple sources, including cloud storage, databases, APIs, and streaming platforms.
  7. Work with cloud platforms such as AWS, Azure, or Google Cloud Platform, leveraging native data services.
  8. Establish data governance, security, data quality, and compliance standards.
  9. Develop and manage Delta Lake solutions, including data versioning and optimization.
  10. Collaborate with business stakeholders, data scientists, analysts, and engineering teams to define data requirements.
  11. Provide technical leadership, architecture guidance, and best practices for data engineering initiatives.
  12. Support CI/CD, automation, monitoring, and production deployment of data solutions.

Key Technical Skills Typically Required:

  • Databricks
  • Apache Spark (PySpark/Scala)
  • SQL
  • Delta Lake
  • Data Lakehouse Architecture
  • Azure Data Factory / AWS Glue (depending on cloud platform)
  • Python
  • Cloud Platforms (Azure, AWS, or Google Cloud Platform)
  • ETL/ELT Development
  • Data Modeling
  • CI/CD and DevOps Practices

Requirements

  • Strong hands-on experience with Databricks and Spark.
  • Experience designing enterprise-scale data architectures.
  • Expertise in cloud-based data engineering solutions.
  • Ability to lead technical discussions and mentor engineering teams.
  • Strong understanding of performance tuning, security, and data governance.

Apply for this position