Databricks - Data Engineer

Guidehouse Inc.
Arlington, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Arlington, United States of America

Tech stack

API
Amazon Web Services (AWS)
Bash
Big Data
Continuous Integration
Data Governance
Data Integrity
ETL
Distributed Systems
Python
Performance Tuning
Scripting (Bash/Python/Go/Ruby)
Cloud Platform System
Azure
Spark
Data Lake
PySpark
Kubernetes
Infrastructure Automation Frameworks
Kafka
Machine Learning Operations
Video Streaming
Terraform
Data Pipelines
Databricks

Job description

  • Develop and implement CI/CD pipelines for Databricks notebooks and jobs.
  • Develop ETL pipelines using PySpark and Databricks.
  • Implement Delta Lake for ACID transactions and data reliability.
  • Optimize ingestion from APIs, streaming, and batch sources.
  • Ensure compliance with data governance and security standards.
  • Collaborate with data engineers and scientists to support data pipelines and ML workflows.
  • Conduct ETL and data quality analysis using various technologies (ie, Python, Databricks).
  • Ensure data governance and quality assurance standards are met.
  • Organize and lead meetings, including scheduling meetings; drafting and delivering agendas and meeting minutes; providing and archiving required documentation; and documenting, tracking, and following up on action items.
  • Summarize and present information and reports to the team and make recommendations (both oral and written).

Requirements

  • US Citizenship is required
  • A bachelor's degree is required
  • Minimum THREE (3) years of total experience in cloud-based data platforms
  • Minimum THREE (3) years of experience with Databricks
  • Strong Scripting skills (Python, Bash).
  • Experience with Delta Lake and Unity Catalog.
  • Strong knowledge of Spark architecture and distributed computing.
  • Hands-on experience with Terraform or other IaC tools.
  • Experience with Unity Catalog and Delta Lake.
  • Experience with data modeling and performance tuning.
  • Experience with streaming technologies (Kafka, Event Hub).
  • Experience with using CI/CD for data pipelines.
  • Familiarity with Kubernetes and container orchestration.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills, with the ability to work effectively in a team environment.

What Would Be Nice To Have:

  • Databricks Certified Data Engineer Associate or Professional.
  • Azure Data Engineer Associate or AWS Big Data Specialty.
  • Active Secret or Top Secret Clearance'

Benefits & conditions

Guidehouse offers a comprehensive, total rewards package that includes competitive compensation and a flexible benefits package that reflects our commitment to creating a diverse and supportive workplace.

Benefits include:

  • Medical, Rx, Dental & Vision Insurance
  • Personal and Family Sick Time & Company Paid Holidays
  • Position may be eligible for a discretionary variable incentive bonus
  • Parental Leave and Adoption Assistance
  • 401(k) Retirement Plan
  • Basic Life & Supplemental Life
  • Health Savings Account, Dental/Vision & Dependent Care Flexible Spending Accounts
  • Short-Term & Long-Term Disability
  • Student Loan PayDown
  • Tuition Reimbursement, Personal Development & Learning Opportunities
  • Skills Development & Certifications
  • Employee Referral Program
  • Corporate Sponsored Events & Community Outreach
  • Emergency Back-Up Childcare Program
  • Mobility Stipend

About Guidehouse

Guidehouse is an Equal Opportunity Employer-Protected Veterans, Individuals with Disabilities or any other basis protected by law, ordinance, or regulation.

Guidehouse will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of applicable law or ordinance including the Fair Chance Ordinance of Los Angeles and San Francisco.

Apply for this position