Data Engineer
VC5 Consulting
Houston, United States of America
6 days ago
Role details
Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Job location
Houston, United States of America
Tech stack
Artificial Intelligence
Data analysis
Information Systems
Information Engineering
Data Integration
Python
PostgreSQL
Microsoft SQL Server
NoSQL
Oracle Applications
SQL Databases
Snowflake
Microsoft Fabric
Data Lake
PySpark
Data Analytics
Data Pipelines
Databricks
Job description
This role focuses on designing, developing, and supporting data engineering and data integrations, primarily in a Databricks data lake house environment. The Data Engineer will collaborate closely with business users to build semantic data models and dashboards and to perform hands-on analysis that addresses business questions. This position plays a crucial role in establishing a robust data foundation that supports AI and machine learning initiatives, ensuring high-quality, well-governed data products.
- Design and develop reliable data pipelines that ingest data from various source systems into a Databricks data lake house.
- Curate data through a layered medallion architecture into clean, analytics-ready datasets.
- Collaborate with business users to build semantic data models and dashboards that support decision-making.
- Conduct hands-on data analysis to answer business questions and provide actionable insights.
- Establish and maintain data governance and quality practices to ensure the reliability of data products.
Requirements
- Minimum of 5 years of IT/technology experience spanning data analysis, data engineering, and/or data integration.
- At least 3 years of experience writing SQL/NoSQL queries, with specific experience in MS SQL Server, Oracle, and/or Postgres.
- Bachelor's degree in Information Systems, IT, or a related technical discipline.
- Hands-on experience with a modern cloud data platform or lake house, such as Databricks, Microsoft Fabric, or Snowflake.
- Strong Python skills for data engineering, including experience with PySpark.