Data Engineer
Inclusion Inc.
Dallas, United States of America
10 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
EnglishJob location
Dallas, United States of America
Tech stack
Artificial Intelligence
Airflow
Azure
Software as a Service
Databases
ETL
Data Warehousing
Programming Tools
Revision Control Systems
Python
Machine Learning
Power BI
SQL Databases
T-SQL
Workflow Management Systems
Datadog
Spark
GIT
Microsoft Fabric
PySpark
Performance Monitor
Database Mirroring
Cloud Optimization
REST
Software Version Control
Data Pipelines
Custom Reports
Databricks
Job description
Our client is building a next generation SaaS application that ingests data from various sources, processes through a data pipeline and executes machine learning models to provide predictions on Project Management data. The solution will leverage cutting edge technologies hosted in Microsoft Azure including Databricks, Apache Airflow, Fabric database mirroring, Fabric Lakehouse, Semantic Models, Materialized Lake Views and Power BI. The app users will have the ability to access Machine Learning predictions, create custom reports and combine data from various sources.
Requirements
Do you have experience in Version control?, * Python (expert)
- Spark/pySpark (advanced)
- T-SQL (advanced)
- ETL/ELT and creation of data pipelines (expert)
- Databricks (advanced)
- Data modeling (advanced)
- Experience with scheduling and orchestration tools, specifically Apache Airflow (advanced)
- Azure services including compute, storage, databases and developer tools (advanced)
- Data warehousing concepts (expert)
- Version control tools such as Azure DevOps Git (advanced)
- Performance monitoring and optimization of code and tooling (advanced)
- Security including row level and object level
- Invoking RESTful API's
- Strong aptitude and ability to work independently
- Medallion architecture
Desired Skills
- Experience with Microsoft Fabric and implementing data pipelines using Fabric tooling
- Knowledge of AI concepts and experience executing ML models
- Cloud cost optimization
- Knowledge of OneLake security
- Sharing of Power BI and Semantic Models in Microsoft Fabric
- Understanding of cross tenant data sharing in Microsoft Fabric
- Team leadership experience
- Observability tooling, * Knowledge of Project Management and Financial concepts including budgets, tasks, revenue, profit and earned value
- Certification in Microsoft Azure and/or Python, SQL, Databricks