Data Engineer architect

Alltech Consulting Services

Dallas, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Dallas, United States of America

Tech stack

Java

Amazon Web Services (AWS)

Business Analytics Applications

Data analysis

Azure

Big Data

Google BigQuery

Databases

Continuous Integration

Data Architecture

Information Engineering

Data Files

Data Governance

Data Infrastructure

ETL

Data Masking

Data Presentation

Data Systems

Data Visualization

Data Warehousing

DevOps

Amazon DynamoDB

Graph Database

Hadoop

Python

Machine Learning

MongoDB

Raw Data

Cloud Services

SQL Databases

Data Streaming

Talend

Test Data

Data Processing

Google Cloud Platform

Snowflake

Spark

Kubernetes

Kafka

Build Tools

REST

Stream Processing

Azure

Data Pipelines

Docker

Redshift

Programming Languages

Job description

Looking for a data professional with expertise in data engineering, programming and architect to build systems that collect, manage, and convert raw data into usable information for business requirements. As a Data Engineer architect, you'll play a crucial role in ensuring data reliability, quality, and efficiency within the organization. Responsibilities: Analyze and organize raw data: Work with various data sources, document parsing, extracting relevant information and structuring it for further processing. Build data systems and pipelines: Construct robust data pipelines that facilitate data flow from source to Target. Evaluate business needs and objectives: Understand the company's requirements and align data systems accordingly. Interpret trends and patterns: Use your analytical skills to identify data patterns. Conduct complex data analysis and report on results: Dive deep into data to extract meaningful information. Prepare data for prescriptive and predictive modeling: Ensure data is ready for machine learning and statistical analysis. Build algorithms and prototypes: Develop and test data processing algorithms. Combine raw information from different sources: Integrate data from various systems. Explore ways to enhance data quality and reliability: Continuously improve data processes. Identify opportunities for data acquisition: Stay informed about new data sources. Develop analytical tools and programs: Create tools to facilitate data analysis. Collaborate with data scientists and architects: Work closely with other data professionals to achieve common goals. Implement data access controls, data encryption, and data masking techniques Familiarity with data visualization tools and techniques for presenting data Create and maintain dashboards and reports for stakeholders.

Requirements

15+ years in Data Engineering / Data Architecture Good knowledge of programming languages (e.g., Python, Java, Spark, etc). Design, develop, and maintain data pipelines. Exposure to process automation. Experience working with REST APIs and services, messaging and event technologies. Experience working with large and complex data sets. Hands-on experience with SQL/No-SQL database (RDS, Redshift, DynamoDB, synapse, big query, mongo, etc.) Batch/stream data processing experience Monitor, troubleshoot, and optimize the performance of data infrastructure to ensure scalability, reliability, and cost efficiency. Stay up to date with cloud services and best practices in data engineering to continuously improve our data ecosystem. Good exposure on at least two public cloud platforms (Azure/AWS/Google Cloud Platform) Experience with Graph database (e.g. Neptune, RDF4j, etc.) Experience with Vector database (e.g. Pinecone, FAISS, etc.) Hands-on experience with ETL/ELT tools (Informatica, Talend, dbt, etc.) - Expertise in cloud platforms: AWS / Azure / Google Cloud Platform - Experience with big data technologies: Spark, Hadoop, Kafka - Strong knowledge of data warehousing (Snowflake, Redshift, BigQuery) - Experience in building real-time and batch data pipelines - Strong understanding of data governance and security frameworks - Experience with CI/CD, DevOps practices for data pipelines - Proven leadership and stakeholder management skills Good-to-Have Skills:

knowledge or work experience in insurance, mortgage, banking domains.
Proficiency in building stream processing systems using kinesis, Kafka, etc.
Familiarity with Docker, Kubernetes, CI/CD and cloud services (AWS, Azure, Google Cloud Platform).
Technical expertise with segmentation techniques.
NLP knowledge

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all