Data Engineer

NTT DATA, Inc.

Irving, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Irving, United States of America

Tech stack

Java

Airflow

Amazon Web Services (AWS)

Automation of Tests

Azure

Data Control

Database Models

Django

Python

Machine Learning

NoSQL

NumPy

Scrum

Unstructured Data

Google Cloud Platform

Data Storage Technologies

Cloud Platform System

SQL Optimization

Spark

Pandas

Build Management

Data Lake

PySpark

Kafka

Non-relational Database

Data Management

Machine Learning Operations

Data Pipelines

Job description

Design and build robust data pipelines to ingest structured and unstructured data from multiple sources.

Build and optimize data storage solutions (relational/NoSQL databases, data lakes) to handle scale and performance.

Implement validation checks, automated testing, and data monitoring to ensure accuracy and compliance.

Partner with Data Scientists, Product Managers, and Software Engineers to build infrastructure that supports machine learning models and BI dashboards.

Negotiate features and their priorities, and help the team and its customers reach consensus.

Develop, or lead the development of, prototypes.

Identify problem causality, business impact, and root causes.

Devise precise solutions to problems related to object identity and error handling.

Requirements

5+ years of overall experience.

3+ years of experience building data pipelines using PySpark and Django.

High proficiency in Python or Java/Scala.

2+ years of experience with advanced SQL and database modeling.

Hands-on experience with technologies like Apache Spark and Kafka.

Familiarity with deploying data platforms in cloud environments such as AWS, Google Cloud, or Azure.

Hands-on experience with Python and related packages (such as NumPy and pandas) to load and scrape data.

Experience scheduling data workflows using tools like Apache Airflow or dbt.

Hands-on experience with MLOps.

Working experience with relational and non-relational databases, and familiarity with data modeling concepts.

Experience working as part of a larger Scrum team, with an understanding of the related Scrum ceremonies.

Working knowledge of Unix/Linux.

About the company

NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize, and transform for long-term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation, and management of applications, infrastructure, and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at us.nttdata.com.
