Data Engineer

NTT DATA, Inc.
Irving, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Irving, United States of America

Tech stack

Java
Airflow
Amazon Web Services (AWS)
Automation of Tests
Azure
Data Control
Database Models
Django
Python
Machine Learning
NoSQL
NumPy
Scrum
Unstructured Data
Google Cloud Platform
Data Storage Technologies
Cloud Platform System
Sql Optimization
Spark
Pandas
Build Management
Data Lake
PySpark
Kafka
Non-relational Database
Data Management
Machine Learning Operations
Data Pipelines

Job description

Design and build robust data pipelines to ingest structured and unstructured data from multiple sources. Build and optimize data storage solutions (relational/NoSQL databases, data lakes) to handle scale and performance. Implement validation checks, automated testing, and data monitoring to ensure accuracy and compliance. Partner with Data Scientists, Product Managers, and Software Engineers to build infrastructure that supports machine learning models and BI dashboards. Negotiate features and associated priorities and help the team and their customers reach consensus. Develops and/or leads the development of prototypes, Identify problem causality, business impact and root causes. Coming up with exact solutions for problems related to object identity and error handling.

Requirements

Overall 5+ years of experience. 3+ Years of experience in building data pipelines using PySpark, Django, High proficiency in Python or Java/Scala. 2+ Years of experience in Advanced SQL skills and experience with database modeling. Hands-on experience with technologies like Apache Spark and Kafka. Familiarity with deploying data platforms in cloud environments such as AWS, Google Cloud, or Azure. Hands-On experience in working with Python and related packages (like NumPy, pandas etc.) to load and scrap the data. Experience scheduling data workflows using tools like Apache Airflow or dbt. Should have hands on experience on the MLOps. Working experience on Relational/Non-relational databases and familiarity with data model concepts Working exposure in blending as part of larger scrum team and understanding of related scrum ceremonies Working knowledge of Unix/Linux.

About the company

NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at us.nttdata.com

Apply for this position