Data Engineer

TEKPULSE TECHNOLOGIES LLC
Cherry Hill Township, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote
Cherry Hill Township, United States of America

Tech stack

A/B testing
Airflow
Amazon Web Services (AWS)
Software Applications
JIRA
Cluster Analysis
Data Validation
Data Files
Data Infrastructure
Data Integrity
ETL
Document-Oriented Databases
Github
Google Analytics
Hive
Integrated Development Environments
Python
Machine Learning
MongoDB
MySQL
Natural Language Processing
Oracle Applications
Raw Data
SAS (Software)
PL-SQL
SQL Databases
Data Streaming
Support Vector Machine
Tableau
Text Mining
Azure
Informatica Powercenter
Flask
Snowflake
Random Forest
Kubernetes
Information Technology
XGBoost
Splunk
Data Pipelines
K Means
Docker
Jenkins
Databricks

Job description

Critical role designing, developing and maintaining scalable data pipelines, data sets, and systems for the company's data infrastructure. Will be collaborating with stakeholders to understand data requirements; Develop ETL (Extract, Transform, Load) processes to change raw data into usable formats for analysis, use and cost effectiveness utilizing techniques such as partitioning, indexing, and caching; Design and implement data models to support the business analytical and reporting needs; ensure data integrity, consistency, and performance and identify errors and performance issues; perform data quality checks for accuracy and consistency of data across systems and platforms; Work closely with business analysts/software engineers to understand their data needs and provide technical expertise and support. Document data pipelines, processes, and best practices for knowledge sharing and future reference; Identify and resolve issues to ensure uninterrupted data flow and availability. Relocation/ telecommuting may be required plus travel to various unanticipated client locations within the United States for short and long term assignments.

Languages, skills and tools: Any suitable combination of the following tools: SQL, SAS, Python, PyCharm, Hive, Tableau, GitHub, Jira, MySQL, Oracle, PL/SQL, MongoDB, Informatica ETL, AWS, Databricks, Azure Data Factory, Apache-Airflow, Flask, Docker, Kubernetes, Jenkins, Splunk, Snowflake, SAS, Google Analytics, Supervised and Unsupervised Machine Learning Algorithms, K-Means, SVM, Statistics, KNN, Time Series Forecasting, Survival Analysis, ETL, Business Intelligence, Text Mining, Clustering, Cross-Validation, Support Vector Machine, Decision Tree, Random Forest, A/B Testing, Market Basket Analysis, Linear Discriminant Analysis, Gradient Boosting, Bagging, Natural Language Processing, Sentimental Analysis, Data Modeling, PCA

Requirements

Masters degree in Computer Science/Computer Applications/Business Analytics/Technology/Engineering (Mechanical/ I.T./Electrical). Will accept Bachelors degree in Computer Science/Computer Applications/Business Analytics/Technology/Engineering (Mechanical/ I.T./Electrical) plus five (5) years of progressive experience in related fields in lieu of Masters Degree. Will accept foreign education equivalent.

Apply for this position