Senior Data Software Engineer
EPAM Systems, Inc.
Poland, United States of America
yesterday
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Poland, United States of America
Tech stack
Clean Code Principles
Java
.NET
Agile Methodologies
Artificial Intelligence
Big Data
C Sharp (Programming Language)
Cloud Computing
Cloud Database
Data Distribution Service
Data Security
Data Virtualization
Distributed Computing Environment
Python
NumPy
Open Source Technology
Systems Development Life Cycle
TensorFlow
SciPy
SQL Databases
Data Streaming
Enterprise Software Applications
Spark
Pandas
Storage Technologies
Data Programming
Kafka
Code Inspection
Database Replication
Data Delivery
GPT
Databricks
Job description
- Engineer solutions focused on data virtualization, distributed processing and cloud-based large-scale data storage architecture.
- Design and implement a highly scalable and reliable data distribution solution using Python, Spark and open-source technologies to deliver data to business components.
- Work with ADLS, Databricks, Kafka, Denodo, data modeling and data replication for handling large data sets.
- Apply clustering, SQL query patterns and indexing techniques to support large-scale data operations.
- Define technical approaches and determine the appropriate methodologies.
- Collaborate with solution architects and business analysts to define the implementation design, and code assigned modules to the highest quality.
- Apply standards, methods, techniques and templates defined by the SDLC, including code control, code inspection and code deployment.
- Design, plan and deliver solutions in a large-scale enterprise environment.
- Produce secure and clean code that is stable, operational, consistent and well-performing.
- Communicate technical analyses, recommendations, status and results effectively to the project management team.
- Operate in a collaborative, multi-site environment to support rapid development and delivery of results and capabilities (Agile SDLC).
Requirements
- 6-9 years of hands-on experience developing large-scale applications using data virtualization and/or data streaming technologies.
- Background as a software engineer/developer focused on cloud-based data virtualization and data delivery technologies.
- Proficiency in Python for coding and development.
- Familiarity with .NET, C# or Java if you have no hands-on experience in Python.
- Expertise in Python data access libraries (NumPy, SciPy, pandas).
- Knowledge of machine learning frameworks (TensorFlow) and AI tools (ChatGPT).
- Skills in designing, planning and delivering solutions in a large-scale enterprise environment.
- Capability to determine technical approaches and define appropriate methodologies.
- Understanding of Agile SDLC and ability to work in a collaborative, multi-site environment.
- Competency in effectively communicating technical analyses, recommendations and results to project stakeholders.
- English proficiency at B2 level or higher.