Data Engineer

Cyber Sphere LLC
McLean, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

McLean, United States of America

Tech stack

Java
ActiveMQ
Agile Methodologies
Amazon Web Services (AWS)
Data analysis
Azure
Big Data
Cloud Engineering
Data as a Services
Data Architecture
Information Engineering
ETL
Data Mapping
Data Migration
Data Presentation
Relational Databases
DevOps
Distributed Computing Environment
Python
Machine Learning
Microsoft SQL Server
MongoDB
NoSQL
Operational Data Store
Oracle SQL Developer
DataOps
Data Streaming
Systems Integration
Tableau
Enterprise Data Management
Qliksense
Informatica Powercenter
Delivery Pipeline
Snowflake
Grafana
Spark
Spring-boot
SAP Sybase ASE
Deep Learning
Backend
Data Lake
PySpark
Infrastructure Automation Frameworks
Information Technology
Amazon Web Services (AWS)
Data Analytics
Performance Monitor
QlikView
Bitbucket
Operational Systems
Data Management
Machine Learning Operations
Cloudwatch
Terraform
Looker Analytics
Data Pipelines
Amazon Web Services (AWS)
ELK
Jenkins
Databricks
Microservices

Job description

Senior Data Engineer for design and deployment of our next-generation Enterprise Data-as-a-Service (DaaS) solution. This role bridges the gap between legacy operational systems and modern cloud-native analytics, focusing on building and operationalizing robust data pipelines across AWS and Snowflake ecosystems.

The successful candidate will have a strong background in distributed data processing, lakehouse architecture, and a passion for solving complex business problems through data democratization

What You ll Get to Do:

Design, build, and maintain a robust Data-as-a-Service (DaaS) provider model to effectively decouple data producers from downstream consumers.

Architect and manage end-to-end data pipelines through a multi-stage Medallion architecture.

Lead digital transformation initiatives by migrating legacy operational data from MS SQL Server and Sybase to modern, cloud-native Snowflake and NoSQL (MongoDB) environments.

Develop and implement high-performance, event-driven backend solutions using Spring Boot microservices and ActiveMQ.

Design and operationalize scalable cloud-native processing services using AWS Glue (PySpark), Amazon EMR, and Snowflake.

Implement comprehensive data observability and proactive monitoring solutions using AWS CloudWatch, Grafana, and the ELK Stack.

Streamline development through DevOps best practices, maintaining efficient and reliable CI/CD pipelines via Jenkins, Bitbucket, and Terraform.

Collaborate with cross-functional engineering, product, and business teams

Solve challenging problems using data science and advanced machine learning methods.

Apply data science techniques to process and wrangle data in preparation for analysis; and analyse big data and perform exploratory analysis to uncover insights and identify innovative opportunities.

Leverage machine learning techniques, to explore and examine data from multiple disparate sources to provide a competitive advantage or address a pressing business problem.

Define approaches to embed and scale machine learning models with senior data scientist oversight

Assist on client pitches and proposals for machine learning projects

Work on difficult engineering problems with scientists and business leaders with minimal engineering or data science backgrounds.

Work with the data science team to build reusable assets, solutions and develop best practices for current and future business problems.

Work with the engineering team to develop end to end production ML systems

Staying up to date on the latest machine learning capabilities.

Work with other analysts to utilize analytics to solve business problems in various areas within financial services, like capital markets, wealth management, asset management, fraud, AML, process mining, customer centricity, risk management, talentology etc.

Requirements

7+ years of professional experience in Data Engineering and Analytics, with a focus on building high-availability, enterprise-grade data platforms.

Expertise in building and troubleshooting event-based microservices using Java and Spring Boot.

Proficiency in Apache Spark (Batch & Structured Streaming) for large-scale data transformation using Python (PySpark) or Scala.

Extensive experience with Snowflake and modern Lakehouse architectures (Databricks/Delta Lake).

Deep understanding of NoSQL (MongoDB) and RDBMS (SQL Server/Sybase).

Experience automating infrastructure using Terraform (IaC) and managing CI/CD pipelines (Jenkins, Azure DevOps, or Bitbucket).

Experience with data observability tools (ELK Stack or Grafana).

Experience working in a fast-paced Agile development environment.

Financial services industry experience 7+ years of relevant experience in analytics, business intelligence, data engineering, or related field

Experience with Informatica, ETL, Databricks 5+ years of demonstrated experience in building enterprise grade BI applications using tools such as: QlikSense, QlikView, Tableau or Looker

Must have PySpark and Snowflake experience

Experience working in large data warehouse environments

Bachelor s degree in a quantitative area such as math, statistics, computer science, engineering, or equivalent experience

Strong verbal/written communication and data presentation skills, including an ability to effectively communicate with both business and technical teams

Experience in using of Oracle SQL tools for data analysis is required

Candidates having strong experience in Data Mapping, Data Migration and system integration projects with knowledge of Wealth Management and related investment products.

Would be a plus, with knowledge in any of the Wealth Management products like TCS BaNCS

Desirable skills

Financial services industry experience

Experience with data visualisation tools (Tableau, QlikView etc)

Experience with Informatica, ETL, Databricks

Apply for this position