Data Engineer - Next Generation Big Data
Job description
Joining Amex Tech means discovering and shaping your contribution to something big. Here, you can work alongside talented tech teams and build a unique career with the Powerful Backing of American Express. With a range of opportunities to work with the latest technologies, and a commitment to back the broader engineering community through open source, our mission is to power your success. Because Amex Tech is powered by our technology, our culture, and our colleagues.
The Enterprise Data and AI organization unites the data governance, strategy, engineering, and product teams with those responsible for AI engineering, generative AI enablement, and automation product and engineering. This group plays a pivotal role in leveraging data as a core driver of innovation and integrating AI capabilities to transform products, operations, and customer experiences.
The EDAI organization also incorporates technology Research & Development and experimentation with emerging capabilities, along with engineering support for Amex Digital Labs. This integration ensures that research breakthroughs seamlessly translate into business impact.
Purpose of the Role:
LUMI is the company's largest Big Data platform, ideally suited to computationally intensive and/or data-intensive processing applications. Whether data needs to be processed in batch, online, or streaming mode, LUMI provides robust capabilities to handle such workloads effectively and cost-efficiently.
This is a hub of hardworking Big Data engineers and exciting, emerging technologies. The Cornerstone platform offers an environment where engineers are challenged every day to build world-class products.
As we embark on the journey to the public cloud (GCP), you will be part of a fast-paced Agile team that designs, develops, tests, troubleshoots, and optimizes solutions created to simplify access to Amex's Big Data Platform.
Focus:
Designs, develops, debugs, evaluates, modifies, deploys, and documents software and systems, and solves problems, to meet the needs of customer-facing applications, business applications, and/or internal end-user applications.
Organizational Context:
Member of an engineering or delivery and integration team, reporting to an Engineering Manager or Engineering Director.
Responsibilities
- Implement scalable and efficient data architectures on GCP
- Collaborate with cross-functional teams to understand data requirements and develop solutions that meet business needs
Data Pipeline Development:
- Build, test, and deploy data pipelines to move, transform, and process data from various sources to GCP
- Ensure the reliability, scalability, and performance of data pipelines
- Utilize GCP's big data technologies such as BigQuery, Dataflow, Dataproc, and Pub/Sub to implement effective data processing solutions
- Monitor system performance and proactively optimize data pipelines for efficiency
- Troubleshoot and resolve issues
- Create and maintain comprehensive documentation for tools, architecture, processes, and solutions
We back our colleagues with the support they need to thrive, professionally and personally. That's why we have Amex Flex, our enterprise working model that provides greater flexibility to colleagues while ensuring we preserve the important aspects of our unique in-person culture. Depending on role and business needs, colleagues will either work onsite, in a hybrid model (combination of in-office and virtual days) or fully virtually.
Requirements
- Understanding of GCP services such as Cloud Dataflow, Cloud Pub/Sub, BigQuery, Cloud Storage, and Cloud Composer
- Strong SQL knowledge
- Understanding of the fundamentals of Git and Git workflows
- Experience working in an agile application development environment
- Technical support for applications, including troubleshooting environment-, software-, and application-level issues
- Ability to write and test programs using Unix shell scripting and Oracle PL/SQL
- Experience supporting platform engineering activities (network, firewall)