Data Engineer - GCP/Spark/Scala

Infosys
Bentonville, United States of America
7 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Bentonville, United States of America

Tech stack

Airflow
Data analysis
Software Applications
Big Data
Software Bug Management
Code Review
Continuous Delivery
Information Engineering
ETL
Python
Performance Tuning
Power BI
SQL Databases
Systems Integration
Strategies of Testing
Workflow Management Systems
Data Processing
Sql Optimization
Spark
Reliability of Systems
Deployment Automation
Data Pipelines

Job description

In the assigned Job Role of Technology Consultant 2, your Area Of Responsibility will be as below:

Contribute to the requirements elicitation process by documenting assigned parts of business requirements, in line with guidance provided Facilitate software application design discussions, and document design decisions to guide the technical team towards building software solutions Participate in coding and integrate new features or updates into existing applications, with a focus on maintaining system stability Conduct code reviews, do changes to the codebase and maintain code repositories Implement test strategies, analyse results, and coordinate bug fixes to uphold the software quality standards Develop user training programs, documentation, and support frameworks to ensure a smooth transition to new software applications Actively participate in resolving production issues and recommend preventive strategies to enhance system reliability Maintain detailed records of code, testing techniques, and support activities to enrich the knowledge base and assist other similar projects, * Design, develop, and maintain scalable data pipelines on GCP.

  • Build and optimize data processing workflows using BigQuery, Spark, and GCS.
  • Develop and maintain ETL/ELT pipelines using Scala and Python.
  • Orchestrate and schedule data workflows using Apache airflow.
  • Write complex and optimized SQL queries for large scale datasets.
  • Integrate and process data form multiple sources ensuring data quality and reliability.
  • Implement and maintain CI/CD pipelines for automated deployment of data engineering workflows.
  • Troubleshoot performance issues and optimize data processing jobs.

Requirements

A collaborative spirit and excellent communication skills. The ability to handle end to end SDLC phases from requirement gathering to implementation. A knack for translating complex requirements into actionable development tasks. A passion for design and hands-on coding experience A proactive approach to testing, troubleshooting, and refining our applications. The ability to work with cross-functional teams and do software integration., * Strong hands-on experience with GCP

  • Expertise in BigQuery and Google Cloud Storage (GCS)
  • Proficiency in Scala and/or Python for data engineering workflows.
  • Strong experience with Apache Spark for large scala data processing.
  • Experience with Apache Airflow for workflow orchestration.
  • Advanced SQL skills for data analysis and transformation.
  • Experience implementing CI/CD pipelines., * Strong analytical and problem-solving skills.
  • Excellent communication and collaboration abilities.
  • Microsoft certifications (e.g., Power BI Data Analyst Associate, Fabric Analytics Engineer) are a plus., * Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • This position may require relocation and/or travel to work/project location.
  • Candidates authorized to work for any employer in the United States without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role now or in the future.

Benefits & conditions

Along with competitive pay, as a full-time Infosys employee you are also eligible for the following benefits:

  • Medical/Dental/Vision/Life Insurance
  • Long-term/Short-term Disability
  • Health and Dependent Care Reimbursement Accounts
  • Insurance (Accident, Critical Illness , Hospital Indemnity, Legal)
  • 401(k) plan and contributions dependent on salary level
  • Paid holidays plus Paid Time Off

About the company

Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem., The Infosys Retail, Consumer Goods, and Logistics unit stands as a globally respected partner of choice, dedicated to helping clients achieve their business goals through cutting-edge technology and seamless services. Our unit offers a dynamic forum where projects and teams can effectively learn, adopt, and excel in all technologies. We foster a vibrant community that leverages shared skills and experiences to deliver high-quality, value-enhanced solutions. Join us and become part of a team that drives innovation, operational efficiency, and sustainable growth in the retail, consumer goods, and logistics sector. Together, we can shape the future of these industries and achieve remarkable success.

Apply for this position