GCP Data Scientist
Job description
This GCP Data Scientist role requires working 2 to 3 days in the local office in Austin, Texas.
- Contribute to the migration of a legacy data warehouse to a Google Cloud-based data warehouse
- Collaborate with Data Product Managers and Data Architects to design, implement, and deliver successful data solutions
- Help architect data pipelines for the underlying data warehouse and data marts
- Design and develop very complex ETL pipelines in Google Cloud data environments
- Our legacy tech stack includes Teradata; the new tech stack includes GCP cloud data technologies such as BigQuery and Airflow, with SQL and Python as the languages
- Maintain detailed documentation of your work and changes to support data quality and data governance
- Support QA and UAT data testing activities
- Support deployment activities to higher environments
- Ensure high operational efficiency and quality of your solutions to meet SLAs and support our commitments to our customers (Data Science and Data Analytics teams)
- Be an active participant and advocate of agile/scrum practice to ensure health and process improvements for your team
Requirements
- 8+ years of data engineering experience developing large data pipelines in very complex environments
- Very strong SQL skills and the ability to build very complex transformation data pipelines using a custom ETL framework in a Google BigQuery environment
- Exposure to Teradata and the ability to understand complex Teradata BTEQ scripts
- Strong Python programming skills
- Strong skills in building Airflow jobs and debugging issues
- Ability to optimize queries in BigQuery
- Hands-on experience with Google Cloud data technologies (GCS, BigQuery, Dataflow, Pub/Sub, Data Fusion, Cloud Functions)
Preferred Qualifications
- Experience with the cloud data warehouse technology BigQuery
- Nice to have: experience with GCP cloud technologies (GCS, Dataproc, Pub/Sub, Dataflow, Data Fusion, Cloud Functions)
- Nice to have exposure to Teradata
- Solid experience with job orchestration tools like Airflow and the ability to build complex jobs
- Experience writing and maintaining large data pipelines using a custom ETL framework
- Ability to automate jobs using Python
- Familiarity with data modeling techniques and standard data warehousing methodologies and practices
- Very good experience with code version control repositories like GitHub
- Good scripting skills, including Bash scripting and Python
- Familiar with Scrum and Agile methodologies
- Problem solver with strong attention to detail and excellent analytical and communication skills
- Ability to work in an onsite/offshore model and to lead a team
Benefits & conditions
The base compensation range for this role in the posted location is $76,200 to $187,740.
Capgemini provides compensation range information in accordance with applicable national, state, provincial, and local pay transparency laws. The base compensation range listed for this position reflects the minimum and maximum target compensation Capgemini, in good faith, believes it may pay for the role at the time of this posting. This range may be subject to change as permitted by law.
The actual compensation offered to any candidate may fall outside of the posted range and will be determined based on multiple factors legally permitted in the applicable jurisdiction. These may include, but are not limited to: geographic location, education and qualifications, certifications and licenses, relevant experience and skills, seniority and performance, market and business considerations, and internal pay equity.
It is not typical for candidates to be hired at or near the top of the posted compensation range.
In addition to base salary, this role may be eligible for additional compensation such as variable incentives, bonuses, or commissions, depending on the position and applicable laws.
Capgemini offers a comprehensive, non-negotiable benefits package to all regular, full-time employees. In the U.S. and Canada, available benefits are determined by local policy and eligibility and may include:
- Paid time off based on employee grade (A-F), defined by policy: vacation (12-25 days, depending on grade), company-paid holidays, personal days, and sick leave
- Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
- Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
- Life and disability insurance
- Employee assistance programs
- Other benefits as provided by local policy and eligibility
Important Notice: Compensation (including bonuses, commissions, or other forms of incentive pay) is not considered earned, vested, or payable until it becomes due under the terms of applicable plans or agreements and is subject to Capgemini's discretion, consistent with applicable laws. The Company reserves the right to amend or withdraw compensation programs at any time, within the limits of applicable legislation.
About the company
Capgemini is one of the world's leading providers of management and IT consulting, technology services, and digital transformation. As a pioneer of innovation, the company supports its clients with their complex challenges around cloud, digital, and platforms.