Associate Data Engineer
Role details
Job location
Tech stack
Job description
The Associate Data Engineer will be reporting into Data Integration and Analytics team in Product Engineering at San Diego, California. You will be part of team that is committed to providing highly reliable data infrastructure, supporting advanced data analytic tools and automation to drive efficiency, productivity and data uniformity in the company.
The assigned tasks will be entry level to Moderate in Nature, requiring basic domain knowledge and foundational technical skills with opportunities to build proficiency through coaching and ongoing project-based work. In this role you will be primarily responsible for supporting maintenance of the infrastructure and tools for engineering data analysis and automation. In this role you will interface mainly with IT and Product Engineering groups to support the implementation of solutions. This position operates in fast-paced environment and must be flexible to project needs. The Associate Data engineer will need to collaborate within the immediate team and cross functionally in an effective way and be able to meet project requirements on schedule.
Roles & Responsibilities
This position has responsibility for:
-
Support and monitor assigned ETL pipelines which integrate data from internal and external data sources such as overseas manufacturing and test locations, foundries, module assembly factories to support analysis by Engineering teams.
-
Collaborate with PE and IT and support them in resolving production issues related to data flow and data quality for robust data ingestion and disposition.
-
Monitor existing automations and legacy jobs; identify recurring failures or performance bottlenecks and recommend improvements for implementation.
-
Aid in implementation of analytic tools and machine learning models that utilize existing data processing pipelines to provide actionable insights into operational efficiency, and other key business performance metrics.
-
Contribute incremental improvements to reliability and accuracy of dashboards and templates by implementing data validation checks, documentation updates, and small automation enhancements.
-
Support team operations in cloud and distributed data stores (e.g., Cassandra) by completing assigned tickets, learning operational practices, and updating runbooks/documentation.
-
Participate in mentor/mentee development, complete training plan milestones, and cross-train as backup for defined production support procedures.
-
Participate in Mentor-Mentee opportunities and train as backup on critical production functions.
-
Works under close supervision with regular code review and guidance from senior engineers.
Requirements
-
0 - 2 years of work experience (with Bachelor's degree) in Computer Science/software development/Information Systems or equivalent
-
Understanding of ETL pipeline setup (with basic experience preferred)
-
Foundational file I/O level skills in programming languages like Python/R
-
Basic SQL skills (joins, updates, etc.)
-
Familiarity with Shell scripting and willingness to debug wide varieties of legacy and current code bases.
-
Experience with Linux/Unix based operating systems.
-
Experience in relational databases such as Oracle SQL or SQL server etc.
-
Experience with Git-based workflows (PRs, Code Review)
-
knowledge of distributed database concepts and workings
-
Basic knowledge about BI tools such as Tibco Spotfire or tableau or Microsoft Power BI etc.
-
Strong written and verbal communication; comfortable documenting work and providing status updates
Education Requirements
- BS Degree in Computer Science or Software Engineering; MS preferred.