Data Scientist (Big Data Systems)
Bayside Solutions
Cupertino, United States of America
2 months ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Compensation: $104K
Job location: Cupertino, United States of America
Tech stack
Data analysis
Big Data
Data Security
Linux
Hadoop Distributed File System
Python
DataOps
SQL Databases
Scripting (Bash/Python/Go/Ruby)
Spark
Presto
Splunk
Job description
Seeking a data scientist/analyst, ideally skilled in data analysis. While the role is primarily for a data scientist, we are also open to candidates with strong project management skills who can help oversee aspects of our data operations.
- Ad-hoc data analysis and investigations, running queries on our big data systems (SQL, Splunk, and HDFS)
Requirements
- Proficient in Python and able to automate scripts and tools using Python
- Experience in Spark to process and query large datasets efficiently
- Superb communication skills (both verbal and written) with the ability to present results of analyses in a clear and impactful manner
- Understanding of algorithms (and the ability to tweak them when needed) as well as the infrastructure that enables fast iteration
- Experience in documentation (e.g., data schemas, compliance policies, timeline/project management, and status updates)
- Capable of assisting with data investigations and providing recommendations on leveraging existing data effectively
- Assist in tracking and managing data projects to ensure successful completion
- Facilitate data access management, including data creation and the data onboarding process
Requirements and Qualifications:
- SQL querying (including anomaly/outlier detection)
- Python proficiency (scripting, automation, tooling development)
- Big Data systems experience (Trino/Presto and HDFS)
- Spark (processing and querying large datasets)
- Splunk
- Linux/Unix proficiency
- Data onboarding and GDPR/regulatory compliance experience
- Strong written and verbal communication skills
- Strong communicator: must be able to present analytical findings clearly to cross-functional teams (including legal)
- Self-directed and able to work across multiple engineering teams
- Detail-oriented with a compliance/regulatory mindset
- Comfortable managing ambiguity and long-running workflows (e.g., legal/GDPR processes)