Data Analyst
Dcode Talent LLC
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Tech stack
Data analysis
Big Data
Data Cleansing
Data Transformation
Data Mining
Python
Machine Learning
Standard Sql
SQL Databases
Google Cloud Platform
Model Validation
Pandas
PySpark
Enterprise Integration
Data Pipelines
Databricks
Requirements
Databricks Data Engineer certification (most important; can be completed after onboarding).
- Google Cloud certification (acceptable post-hire).
- SQL Skills:
- Ability to independently write and execute SQL queries.
- Proficient in data extraction and basic to intermediate data transformations.
- PySpark Expertise:
- Strong hands-on experience required.
- Core responsibility: building data pipelines and performing large-scale data processing.
- Python/Pandas Skills:
- Required for data cleaning, transformation, and analysis.
- Machine Learning:
- 20% of the role.
- Must have hands-on experience with ML (classification, regression, clustering, model evaluation).
- No ML training will be provided.
- Data Analysis Focus:
- Majority of work involves SQL-based data extraction and PySpark/Pandas-driven analysis and insights.
- Workflow Integration:
- Should understand end-to-end workflows that combine SQL, PySpark, and ML.