Senior Data Scientist - R01566194
Role details
Job location
Tech stack
Job description
- Design and implement advanced statistical models and machine learning algorithms to solve complex business challenges
- Develop, validate, and deploy predictive models using Python, R, or PySpark, ensuring high accuracy and scalability
- Conduct rigorous hypothesis testing, including T-Test and Z-Test, to drive data-driven decision making and uncover actionable insights
- Apply regression techniques such as linear and logistic regression to analyze trends, forecast outcomes, and optimize business processes
- Utilize time series forecasting methods, including ARIMA, ARIMAX, and exponential smoothing, to deliver accurate demand and trend predictions
- Leverage classification methods such as decision trees and support vector machines to segment data and enhance model performance
- Collaborate with cross-functional teams to translate business requirements into analytical solutions and communicate findings effectively
- Ensure model reliability and compliance by implementing robust validation frameworks and tools, including Great Expectations and Evidently AI
Requirements
Do you have experience in Time series models?, Do you have a Master's degree?, * Experience Range: 5 - 8 years of experience in advanced data science roles, 1. Advanced proficiency in Python and PySpark for data analysis and modeling
-
Expertise in statistical analysis and computing, including hypothesis testing, T-Test, and Z-Test
-
Hands-on experience with regression techniques (linear and logistic regression)
-
Practical knowledge of probabilistic graph models
-
Experience with forecasting methods such as exponential smoothing, ARIMA, and ARIMAX
-
Proficiency in classification algorithms, including decision trees and support vector machines (SVM)
-
Strong command of ML frameworks such as TensorFlow, PyTorch, Sci-Kit Learn, CNTK, Keras, and MXNet
-
Skilled in using statistical tools like SAS and SPSS
-
Familiarity with R and R Studio for statistical modeling
-
Experience with distance metrics (Hamming, Euclidean, Manhattan Distance) Preferred Skills:
-
Experience implementing data validation frameworks such as Great Expectations and Evidently AI
-
Knowledge of model deployment tools like KubeFlow and BentoML
-
Background in handling large-scale data processing and distributed computing environments
-
Expertise in feature engineering and dimensionality reduction techniques
-
Familiarity with automated machine learning (AutoML) pipelines Desired Qualifications:
-
Bachelor's or Master's degree in Computer Science, Statistics, Mathematics, Data Science, or a related quantitative field
-
Relevant certifications in data science, machine learning, or statistical analysis are a plus