Data Scientist
Job description
Overview
SME Careers connects subject-matter experts, students, and professionals with flexible, remote AI training work such as annotation, evaluation, fact-checking, and content review. We partner with leading AI teams, and all contributions are paid weekly once approved, ensuring consistent and reliable compensation.
We are looking for a skilled data professional to support a project focused on generating high-quality visualizations from code-based prompts for AI data training. The ideal candidate will have hands-on experience with data visualization tools, a solid understanding of statistical concepts, and the ability to evaluate code and graphical outputs effectively. This contract role is designed for someone who can contribute technical expertise toward improving AI systems through structured content creation and review.
What You'll Do
Create Prompt-Based Content
- Design coding exercises and sample responses to teach models how to generate a range of plots and graphs from structured prompts.
- Develop examples across data analysis scenarios involving basic statistics, distributions, and experimental methods.
- Ensure clarity, correctness, and completeness in both code and the associated visual outputs.
Create Responses to Coding Prompts
- Develop responses to coding prompts that demonstrate correct data visualization approaches.
Assess Quality of Model Output
- Review responses generated by AI for correctness in code and visual presentation.
- Identify errors in statistical interpretation or visualization logic, and suggest improvements.
- Provide feedback on the model's use of plotting libraries and its understanding of core data analysis workflows.
Support Evaluation Criteria
- Validate that visualizations convey appropriate insights based on statistical inputs like averages, variability, and distribution type.
- Check for proper usage of tools such as pandas, matplotlib, and seaborn, as well as structure and readability of code.
- Help refine evaluation guidelines by assessing performance across a range of prompt types.
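As a rough illustration of the checks described above, a reviewer might recompute the summary statistics independently before judging whether a plot represents them faithfully. The sketch below uses only the Python standard library. The function name `summarize`, the sample values, and the normal-approximation confidence interval are assumptions made for illustration, not part of the project's actual guidelines; for small samples a t-based interval would usually be more appropriate.

```python
import math
import statistics
from statistics import NormalDist


def summarize(sample, confidence=0.95):
    """Return n, mean, median, sample standard deviation, and an
    approximate confidence interval for the mean (normal approximation)."""
    n = len(sample)
    mean = statistics.mean(sample)
    median = statistics.median(sample)
    stdev = statistics.stdev(sample)  # sample (n - 1) standard deviation
    # Two-sided critical value from the standard normal distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * stdev / math.sqrt(n)
    return {
        "n": n,
        "mean": mean,
        "median": median,
        "stdev": stdev,
        "ci": (mean - half_width, mean + half_width),
    }


# Hypothetical sample values, made up for illustration only.
sample = [4.1, 5.0, 5.2, 4.8, 6.3, 5.5, 4.9, 5.1]
summary = summarize(sample)
print(summary)
```

These numbers can then be compared with what the visualization shows, for example whether the error bars drawn by matplotlib or seaborn match the interval the prompt asked for.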
Qualifications
- 7+ years of hands-on experience.
- Background in statistics or data science with applied experience in probability, hypothesis testing, and experimental analysis.
- Proficiency in Python, with strong skills in libraries such as pandas, matplotlib, and seaborn.
- Ability to interpret and evaluate code written for data visualization.
- Deep knowledge of statistical terms and techniques, including mean, median, standard deviation, and confidence intervals.
Nice to Have
- Exposure to libraries such as scikit-learn, SciPy, or statsmodels.
- Experience working with R or MATLAB in a data analysis or visualization context.
- Understanding of basic experimental design (e.g. A/B testing) and its representation through visual data.
Please make sure to add the JOB ID: DataScience-25-119 when applying for the position.
Specifications
- Seniority level: Mid-Senior level
- Employment type: Contract
- Job function: Quality Assurance, Information Technology, and Engineering
- Industries: Data Infrastructure and Analytics; IT Services and IT Consulting
Note: Referrals increase your chances of interviewing at SME Careers by 2x