Data Scientist (GenAI / Agent Performance)
Role details
Job location
Tech stack
Job description
- Analyze agent outputs to identify failures, context gaps, and improvement opportunities
- Define and apply evaluation criteria and metrics for agent performance
- Improve context engineering through better retrieval, data structuring, and source onboarding
- Collaborate with product managers and engineers to translate findings into technical improvements
- Design lightweight tests and validations to measure the impact of changes
Requirements
Do you have experience in Windows?, Do you have a Master's degree?, You are a curious and analytical Data Scientist who enjoys understanding why GenAI systems behave the way they do and how to improve them. You thrive at the intersection of data, engineering, and product thinking, and you are motivated by debugging agent behavior rather than training models. You think structurally, communicate clearly, and have a strong intuition for what "good" looks like in real-world AI applications., * Strong understanding of agentic GenAI systems and LLM-based applications
- Experience reasoning about prompts, context windows, retrieval, and grounding
- Solid technical integration skills, including APIs, data pipelines, and data flow debugging
- Analytical mindset with the ability to design evaluation frameworks beyond classical ML metrics
- Clear communicator able to explain insights and proposals to technical and non-technical stakeholders