(Senior) Data Scientist

PLATO AG
Berlin, Germany
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English, German
Experience level
Senior
Compensation
€ 130K

Job location

Berlin, Germany

Tech stack

A/B testing
API
Artificial Intelligence
Amazon Web Services (AWS)
Cluster Analysis
Data Structures
Distributed Systems
Github
Python
Machine Learning
Operational Databases
Recommender Systems
SciPy
Software Engineering
SQL Databases
Data Processing
Large Language Models
Snowflake
Prompt Engineering
Spark
PySpark
Scikit Learn
XGBoost
Data Management
Machine Learning Operations
Terraform
Databricks

Job description

  • Your recommendation engine suggests a product category - opening up €50K in new monthly revenue
  • Your clustering algorithm reveals that 20 customers have untapped potential worth €5M

Welcome to wholesale AI - where sparse data meets high stakes, where relationships matter more than transactions, and where the right insight at the right time can transform a business., * Build and deploy ML models processing millions of transactions across 20+ enterprise customers

  • Design recommendation engines handling catalogs of 500K+ products
  • Develop predictive models for customer behavior with sparse, irregular interaction patterns
  • Create churn prediction and cross-sell systems integrated with ERP systems
  • Own complete lifecycle from problem definition to production deployment
  • Build evaluation frameworks connecting model performance to business KPIs
  • Explanations matter as much as predictions

What you'll be working on

Production AI Systems at Scale

  • Build and deploy ML models that process millions of transactions across 20+ (and growing) enterprise customers simultaneously
  • Design systems that adapt to wildly different data distributions (a construction wholesaler vs. a medical equipment distributor)
  • Create recommendation engines that handle catalogs of 500K+ products where most customers buy <5% of items
  • Develop predictive models for customer behavior with sparse, irregular interaction patterns
  • Design churn prediction and cross-sell systems that integrate seamlessly with ERP (Entereprise Resource Planning) systems
  • Build models that generate automated proposals and personalized communication templates using LLMs.
  • Combine classical ML (churn prediction, clustering) with LLMs for insight generation and sales enablement

End-to-End Ownership

  • Own the complete lifecycle from problem definition to production deployment
  • Build evaluation frameworks that connect model performance to business KPIs
  • Design A/B testing infrastructure for continuous improvement
  • Create feedback loops that learn from real-world outcomes

Complex Technical Challenges

  • Handle extreme class imbalance (95%+ negative cases) while maintaining business value
  • Build models that work with limited historical data (cold-start problems)
  • Design architectures that scale from SMBs with 100 customers to enterprises with 100K customers
  • Solve multi-objective optimization problems (maximize revenue while ensuring diversity)

Requirements

Do you have experience in Unity?, Do you have a Master's degree?, * Strong background in machine learning fundamentals - you understand why algorithms work, not just how to use them

  • Experience building production data powered products systems that handle real-world messiness
  • Familiarity in distributed computing (Spark/PySpark) for processing large-scale data
  • Solid software engineering practices - your code is tested, documented, and maintainable

Problem-Solving Mindset

  • You approach problems from first principles rather than reaching for standard solutions
  • Comfortable with ambiguity - you can define success metrics when requirements are vague
  • You balance technical elegance with business pragmatism
  • Experience translating business problems into data science solutions

Proven Track Record In:

  • Building ML systems that handle irregular patterns (time series with gaps, seasonal businesses, etc.)
  • Working with hierarchical data structures (product taxonomies, customer segments)
  • Creating models that provide actionable insights, not just predictions
  • Deploying ML in multi-tenant architectures where one model serves many clients

Bonus Points For:

  • Experience in B2B analytics, e-commerce, or supply chain optimization
  • Knowledge of recommendation systems, customer analytics, or revenue optimization
  • Familiarity with modern data platforms (Databricks, Snowflake, etc.)
  • Experience with MLOps practices and model lifecycle management
  • Understanding of European business practices and regulations
  • Experience with LLM APIs, prompt engineering, or building LLM-augmented products
  • Proficiency in German

Our Tech Stack

You'll be working with modern tools, but we care more about your ability to learn than specific tool experience:

  • Data Processing: PySpark, SQL, Python
  • ML Platform: Databricks (Unity Catalog, Workflows, Model Serving)
  • ML Libraries: scikit-learn, XGBoost/LightGBM, implicit, scipy, OpenAI + experience with your preferred frameworks
  • Infrastructure: AWS, Terraform
  • Orchestration: Databricks Workflows, Github Actions
  • Experimentation: MLflow

About the company

Plato is the AI-powered Sales Intelligence platform built specifically for B2B wholesale. We transform reactive sales teams into proactive revenue engines by surfacing hidden opportunities in their data. Our platform identifies which customers are at risk, who's ready to buy more, and what products to recommend next - turning thousands of signals into clear next actions. The Wholesale Challenge Why wholesale is different from everything else you've worked on: Imagine building ML for a world where: * Your "best customer" might buy twice a year - but spends €500K each time * A customer churning doesn't mean they stopped needing your products - they just found another supplier * Product catalogs contain 100K+ SKUs but even your best customers only know 500 of them * A "trend" might be 3 data points over 18 months * Success means helping a sales rep know which of their 200 customers needs attention today

Apply for this position