Data Scientist

Middesk
San Francisco, United States of America
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

San Francisco, United States of America

Tech stack

Network Analysis
Data Systems
Relational Databases
Graph Database
Software Deployment
Workflow Management Systems
Large Language Models
Data Analytics
Virtual Agents

Job description

We're building AI-driven applications that simplify customer workflows, starting with business onboarding. With our proprietary identity data and deep domain expertise, we're in a strong position to expand into a broader set of intelligent, risk-aware products., Design and ship production systems that detect and prevent fraud across KYB, trust & safety, and compliance workflows.

  • Work with messy, real-world data Tackle problems with extreme class imbalance, sparse signals, evolving adversarial behavior, and limited ground truth.

  • Leverage relationships in data Apply graph-based approaches and entity resolution techniques to uncover hidden connections and improve risk detection.

  • Improve signal & labeling Use a mix of heuristics, weak supervision, and modern AI tools (including LLMs where appropriate) to generate better features and labels.

  • Help scale our infrastructure Partner with engineering to build and evolve systems for feature generation, model training, and production deployment across multiple use cases.

Requirements

  • 4+ years of experience in fraud, risk, or trust & safety You've worked on real-world fraud or abuse problems and understand the domain deeply.

  • Experience building and shipping production systems You've deployed models or data-driven systems that power external-facing products.

  • Strong foundation in applied ML or data systems Comfortable working on classification problems with real-world constraints like imbalanced data, sparse signals, and changing patterns.

  • Experience with graph or relational data approaches Familiarity with knowledge graphs, network analysis, or entity linking is strongly preferred.

  • Hands-on and pragmatic You focus on impact over perfection and know how to balance speed, accuracy, and maintainability.

About the company

Middesk makes it easier for businesses to work together. Since 2018, we've been transforming business identity verification, replacing slow, manual processes with seamless access to complete, up-to-date data. Our platform helps companies across industries confidently verify business identities, onboard customers faster, and reduce risk at every stage of the customer lifecycle. Middesk came out of Y Combinator, is backed by Sequoia Capital and Accel Partners, and was recently named to Forbes Fintech 50 List., This is a highly technical, hands-on role with broad influence over how we design, build, and scale data-driven systems at Middesk.

Apply for this position