Founding Machine Learning Engineer
Role details
Job location
Tech stack
Job description
- This is an ML engineering role, not a research role, not a prompt engineering role. You understand the underlying mechanics: how embeddings encode meaning, how attention shapes retrieval, how to build and evaluate representations that power matching and ranking systems.
- You use LLMs where they're the right tool, but your first instinct is to understand the problem at a model level, not to reach for an API.
- The work is concrete: building a system that takes a new role and stack-ranks the entire candidate database against it in seconds.
- You own the representations and scoring models that sit at the core of how Dex connects engineers with companies.
- The matchmaking engine you build here is the foundation that powers everything that comes next - candidate-facing products, automated outreach, smarter sourcing - so you're building for durability, not just the immediate use case.
- Your ML judgment is backed by solid engineering execution.
What You'll Do
- Own the matchmaking engine - build and improve the AI-driven talent matching system; design representations, scoring models, and instrumentation from the ground up
- Work with embeddings and retrieval - build and evaluate embedding models, vector search, and semantic retrieval systems that power candidate-to-role matching
- Design and run evaluations - build practical eval frameworks for model behaviour and output quality; make rollout safety and failure handling first-class concerns
- Contribute to agent and LLM systems - work on the voice agent backend and LLM pipelines with a model-level understanding of what's actually happening
- Ship reliable backend services - build production-grade Python services, not prototypes; handle errors, retries, latency, and observability as standard
About you
-
ML foundations - you understand how embeddings, attention, and retrieval systems work at a model level; you can reason about representations, not just API responses
-
Production judgment - you know when to use an LLM, when to use classical ML, and when to use neither; you've made these calls in production, not just in notebooks
-
Evaluation and guardrails mindset - you build evals before things go wrong; you design failure handling and rollout safety into systems from the start
-
Backend execution strength - you ship reliable, maintainable services; your track record shows production ML systems, not just impressive demos
Requirements
- Python backend (FastAPI or equivalent, async patterns, Postgres, Redis)
- Embeddings, vector search, and semantic retrieval (building and evaluating, not just calling)
- ML model evaluation - Metrics design, offline/online eval, failure analysis
- LLM integration with model-level understanding (attention, context windows, trade-offs)
Nice to Have
- Experience with recommendation systems, ranking models, or candidate/item matching
- Classical ML background (supervised/unsupervised, feature engineering, gradient boosting)
- Experience with voice agents or real-time audio pipelines
- Familiarity with Pydantic AI or similar agent frameworks
- Experience in recruiting tech or marketplace systems