Machine Learning Engineer III, Search Relevance

Box, Inc.
Redwood City, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 220K

Job location

Redwood City, United States of America

Tech stack

Java
A/B testing
Amazon Web Services (AWS)
Azure
C++
Apache Lucene
Software Quality
Continuous Integration
Distributed Systems
Elasticsearch
Python
Machine Learning
Performance Tuning
Queueing Systems
TensorFlow
Scala
Search Technologies
Solr
Real Time Systems
PyTorch
Large Language Models
Backend
Kubernetes
Information Technology
Kafka
Terraform
Stream Processing
Data Pipelines
Microservices

Job description

We're looking for a Machine Learning Engineer III to improve search quality end-to-end-signals, ranking, retrieval, and evaluation-while building scalable, low-latency services that serve queries in real time. You'll collaborate with senior engineers, Product, Data, and Infra partners to productionize modern retrieval techniques and experimentation frameworks that directly impact how millions of users work. WHAT YOU'LL DO

  • Design, build, and iterate on components for ranking, retrieval, and recommendations that improve measurable relevance and latency.
  • Implement production features leveraging embeddings, semantic/hybrid search, and LLM-enabled retrieval under mentorship and design guidance.
  • Contribute to offline/online evaluation, A/B tests, and relevance tuning using metrics such as NDCG, MRR, and precision@k.
  • Develop reliable, observable microservices and near real-time indexing pipelines across distributed systems.
  • Own well-scoped projects from design to rollout, writing clear design docs, tests, and operational runbooks.
  • Improve data and feature pipelines (batch/streaming) to ensure quality, freshness, and end-to-end performance.
  • Document patterns and contribute to team best practices that raise the bar on code quality and reliability.
  • Participate in our on-call rotation, available at all times while on-call to help respond to and triage any issues that arise., Box makes reasonable accommodations for applicants with disabilities. If a reasonable accommodation is needed to participate in the job application or interview process, please complete this form. Reasonable accommodations may include scheduling adjustments, document dictation and beyond.

Notice to applicants in Los Angeles: Box, Inc and its related branches will consider for employment, qualified applicants with criminal histories in a manner consistent with the Los Angeles Fair Chair Ordinance. The Fair Chance Ordinance is provided here.

Notice to applicants in San Francisco: Box, Inc and its related branches will consider for employment, qualified applicants with criminal histories in a manner consistent with the San Francisco Fair Chair Ordinance. The Fair Chance Ordinance is provided here.

Requirements

  • 3+ years of industry experience building backend or distributed systems, with production ownership of services or data pipelines.
  • Proficient in at least one of: Java, Scala, C++, or Python; comfortable writing production-grade Python is a plus.
  • Exposure to search, ranking, recommendations, or applied ML in production; understand the basics of training-to-serving workflows.
  • Experience with data pipelines, message queues, or streaming systems (e.g., Kafka, Pub/Sub) and near real-time processing.
  • Familiarity with cloud-native microservices, CI/CD, observability, and performance tuning.
  • BS in Computer Science or related field, or equivalent practical experience.
  • Pragmatic, metrics-driven mindset-eager to experiment, measure impact, and iterate quickly in collaboration with partners.

Preferred

  • Experience with Elasticsearch, Solr, Lucene, or custom search systems; understanding of inverted indexes and scoring functions.
  • Knowledge of relevance tuning, learning-to-rank concepts, and offline/online experimentation practices.
  • Exposure to vector search, dense/sparse embeddings, and hybrid retrieval architectures.
  • Familiarity with IR fundamentals (BM25, TF-IDF, multi-stage retrieval) and query understanding.
  • Experience with Kubernetes/Terraform and a major cloud (GCP/AWS/Azure).
  • Practical exposure to PyTorch or TensorFlow; LLM familiarity helpful but not required.

Box lives its values, with community and in-person collaboration being a core part of our culture. Boxers are expected to work from their assigned office a minimum of 3 days per week.Your Recruiter will share more about how we work and company culture during the hiring process.

Benefits & conditions

Box is committed to fair and equitable compensation practices. Actual base salary (or OTE if commissionable role) is dependent upon factors such as: knowledge, skill level, experience, and work location. This role is also eligible for equity and benefits. For more information, check out our benefits and perks.

About the company

Box (NYSE:BOX) is the leader in Intelligent Content Management. Our platform enables organizations to fuel collaboration, manage the entire content lifecycle, secure critical content, and transform business workflows with enterprise AI. We help companies thrive in the new AI-first era of business. Founded in 2005, Box simplifies work for leading global organizations, including JLL, Morgan Stanley, and Nationwide. Box is headquartered in Redwood City, CA, with offices across the United States, Europe, and Asia. By joining Box, you will have the unique opportunity to continue driving our platform forward. Content powers how we work. It's the billions of files and information flowing across teams, departments, and key business processes every single day: contracts, invoices, employee records, financials, product specs, marketing assets, and more. Our mission is to bring intelligence to the world of content management and empower our customers to completely transform workflows across their organizations. With the combination of AI and enterprise content, the opportunity has never been greater to transform how the world works together and at Box you will be on the front lines of this massive shift. The Search Relevance team at Box powers discovery across billions of files, enabling customers to find the right content quickly, securely, and intelligently. As we expand into a new era of AI-powered content understanding, we're investing in the foundation that makes great search possible: reliable systems, strong signals, and models that learn from real-world usage. This is a rare opportunity to work at the intersection of information retrieval science, applied machine learning, and large-scale distributed systems. You'll be building the infrastructure that powers intelligent content discovery for Fortune 500 companies-where milliseconds matter, relevance is measurable, and your experiments directly impact how millions of users work.

Apply for this position