Machine Learning Engineer

Associated Press
New York, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 180K

Job location

New York, United States of America

Tech stack

API
Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Cloud Computing
Data Deduplication
Software Debugging
Fault Tolerance
FFmpeg
Machine Learning
Named Entity Recognition
TensorFlow
Search Technologies
Data Logging
Enterprise Software Applications
Deep Learning
Caching
FastAPI
Kubernetes
Low Latency
ONNX (Open Neural Network Exchange) Format
Machine Learning Operations
Terraform
Data Pipelines

Job description

The ML Engineer is a new role within the AP Engineering organization, responsible for shaping how we build and scale machine learning systems at AP, helping to lay the foundation for our machine learning capabilities. The ML Engineer has hands-on experience building andoptimizingML inference systems that run in production environments. This role will develop and tune pipelines that transform millions of photos, videos, and text documents into searchable representations using a combination of deep learning models (e.g.,DistilBERT, SBERT, TransNetV2) and external multimodal APIs. The ideal candidate has experienceoptimizinginference at scale, orchestrating ML workloads, and working with bothPyTorchand TensorFlow in a cloud environment, focusing on model performance, integration patterns, and inference efficiency.

This is an individual contributing role who will reportdirectlyto our Directorof Development, Enterprise Application Services.

What you will do:

  • Design, build, and scale ML-powered inference systems that process large volumes of text, image, and video data to power news-based intelligence products.

  • Productionize andoptimizestate of the artmodels and inference pipelines.These models include, but are not limited to:

  • DistilBERTfor Named Entity Recognition (NER) over hundreds of thousands of search queries/day
  • TransNetV2 for video shot boundary detection at scale for archival video as well as real-time
  • SBERT for embedding generation from textual descriptions
  • External multimodal APIs for image/video captioning
  • Support hybrid search architectures by defining embedding/re-ranking interfaces, evaluation metrics, and inference performance requirements; partner with search/platform engineers on index configuration, sharding, and cluster tuning.

  • Design and implement scalable data processing pipelines across hybrid CPU/GPU environments to handle millions of media assets.

  • Partner withMLOpsand platform engineering to enable the deployment and operation of ML systems reliably, contributing to:

  • Distributed inference architectures
  • Cloud-based execution (e.g., AWS EC2, Batch, Lambda, SageMaker)
  • Efficient resourceutilizationacross workloads
  • Optimizeinference latency and throughput across distributed workloads using cloud-based resources (AWS EC2, Batch, Lambda, SageMaker, etc.)

  • Build resilient asynchronous processing systems for large-scale workloads, ensuring:

  • Reliability (retries, fault tolerance)
  • Efficiency (caching, deduplication)
  • Observability (metrics, logging, traceability)
  • Work closely with data scientists and product teams to iterate on models, improve performance, and deliver measurable impact in production.

Requirements

  • 8+ years of experience building production ML inference systems.

  • Demonstrated ownership of deep-learning inference optimization in production (quantization, distillation, compilation, kernel/profile-level performance work) for transformer NLP and/or CV models.

  • Experience with both TensorFlow (SavedModel,tf.data, XLA,TFLite) andPyTorch(TorchScript, ONNX,FastAPI/TorchServe)

  • Hands-on experienceoptimizinginference pipelines on AWS infrastructure, ideally acrossdifferent typesof media assets.

  • Experience with video frameworks/tools (e.g.,FFmpeg), andworking with large-scale frame-level inference.

  • Demonstrated experience monitoring and debugging model latency, memory, and pipeline throughput.

  • Experience with hybrid search architectures (BM25 + vector search + cross-encoder reranking).

  • Familiarity with OpenAI APIs or other foundation model providers.

  • Familiarity with open sourceHuggingFaceLLMs.

  • Experience with data pipeline and workflow orchestration tools (e.g., Airflow)

Who This Role is Not For:

Candidates whose primary background isMLOpsplatform work (e.g.,DAG orchestration, Terraform, Kubernetes administration, generic CI/CD pipelines) will not be a fit. Weare looking fora senior level engineer whohas experienceprofilinga transformer, rewritingits serving path for a 2-3x latency reduction, tuningan HNSW index, andcantellus which SageMaker instance type will hit our p95 target at the lowest cost.

Benefits & conditions

The anticipated salary range for this position is $145,000 - $180,000 based on a candidate's skills, qualifications and location. The Associated Press offers comprehensive benefits, which include:

  • Competitive medical, dental and vision coverage

  • Retirement benefits

  • Company paid life insurance

  • Paid vacation and sick days

  • Paid parental leave for any new parent

  • Mentalwell-beingresources

AP seeks to build an inclusive organization grounded in respect for differences. We support all aspects of diversity and provide equal employment opportunities to all employees and applicants without regard to race, color, religion, sex, marital status, national origin, age, sexual orientation, gender identity, disability, status as a veteran, or other characteristic protected by law.

About the company

The Associated Press is an independent global news organization dedicated to factual reporting. Founded in 1846, AP today remains the most trusted source of fast, accurate, unbiasednews in all formats and the essential provider of the technology and services vital to the news business. More than half the world's population sees AP journalism every day., AP seeks to build an inclusive organization grounded in respect for differences. We support all aspects of diversity and provide equal employment opportunity to all employees and applicants without regard to race, color, religion, sex, marital status, national origin, age, sexual orientation, gender identity, disability or status as a veteran in accordance with applicable nondiscrimination laws.

Apply for this position