Senior Machine Learning Engineer

Wallarm

Barcelona, Spain

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Barcelona, Spain

Tech stack

API

Amazon Web Services (AWS)

Big Data

Google BigQuery

Cluster Analysis

Directed Acyclic Graph (Directed Graphs)

Information Engineering

ETL

DevOps

Python

Machine Learning

Raw Data

Azure

Software Deployment

Data Streaming

Tokenization

Management of Software Versions

Feature Engineering

Large Language Models

Spark

Software Security

Cyber Threat Analysis

Kafka

Machine Learning Operations

Feature Extraction

Document Classification

Software Version Control

Data Pipelines

Job description

Since 2016, Wallarm has been on a mission to secure the internet's critical infrastructure: APIs. Today, we are the trusted choice for over 200 of the world's most innovative companies, from high-growth startups to Fortune 500 and Nasdaq leaders. Our unified platform provides full-lifecycle API security - helping teams discover their attack surface, protect against modern threats, and respond to incidents in real time. As a graduate of Y Combinator and fueled by a recent $55M Series C, we are scaling our global, remote-first team of 150+ innovators to solve the next generation of security challenges.

We're building ML-powered detection systems that protect APIs from automated abuse: credential stuffing, scraping, enumeration, and attack patterns that evolve daily. This is a greenfield effort: we have the data and the ideas, but the ML infrastructure, pipelines, and models need to be built from scratch.

You'll be the first dedicated ML engineer on the team, working closely with engineers, security researchers and DevOps. This is a senior IC role with a clear path to technical leadership - we plan to grow the ML function around this hire.

Build the ML stack from the ground up - Design and implement the data pipelines, feature extraction, model training, and serving infrastructure needed for production-grade anomaly detection.
Detect anomalies in API traffic - Your first major outcome: build a system that identifies malicious behavioral patterns across client sessions with high precision and recall, trained per client.
Own the full lifecycle - From raw data exploration and feature engineering through model development, evaluation, deployment, and continuous monitoring. No handoffs to a separate "productionization" team.
Design experiments and metrics - Build offline evaluations, define detection-quality metrics, and monitor for false positives, drift, and adversarial adaptation.
Work with text and structured behavioral data - Extract signals from API sessions, request sequences, payloads, and traffic metadata using NLP and statistical techniques.
Leverage LLMs where they add value - Explore embedding-based models and LLM-augmented approaches for signal enrichment, classification, and explainability.
Shape the technical direction - Document findings, present to cross-functional teams, and help define the ML roadmap as the team grows.

Requirements

5+ years in Applied ML or ML Engineering with production deployment experience (not research-only backgrounds).
Strong NLP / text data experience - hands-on work with text classification, pattern extraction, tokenization, embeddings, or similar. This is the core of the work.
Proficiency in Python and production-grade systems (APIs, data pipelines, model serving).
Solid data engineering skills - experience building ETL/data pipelines, working with batch and streaming data, and understanding the full ML data lifecycle (DAGs, data versioning, feature stores).
Deep hands-on experience across ML fundamentals: classification, anomaly detection, clustering, statistical methods - and the judgment to choose the right approach for a given problem.
Comfort with imperfect data - noisy labels, class imbalance, evolving distributions - and practical strategies for labeling, evaluation, and shipping reliable models.
End-to-end ownership mindset - ability to take a problem from raw data to production deployment, working with DevOps to stand up the necessary infrastructure.
Strong experimentation skills: prototype fast, design rigorous evaluations, measure outcomes, reason about trade-offs (cost, quality, latency).

Strongly Preferred

Experience in domains where adversaries actively adapt to detection (fraud, bot mitigation, abuse prevention, spam). The ML mindset of handling concept drift and adversarial evasion matters more than specific domain knowledge.
Familiarity with ML lifecycle tooling: experiment tracking (MLflow, W&B), model versioning (DVC), weak-supervision tools (Snorkel, cleanlab), drift monitoring.
Experience with big data / streaming stacks (Spark, Kafka, BigQuery) or cloud ML platforms (AWS SageMaker, GCP Vertex).
Background in security research or threat intelligence (not required - domain context can be learned).

Who Thrives Here

You're a full-stack ML engineer - equally comfortable building a data pipeline and tuning a model, designing an experiment and deploying it to production.
You've built from scratch before - you know what it takes to go from "we have data and ideas" to "we have a working detection system."
You're energized by ambiguity and ownership - this isn't a well-scoped ticket queue, it's an open problem space where you define the path.
You're ready to grow into leadership - mentoring engineers, shaping technical strategy, and owning the ML roadmap as the team scales around you.
You leverage modern tools (AI-assisted development, LLM-augmented workflows) to move faster without cutting corners.