Machine Learning Engineer

Tiempo Completo
Municipality of Madrid, Spain
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Municipality of Madrid, Spain

Tech stack

HTML
Java
JavaScript
API
Artificial Intelligence
Automation of Tests
C Sharp (Programming Language)
Code Review
Databases
Data Validation
Web Scraping
ETL
Data Mining
Cursor (Graphical User Interface Elements)
Database Design
Graph Database
JSON
Python
PostgreSQL
Machine Learning
Neo4j
Named Entity Recognition
Software Engineering
TypeScript
Unstructured Data
XML
Rust
Data Processing
GitHub Copilot
Large Language Models
Grafana
SOAPAPI
Indexer
Kotlin
Containerization
Api Design
REST
Document Classification
Data Pipelines
Docker
Go
Programming Languages

Job description

Technical resource on the TME product team, partnering with a Project Manager. Builds and maintains data extraction and processing pipelines that transform EU tariff measures data (customs duties, tariffs, VAT, excise taxes) from government sources into structured, client-ready formats. Works using AI-assisted development tools and the BMAD methodology .This is a senior-level position requiring independent architecture and delivery decisions.

What You Will Do:

  • Develop AI native applications that extract and process trade compliance data from central EU and member state government sources - both structured (XML, JSON, CSV, HTML tables, APIs) and unstructured (PDF documents, legal text, prose regulations, scanned publications)
  • Design and built AI native data pipelines that monitor, detect changes, extract, normalise, validate, and release tariff measures data to clients
  • Integrate AI/LLM capabilities into extraction workflows - using large language models for document understanding, entity extraction, classification, and data structuring
  • Design and maintain data models for customs duties, VAT/excise information, and tariff measure metadata
  • Ensure data quality through automated testing, benchmarking against official sources, data source and legal source comparison, and data validation pipelines

Requirements

  • 3+ years of professional software development
  • Proficiency in at least two programming languages - e.g., Python, JavaScript/TypeScript, Java, Go, C#, Rust, Kotlin, or similar. We value strong fundamentals over specific language experience
  • Methodology / Framework: Bmad Method
  • Data extraction and processing - web scraping, document parsing, API integration, ETL/ELT pipelines. Must be comfortable with both:
  • Structured data: XML, JSON, CSV, HTML tables, databases, REST/SOAP APIs
  • Unstructured data: PDFs, legal text, prose regulations, HTML without clear structure, scanned documents
  • Database design and querying - relational (PostgreSQL or similar) and/or document-based databases. Schema design, migrations, indexing
  • API development - building and consuming RESTful APIs
  • Version control - Git workflow, branching strategy, code review
  • Automated testing - unit, integration, and data validation tests as part of development workflow
  • Self-sufficiency - ability to analyse, design and build complete solution
  • Rapid development - segment development into phasing to get results faster
  • Influencer - demonstrate approach to other team members to lift group maturity
  • Communication - can work with technical, business and project team members, participates in and leads discussion

Preferred Technical Skills:

  • LLM/AI API integration (OpenAI, Anthropic Claude, Google Gemini) - for data processing, document understanding, or content extraction
  • AI-assisted development tools (Claude Code, Cursor, GitHub Copilot, Windsurf, or similar)
  • NLP / document processing - OCR, text extraction, entity recognition, text classification
  • Graph databases (Neo4j) or vector databases (pgvector)
  • Observability and monitoring (SigNoz, Grafana, Langfuse, or similar)
  • Containerisation and deployment (Docker, CI/CD pipelines)

About the company

Based in the heart of Thuringia,Germany. Our company specializes in providing custom-built worldwide data and software for Global Business Solution. In order to support a wide range of business companies, we provide data encompassing compliance with various authorities to financial and custom regulations. These data and software solutions follow international standards and are compatible with software and worldwide data.

Apply for this position