Software Engineer, Data & AI Platform

Cortea AI
Berlin, Germany
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Berlin, Germany

Tech stack

Artificial Intelligence
Automated Storage and Retrieval Systems
Big Data
Google BigQuery
Databases
Information Engineering
Data Infrastructure
ETL
Data Retention
Data Stores
Data Warehousing
Database Queries
Software Debugging
Distributed Systems
Python
Operational Databases
Software Architecture
SQL Databases
AI Infrastructure
Data Logging
Large Language Models
Model Validation
Backend
AI Platforms
Production Code
Build Tools
Vertica
Data Pipelines

Job description

We are looking for an Engineer with strong data engineering and AI systems experience to build the data, evaluation, and observability foundation for production-grade LLM agents used in complex audit workflows.

This role sits at the intersection of backend engineering, data engineering, AI infrastructure, and LLM operations. You will work hands-on in our backend and agent architecture, building the systems that help us evaluate, monitor, debug, optimize, and continuously improve AI agents in production.

This is not a traditional analytics, BI, or dashboarding role. You should expect to write production code, design infrastructure, work inside backend systems, and directly improve the quality, cost, reliability, and performance of LLM-based agents., You will help building and operating the technical infrastructure around our AI agents, with a focus on data infrastructure, evaluation, observability, and optimization. Your work will include:

  • Building online and offline evaluation systems for LLM agents, including pipelines that use golden datasets, ground-truth data, human review workflows, and experiment results.
  • Creating automated quality gates so changes to prompts, context, models, or agent logic can be tested before reaching production.
  • Analyzing large volumes of agent traces and executions to identify failure modes, quality regressions, latency issues, reliability gaps, and cost optimization opportunities.
  • Working with columnar data stores and analytical databases such as BigQuery, ClickHouse, or similar technologies.
  • Building reliable data retention and replay mechanisms for long-term analysis of production agent behaviour.
  • Creating observability tooling for trace analysis, experiment monitoring, production dashboards, logging, tracing, and debugging.
  • Working inside our core backend and agent architecture, including building new agents or improving existing agents when needed.

Requirements

Do you have experience in SQL?, Do you have a Master's degree?, You will fit into this role if you:

  • Have strong Python and/or backend engineering experience.
  • Have strong SQL skills and are comfortable working with large datasets.
  • Have deployed and operated systems in the cloud, ideally on GCP.
  • Have practical experience designing data pipelines, ETL/ELT workflows, event-processing systems, or feedback loops for production data.
  • Are comfortable working with analytical databases, data warehouses, columnar stores, and high-volume event or trace data.
  • Understand system design, reliability, observability, monitoring, logging, debugging, and operational trade-offs.
  • Can work in complex existing systems and quickly build a mental model of how they operate.
  • Bring senior-level engineering judgment: you can make architectural decisions, communicate trade-offs, and build systems that other engineers can extend.
  • Are comfortable with ambiguity, able to reason from first principles, and excited to build infrastructure for AI systems that are actively used in production.

Nice-to-haves that are a plus:

  • Building infrastructure around LLM-based products or agentic systems, including optimizing LLM usage, context windows, reasoning tokens, or model selection.
  • Working with production traces from complex distributed systems.
  • Building internal platforms for engineers, domain experts, or operations teams.
  • Using workflow orchestration systems such as Temporal or similar.
  • Familiarity with audit, finance, compliance, or other high-accuracy domains.
  • Experience in an early-stage startup or fast-moving engineering environment.

No one checks every box. If you've shipped retrieval systems and like owning evaluations and pipelines, let's talk.

Benefits & conditions

  • Attractive compensation: competitive salary plus significant equity
  • Personal development: Learning budget for courses and conferences
  • Startup perks: Flexible vacation, team lunches, retreats, central Berlin office

About the company

We're Cortea, a Berlin startup transforming audits with AI. Manual, document-heavy audits waste expert time while demand keeps rising. Our AI-powered software and specialized AI agents remove the repetitive work so auditors can focus on judgment.

Apply for this position