Datacenter Engineer - London

SmartTrade

Charing Cross, United Kingdom

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Experience level

Intermediate

Job location

Charing Cross, United Kingdom

Tech stack

Java

Link Aggregation (Ethernet)

Amazon Web Services (AWS)

Apache HTTP Server

Intelligent Platform Management Interface

Bash

Border Gateway Protocol

BIOS

Ubuntu (Operating System)

CentOS

Configuration Management

Data Centers

Dynamic Host Configuration Protocol

Linux

RAID

DNS

Elasticsearch

Trunking

Firmware

Java Web Services

Python

Logical Volume Manager

MySQL

Network Diagrams

Routing

Red Hat Enterprise Linux - RHEL

Ansible

Virtual Local Area Networks

Scripting (Bash/Python/Go/Ruby)

Computer Network Operations

Juniper

GitLab

Git

Centreon

Kafka

LXC

Puppet

Terraform

Network Server

Dynatrace

Docker

Job description

We are seeking a hands-on Linux Systems & Datacenter Administrator to join our Europe Operations team. You'll be the on-the-ground owner for our Slough (Equinix) environment and a key contributor to our global private cloud.

The role blends Linux systems administration (Ubuntu), containerized compute (LXD/LXC, some Docker), networking, and datacenter operations.

You will partner with engineering, network, and security teams to ensure reliability, performance, and change control in a 24×7, market-facing environment.

This is a production-oriented role: you'll prepare, review and execute changes, troubleshoot live issues, execute maintenance windows, and continuously improve our platform through automation and rigorous documentation.

Our Environment

Servers: Dell, HPE, Supermicro.
Storage: LVM, software and hardware RAID (mdadm, MegaRAID, LSI, …).
Containers: LXD/LXC (primary), some Docker.
Networking (day-2 ops): VLANs, LACP, ACLs, routing basics; vendors include Dell, Supermicro, Arista, Juniper, VyOS.
Applications & Data: MySQL, Elasticsearch, Kafka, Java, Apache HTTPD, …
Automation & IaC: Git/GitLab, Chef, Terraform; scripting with Bash/Python.
Monitoring/Observability: Centreon, Observium (plus log pipelines).

What You'll Do

Operate and improve Linux fleets (Ubuntu) in production.
Manage LXD/LXC container platforms (no hypervisors/VM stacks).
Provide level-3 incident response for infrastructure issues (systems, containers, network paths, storage), restoring service within SLAs and driving post-mortems.
Own datacenter operations in Slough: rack/stack, cabling, optics, power planning, server installation, console/OOB, inventory management, RMA logistics, and vendor coordination (Equinix Smart Hands, carriers, OEMs).
Perform day-2 network operations on switches and firewalls (ACLs, VLANs, LAGs, routing basics), and collaborate closely with network engineering for changes.
Automate with Chef for configuration management and Terraform for IaC on AWS where applicable. Build reliable tooling for repeatable ops (config generation, pre-change checks, deployments, and validation).
Contribute to change management (runbooks, maintenance windows, rollback plans) and keep documentation current (network diagrams, inventories, SOPs).
Participate in a follow-the-sun operations model, coordinating with your EMEA/APAC peers.

Requirements

2-3+ years operating Linux (Ubuntu, CentOS, Red Hat) in production environments.
This position requires occasional on-call availability outside of standard business hours to respond to urgent or critical operational issues. Flexibility to be contacted outside regular working hours is required.
Previous datacenter work exposure: rack/stack, structured cabling (fiber/copper), PDUs, console/OOB, vendor/Smart Hands coordination, and accurate inventory. If no prior experience, willingness to learn and work in such environments.
Containers: production exposure to LXC or Docker and an understanding of their inner workings.
Server hardware & storage: LVM, software RAID, MegaRAID tooling, firmware/BIOS/BMC (iDRAC/iLO/IPMI), and hands-on diagnostics and replacements.
Networking fundamentals for day-to-day ops: VLANs, LACP, trunking, ACLs, static routes, BGP, DNS/DHCP, link/MTU issues; ability to execute well-scoped changes on Dell/Arista/Juniper/VyOS under peer review.
Automation & SCM: Bash/Python, Git/GitLab; experience with Chef or Ansible or Puppet in production.
Clear runbook-style writing, disciplined change control, and calm, structured troubleshooting under time pressure.

Nice to have:

Familiarity with Equinix processes (cross-connects, tickets, remote hands) and carrier coordination.
Ops exposure to MySQL, Elasticsearch, Kafka, Java services, Apache; ability to collaborate with app teams on infra-adjacent issues.
Experience with Centreon and Dynatrace (or equivalent monitoring/observability stacks).
Config management/IaC depth (Ansible, Puppet, Terraform modules, secrets management), and CI pipelines in GitLab.
Deeper networking (EVPN/VXLAN, BGP, multicast) and/or traffic engineering.

Benefits & conditions

Standard business hours aligned to Central European Time with flexibility for maintenance windows.
Rotational weekend work (Friday/Saturday/Sunday) for planned changes and datacenter work; a compensatory day off is granted during the week.
Travel: twice a week to the Equinix datacenters in Slough, once a month to central London, and occasional travel outside the UK.

About the company

smartTrade Technologies builds mission-critical trading and connectivity platforms for global financial markets. Our low-latency infrastructure supports real-time services across EMEA, North America, and APAC.