SRE/Kafka Platform Engineer

The Technical
Richardson, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Richardson, United States of America

Tech stack

Java
API
Amazon Web Services (AWS)
Azure
Continuous Integration
Distributed Systems
JSON
Python
Parquet
File Transfer Protocol (FTP)
Data Ingestion
System Availability
Kubernetes
Kafka
Data Management
Data Pipelines
Microservices

Job description

  • Support, monitor, and maintain microservices-based data platforms in production environments.
  • Proactively monitor, troubleshoot, and resolve incidents across distributed systems and streaming platforms.
  • Ensure high availability, reliability, and performance of Kafka-based ingestion and processing pipelines.
  • Own operational readiness across non-production and production environments, including

Requirements

  • Kafka + SRE + Microservices + Platform Ops
  • Strong in production support + incident management + observability
  • Comfortable with cloud (AWS/Azure) + Kubernetes + CI/CD tools
  • Having hands-on experience in real-time data pipelines and operational readiness

Requirements:

  • Strong programming expertise in Java and/or Python.
  • Hands-on experience with Apache Kafka, including:
  • Topics, partitions, brokers
  • Consumer groups
  • Kafka Connect
  • Lag monitoring and alert handling
  • Proven experience in microservices architecture (build, support, and troubleshooting).
  • Solid understanding of SRE principles
  • Experience with CI/CD pipelines and tools.
  • Strong troubleshooting capability across logs, metrics, traces, infrastructure dependencies, and application failures.
  • Kafka, Platform Operations & Environment Readiness
  • Experience with data ingestion mechanisms including Kafka, SFTP, and APIs.
  • Knowledge of data formats such as JSON and Parquet.

Apply for this position