SRE/Kafka Platform Engineer
The Technical
Richardson, United States of America
2 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Job location: Richardson, United States of America
Tech stack
Java
API
Amazon Web Services (AWS)
Azure
Continuous Integration
Distributed Systems
JSON
Python
Parquet
File Transfer Protocol (FTP)
Data Ingestion
System Availability
Kubernetes
Kafka
Data Management
Data Pipelines
Microservices
Job description
- Support, monitor, and maintain microservices-based data platforms in production environments.
- Proactively monitor, troubleshoot, and resolve incidents across distributed systems and streaming platforms.
- Ensure high availability, reliability, and performance of Kafka-based ingestion and processing pipelines.
- Own operational readiness across non-production and production environments, including
Requirements
- Experience across Kafka, SRE, microservices, and platform operations.
- Strong in production support, incident management, and observability.
- Comfortable with cloud (AWS/Azure), Kubernetes, and CI/CD tools.
- Hands-on experience with real-time data pipelines and operational readiness.
Requirements:
- Strong programming expertise in Java and/or Python.
- Hands-on experience with Apache Kafka, including:
  - Topics, partitions, and brokers
  - Consumer groups
  - Kafka Connect
  - Lag monitoring and alert handling
- Proven experience in microservices architecture (build, support, and troubleshooting).
- Solid understanding of SRE principles.
- Experience with CI/CD pipelines and tools.
- Strong troubleshooting capability across logs, metrics, traces, infrastructure dependencies, and application failures.
Kafka, Platform Operations & Environment Readiness
- Experience with data ingestion mechanisms including Kafka, SFTP, and APIs.
- Knowledge of data formats such as JSON and Parquet.