Data Engineer job in Columbia
Real Soft Inc.
Columbia, United States of America
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
EnglishJob location
Columbia, United States of America
Tech stack
Java
Agile Methodologies
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Unit Testing
Bash
Information Engineering
ETL
Linux
Data Flow Control
Integrated Development Environments
IntelliJ
JSON
Python
Maven
Amazon Web Services (AWS)
Software Engineering
SQL Databases
Data Streaming
XML
YAML
Computer Network Operations
Data Ingestion
Gitlab
Containerization
Kubernetes
Kafka
Apache Nifi
Amazon Web Services (AWS)
Data Pipelines
Docker
Requirements
- A current U.S. Government Security Clearance is not required at start. but candidate should be "clearable". U.S. Citizenship required.
- 14+ years of experience in data engineering, software engineering, or related technical fields.(Additional experience may be considered in lieu of a degree)
- Strong experience designing and building ETL pipelines and data ingestion frameworks
- Hands-on experience with Kafka, NiFi, and AWS (S3, SQS)
- Proficiency in Java and/or Python, with experience in unit and integration testing
- Solid understanding of data formats (JSON, XML, SQL schemas, compressed formats)
- Experience with Kubernetes, Docker, or containerized deployments
- Experience troubleshooting data pipelines, system performance, and dataflow issues
- DoD 8570 IAT II certification (or higher), * Experience supporting cyber or network operations environments
- Familiarity with Agile development environments
- Exposure to Apache Airflow or Mage AI
- Strong documentation and communication skills
- Experience developing training materials or mentoring team members
Tech Environment
Languages: Java, Python, SQL
Data & Streaming: Kafka, Apache NiFi
Cloud: AWS (S3, SQS, SNS)
Tools: GitLab, Maven, VSCode, IntelliJ, PyCharm
Other: YAML configuration, Linux (Bash), data modeling & ETL frameworks