Accumulo Database Engineer

Kforce Inc.

San Antonio, United States of America

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

San Antonio, United States of America

Tech stack

Java

Apache Accumulo

Big Data

Databases

Data Security

Database Schema

Software Debugging

Distributed Data Store

Distributed Systems

Hadoop

Hadoop Distributed File System

Python

Log Analysis

Performance Tuning

Backup and Restore

Apache Zookeeper

Data Processing

Apache Yarn

Reliability of Systems

Job description

An enterprise federal program is seeking a highly specialized Big Data Database Engineer with deep expertise in Apache Accumulo to support a mission-critical distributed data platform. This role is focused on the sustainment, optimization, security, and modernization of large-scale data environments operating across multiple classification levels. This is a hands-on engineering role requiring true subject matter expertise in Accumulo and its supporting ecosystem-not a general database administrator., Accumulo Engineering & Administration

Administer and optimize distributed Accumulo clusters, including core services and system components Perform advanced tuning of compactions, tablet management, and table-level configurations Design and maintain table structures, iterators, partitioning strategies, and performance settings Troubleshoot complex system-level issues including iterator failures, class loading issues, and runtime errors

Distributed Systems Management

Configure and maintain Hadoop ecosystem components including HDFS, Zookeeper, and YARN Optimize distributed storage and compute performance for high-volume ingestion and analytics workloads Ensure proper integration and stability between platform components

Data Security & Access Control

Implement and enforce fine-grained data security controls, including cell-level access restrictions Ensure compliance with multi-classification data handling requirements Support secure data flows across enterprise environments

Backup, Recovery & Sustainment

Manage backup and recovery strategies across distributed data platforms Support operations across multiple network environments with differing classification levels Maintain system reliability and support ongoing modernization efforts

System Diagnostics & Optimization

Analyze logs and performance metrics to identify and resolve ingest bottlenecks and system constraints Diagnose dependency issues across distributed services Partner with engineering teams to improve system performance and scalability

Requirements

Active TS/SCI clearance or TS with SCI eligibility Proven, hands-on experience engineering and administering Apache Accumulo in enterprise environments (required) Strong experience with distributed systems, including HDFS, Zookeeper, and YARN Background in Hadoop ecosystem configuration, troubleshooting, and performance tuning Proficiency with Python and/or Java for system interaction and debugging Ability to troubleshoot deep technical issues within distributed data platforms, Experience supporting large-scale data platforms for analytics, ingestion, or cyber-related use cases Familiarity with mission-driven or highly secure data environments Experience developing automation for database management, monitoring, and maintenance workflows Exposure to Java-based application stacks that integrate with distributed data stores

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all