Accumulo Database Engineer

Kforce Inc.
San Antonio, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Tech stack

Java
Apache Accumulo
Big Data
Databases
Data Security
Database Schema
Software Debugging
Distributed Data Store
Distributed Systems
Hadoop
Hadoop Distributed File System
Python
Log Analysis
Performance Tuning
Backup and Restore
Apache ZooKeeper
Data Processing
Apache YARN
Reliability of Systems

Job description

An enterprise federal program is seeking a highly specialized Big Data Database Engineer with deep expertise in Apache Accumulo to support a mission-critical distributed data platform. The role focuses on the sustainment, optimization, security, and modernization of large-scale data environments operating across multiple classification levels. This is a hands-on engineering role requiring true subject matter expertise in Accumulo and its supporting ecosystem, not a general database administrator position.

Accumulo Engineering & Administration

Administer and optimize distributed Accumulo clusters, including core services and system components
Perform advanced tuning of compactions, tablet management, and table-level configurations
Design and maintain table structures, iterators, partitioning strategies, and performance settings
Troubleshoot complex system-level issues including iterator failures, class loading issues, and runtime errors

Distributed Systems Management

Configure and maintain Hadoop ecosystem components including HDFS, ZooKeeper, and YARN
Optimize distributed storage and compute performance for high-volume ingestion and analytics workloads
Ensure proper integration and stability between platform components

Data Security & Access Control

Implement and enforce fine-grained data security controls, including cell-level access restrictions
Ensure compliance with multi-classification data handling requirements
Support secure data flows across enterprise environments

Backup, Recovery & Sustainment

Manage backup and recovery strategies across distributed data platforms
Support operations across multiple network environments with differing classification levels
Maintain system reliability and support ongoing modernization efforts

System Diagnostics & Optimization

Analyze logs and performance metrics to identify and resolve ingest bottlenecks and system constraints
Diagnose dependency issues across distributed services
Partner with engineering teams to improve system performance and scalability

Requirements

Active TS/SCI clearance or TS with SCI eligibility
Proven, hands-on experience engineering and administering Apache Accumulo in enterprise environments (required)
Strong experience with distributed systems, including HDFS, ZooKeeper, and YARN
Background in Hadoop ecosystem configuration, troubleshooting, and performance tuning
Proficiency with Python and/or Java for system interaction and debugging
Ability to troubleshoot deep technical issues within distributed data platforms

Experience supporting large-scale data platforms for analytics, ingestion, or cyber-related use cases
Familiarity with mission-driven or highly secure data environments
Experience developing automation for database management, monitoring, and maintenance workflows
Exposure to Java-based application stacks that integrate with distributed data stores
