Accumulo Database Engineer
Role details
Job location
Tech stack
Job description
An enterprise federal program is seeking a highly specialized Big Data Database Engineer with deep expertise in Apache Accumulo to support a mission-critical distributed data platform. This role is focused on the sustainment, optimization, security, and modernization of large-scale data environments operating across multiple classification levels. This is a hands-on engineering role requiring true subject matter expertise in Accumulo and its supporting ecosystem-not a general database administrator., Accumulo Engineering & Administration
Administer and optimize distributed Accumulo clusters, including core services and system components Perform advanced tuning of compactions, tablet management, and table-level configurations Design and maintain table structures, iterators, partitioning strategies, and performance settings Troubleshoot complex system-level issues including iterator failures, class loading issues, and runtime errors
Distributed Systems Management
Configure and maintain Hadoop ecosystem components including HDFS, Zookeeper, and YARN Optimize distributed storage and compute performance for high-volume ingestion and analytics workloads Ensure proper integration and stability between platform components
Data Security & Access Control
Implement and enforce fine-grained data security controls, including cell-level access restrictions Ensure compliance with multi-classification data handling requirements Support secure data flows across enterprise environments
Backup, Recovery & Sustainment
Manage backup and recovery strategies across distributed data platforms Support operations across multiple network environments with differing classification levels Maintain system reliability and support ongoing modernization efforts
System Diagnostics & Optimization
Analyze logs and performance metrics to identify and resolve ingest bottlenecks and system constraints Diagnose dependency issues across distributed services Partner with engineering teams to improve system performance and scalability
Requirements
Active TS/SCI clearance or TS with SCI eligibility Proven, hands-on experience engineering and administering Apache Accumulo in enterprise environments (required) Strong experience with distributed systems, including HDFS, Zookeeper, and YARN Background in Hadoop ecosystem configuration, troubleshooting, and performance tuning Proficiency with Python and/or Java for system interaction and debugging Ability to troubleshoot deep technical issues within distributed data platforms, Experience supporting large-scale data platforms for analytics, ingestion, or cyber-related use cases Familiarity with mission-driven or highly secure data environments Experience developing automation for database management, monitoring, and maintenance workflows Exposure to Java-based application stacks that integrate with distributed data stores