Incident Response Engineer, Senior
ASM
Dover, United States of America
29 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
Senior Compensation
$ 155KJob location
Dover, United States of America
Tech stack
Distributed Systems
Information Technology Operations
Log Analysis
Reliability Engineering
Service Design
Security Information and Event Management
Information Technology
Performance Monitor
Job description
- Technical Lead (under Major Incident Management direction): Lead complex investigations from scoping through closure; drive hypothesis-based troubleshooting; validate permanent fixes across distributed systems.
- Observability & Diagnostics: Use modern monitoring/SIEM/observability to correlate metrics, traces, logs; distinguish symptoms from root causes; map impacts across infra/app/network/identity.
- Runbooks & Automation: Design/refine technical runbooks; implement scripts/orchestration to standardize responses and reduce manual effort; codify remediation/verification checks.
- SRE & Architecture Integration: Translate incident insights into capacity planning, reliability metrics, and service design changes; partner with platform/reliability engineering teams.
- Technical PIRs & Coaching: Produce high-quality technical PIRs for engineers/executives; mentor responders in tools, diagnostics, documentation discipline, and IM practice adherence.
- Cyber IR Interface: Coordinate with SOC/cyber responders when security indicators emerge; align IT ops IR and cyber IR workflows without compromising restoration velocity/safety.
- Technical Mentoring: coach incident responders and operations staff, raising the bar on diagnostic techniques, tool usage, documentation discipline, and adherence to incident management practices., Compensation ranges for ASM Research positions vary depending on multiple factors; including but not limited to, location, skill set, level of education, certifications, client requirements, contract-specific affordability, government clearance and investigation level, and years of experience. The compensation displayed for this role is a general guideline based on these factors and is unique to each role. Monetary compensation is one component of ASM's overall compensation and benefits package for employees.
Requirements
- Bachelor's degree in Information Technology, Computer Science, Business Administration, or related field, or equivalent relevant work experience.
- Minimum of 8 years of experience in incident management, IT operations, reliability engineering, or related IT roles, including frequent responsibility for leading complex, multi-system incident resolution.
- Strong mastery of ITIL-aligned incident management principles and best practices, with demonstrated experience coordinating major incidents in a large enterprise or federal IT environment.
- Advanced proficiency with incident management tools and modern monitoring/observability platforms used for log analysis, performance monitoring, and alerting.
- Proven ability to manage multiple complex incidents concurrently, synthesize technical information quickly, and communicate clearly and confidently with both technical teams and leadership.
- Active or obtainable SECRET clearance and U.S. citizenship, with the ability to satisfy all applicable federal suitability and security requirements., * Background leading incident response in large-scale, cloud-centric, or hybrid environments, including ownership of cross-team technical coordination and complex investigations.
- Advanced incident response, cybersecurity, or IT service management certifications (such as higher-level ITIL, incident-response-oriented, or security certifications).
- Experience embedding incident insights into site reliability engineering practices, including error budgeting, reliability metrics, and capacity planning.
- Demonstrated success building and refining automation for common remediation actions and verification checks., The physical requirements described in "Knowledge, Skills and Abilities" above are representative of those which must be met by an employee to successfully perform the primary functions of this job. (For example, "light office duties' or "lifting up to 50 pounds" or "some travel" required.) Reasonable accommodations may be made to enable individuals with qualifying disabilities, who are otherwise qualified, to perform the primary functions.