Network Operations Center Engineer
Role details
Job location
Tech stack
Job description
The NOC Engineer role serves as our Product installation expert as well as a senior escalation resource. Additionally, this role is responsible for the scope, implementation, and development of NOC monitoring and toolsets. This role will provide direct support for the automation of alerts and the systematic creation of information-rich tickets. The NOC Engineer will work with partner development teams to identify, plan, and manage the development of tools to support the NOC and enhance Operator capabilities. This individual will also work with NOC leadership to document and create associated user and training materials for supported tools and systems., * Participate in and work to evolve the product installation process from pre-install, all the way through to post-install validations.
- Management of the Problem queue, ensuring that the underlying cause of incidents is addressed and driving operational stability.
- Serves as a senior escalation point for incident and problem tickets prior to escalation to development teams
- Engages with the NOC team to identify and develop supporting tools and processes
- Works with NOC leadership to prioritize and align tool development with the NOC Objectives and Strategic Vision
- Develop new process standards to enable automation and monitor for compliance
- Performs administration and maintenance for NOC tools and systems as owner and SME of these systems
- Proactively develop reports on health, risks, deficiencies, and future state for NOC tools and systems and communicate these reports with NOC leadership
Requirements
Do you have experience in Technical support?, * Value partnership and customer service excellence
- Can develop and implement continual improvement processes
- Are self disciplined and take ownership for assigned work and supported systems
- Are an excellent communicator with the ability to communicate technical concepts and create clear technical documentation
- Have experience with monitoring and alerting tools
- Understand how to work with middleware and api integrations
- Understand the software development lifecycle and have experience with working within an agile development cycle
- Are excited about working with cutting edge technology and leading a team to solutions for problems that have never been encountered before
- Are flexible to working 24/7/365 and are willing to participate in an on-call rotation
- Want the ability and freedom to build something new from the ground up
Bonus points if you:
- Have direct development experience
- Have experience engineering and managing monitoring solutions for IoT, Cloud, or Containerization
- Understand concepts of secure software development, AAA, service accounts, and least privileged access
- Familiar with ITIL service management concepts
- Relevant formal education or qualifying Certifications
Technical Familiarity
IT Operations is a unique IT discipline that brings together a very wide range of technology skills. An ideal candidate will have most of the following competencies.
- 2+ years of experience working with Linux Server administration with emphasis on remote command line administration. Ubuntu experience preferred
- 2+ years of experience with Networking, with familiarity in cabling,configuration,management and monitoring
- 3+ years of experience with full stack capable Monitoring tools
- 3+ years of experience with Log collection and analysis tools such as Splunk
- 3+ years of experience conducting Failure mode analysis on systems
- 2+ years of experience conducting or managing root cause analysis processes