DevOps Automation Engineer

Mphasis
Westfield, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English
Experience level
Senior

Job location

Westfield, United States of America

Tech stack

Kubernetes Security
Bash
Big Data
Configuration Management
Continuous Integration
Linux
DevOps
Disaster Recovery
Distributed Computing Environment
Distributed Systems
Fault Tolerance
Monitoring of Systems
HP Systems Insight Manager
Python
Key Management
Kernel-Based Virtual Machine
Network Troubleshooting
Linux System Administration
MongoDB
Openshift
Performance Tuning
Role-Based Access Control
Red Hat Enterprise Linux - RHEL
Reliability Engineering
Ansible
Cloudera
Software Deployment
Data Streaming
Virtual Machines
Virtualization Technology
Software Vulnerability Management
Data Logging
Scripting (Bash/Python/Go/Ruby)
Enterprise Software Applications
Real Time Systems
Data Ingestion
Istio
Delivery Pipeline
Spark
HybridCloud
Containerization
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Apache Flink
Deployment Automation
Kafka
Data Management
Stream Processing
Splunk
VMware
Microservices

Job description

We are seeking a highly skilled Senior DevOps Engineer to design, implement, automate, and support enterprise-scale infrastructure and platform solutions across hybrid cloud and on-premises environments. The ideal candidate will have deep expertise in Linux systems administration, container orchestration, CI/CD automation, big data platforms, observability, and distributed data processing technologies., This role requires strong experience with OpenShift, Kubernetes, MongoDB, Kafka, Flink, Spark/Cloudera ecosystems, virtualization platforms, and infrastructure automation using Ansible. The engineer will collaborate closely with development, architecture, security, and operations teams to build highly available, scalable, secure, and automated platforms supporting mission-critical enterprise applications., Infrastructure & Platform Engineering

  • Design, deploy, configure, and maintain enterprise Linux-based infrastructure environments.
  • Administer and optimize Red Hat/OpenShift and Kubernetes container platforms.
  • Manage Linux virtual machine environments across VMware, KVM, or cloud-based virtualization platforms.
  • Implement highly available, fault-tolerant, and scalable infrastructure architectures.
  • Perform capacity planning, performance tuning, and infrastructure optimization.

Kubernetes & OpenShift Administration

  • Build and maintain Kubernetes/OpenShift clusters for production and non-production environments.
  • Configure ingress controllers, networking, storage classes, service mesh, operators, and cluster security.
  • Automate deployment pipelines and container lifecycle management.
  • Implement GitOps and Infrastructure-as-Code practices.
  • Troubleshoot cluster performance, node failures, networking issues, and container runtime problems.

Automation & DevOps

  • Develop automation solutions using Ansible for provisioning, patching, configuration management, and application deployment.
  • Build and maintain CI/CD pipelines supporting microservices and distributed platforms.
  • Standardize deployment and operational processes through scripting and automation.
  • Integrate security, compliance, and operational controls into deployment workflows.

Data & Streaming Platform Support

  • Administer and support Apache Kafka clusters including brokers, topics, partitions, replication, and security.
  • Support Apache Flink streaming data platforms and real-time processing pipelines.
  • Manage Cloudera/Spark ecosystems for distributed data processing workloads.
  • Optimize distributed compute and data platforms for performance and resiliency.
  • Support data ingestion, streaming, and large-scale analytics environments.

Monitoring & Observability

  • Implement enterprise monitoring, logging, and observability solutions using Splunk and related tooling.
  • Develop dashboards, alerts, and operational metrics for infrastructure and application monitoring.
  • Conduct root cause analysis and incident troubleshooting across distributed systems.
  • Support production operations, incident response, and problem management activities.

Security & Compliance

  • Implement infrastructure hardening, RBAC, secrets management, and container security best practices.
  • Support enterprise security standards, vulnerability remediation, and compliance initiatives.
  • Ensure operational reliability, backup strategies, and disaster recovery readiness.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Engineering, or related field (or equivalent experience).
  • 6+ years of experience in DevOps, Infrastructure Engineering, Site Reliability Engineering, or Platform Engineering roles.
  • Strong expertise in Linux systems administration (RHEL).
  • Hands-on experience with OpenShift and Kubernetes administration in enterprise environments.
  • Extensive experience with Ansible automation and Infrastructure-as-Code methodologies.
  • Strong experience supporting Kafka, Flink, MongoDB, and Spark/Cloudera platforms.
  • Experience managing Linux virtual machines and virtualization platforms.
  • Experience with CI/CD tools and automated deployment pipelines.
  • Strong scripting skills using Bash, Python, or similar languages.
  • Experience with monitoring and logging platforms such as Splunk.
  • Strong troubleshooting and performance tuning capabilities across distributed systems.

Preferred Experience Areas

  • Enterprise-scale distributed systems
  • Financial services or high-availability environments
  • Real-time data streaming platforms
  • Large-scale containerized environments

Behavioral Skills:

  • Self-starter and experienced in leading the junior resources
  • Hand-on architect with ability to implement and validate the solution
  • Good Communication skills
  • Flexible to rotational shifts, 5 days WFO
  • Team Player
  • Ability to work in a changing environment

Apply for this position