Application Software Engineer | Senior
Informatica
Mexico, United States of America
15 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Mexico, United States of America
Tech stack
Java
Amazon Web Services (AWS)
Software Applications
User Authentication
Software as a Service
DNS
Monitoring of Systems
Networking Basics
OAuth
Performance Tuning
JSON Web Token
Security Assertion Markup Language (SAML)
TCP/IP
Load Balancing
Cloud Platform System
System Availability
Kubernetes
Information Technology
Docker
Programming Languages
Job description
Responsible for managing the day-to-day operations, ensuring product administration, platform reliability, and overseeing incident management and resolution processes. You will collaborate closely with engineering, product, and infrastructure teams to ensure the smooth functioning of systems and platforms and provide a high level of operational support to meet business goals.
- Own incident resolution processes for L1 and L2 operations, ensuring timely and effective troubleshooting of technical issues.
- Define and implement procedures for handling escalations and high-priority incidents.
- Ensure root cause analysis is conducted for major incidents, and follow up on remediation actions.
- Develop and enforce Service Level Agreements (SLAs) and Key Performance Indicators (KPIs) for platform performance and product support operations.
- Monitor adherence to SLAs and manage escalations to maintain customer satisfaction.
- Oversee the platform's operational stability and performance, ensuring high availability and scalability.
- Monitor and manage platform performance metrics, proactively addressing any potential issues.
- Ensure comprehensive documentation of operational procedures, troubleshooting guides, and runbooks for the L1/L2 support teams.
- Track and create detailed operational reports and dashboards for tracking system health
- Manage SaaS platform administrations and self-hosted applications in cloud environments like AWS.
Requirements
- 6+ years of experience in IT
- Strong understanding of IT infrastructure, cloud platforms, and operational best practices.
- Strong experience on Docker, Kubernetes & Helm along with any programming language (Java preferred) experience to support platform KLO & monitoring
- Proven experience with incident management, service management, and driving process improvements.
- Expertise in monitoring tools, automation frameworks, and platform performance optimization.
- Strong understanding of networking fundamentals (TCP/IP, DNS, load balancing, etc) and authentication/authorization mechanisms (OAuth 2.0, SAML, JWT etc)