Hpc Infrastructure Reliability Engineer

Tata Consultancy Services
Meira, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English, Spanish
Experience level
Senior

Job location

Remote
Meira, Spain

Tech stack

Artificial Intelligence
Bash
Data Centers
Ethernet
InfiniBand
Python
Software Maintenance
Ansible
Graphics Processing Unit (GPU)
Infrastructure as Code (IaC)
Infrastructure Automation Frameworks
Information Technology
Bare Metal
Hardware Asset Management
Network Server
ServiceNow

Job description

Are you a HPC Infrastructure Reliability Engineer seeking a new interesting challenge ?¿Interesado en saber más sobre este trabajo?Desplácese hacia abajo y descubra qué habilidades, experiencia y cualificaciones académicas se necesitan.If your answer is yes, it's your lucky day so keep reading, it can be just what you're looking for !WHAT WILL YOU DO?We are looking for a dynamic, proactive and talented person to join our team and perform the following tasks :Manage and optimize high-performance physical infrastructure (servers, GPUs, and advanced networking)Ensure availability, performance, and reliability of HPC and AI environmentsDrive infrastructure automation (IaC) and enable zero-touch provisioningOversee the full hardware lifecycle (capacity planning, deployment, and decommissioning)Work with tools such as HPE OneView, Lenovo XClarity, and ServiceNow CMDBCollaborate with R&D, science, and engineering teams to design optimal infrastructure solutionsOptimize resource utilization (CPU/GPU) and improve overall infrastructure efficiencyWHAT ARE WE LOOKING FOR?5-7+ years of experience in Data Center Engineering, Bare Metal, or HPC InfrastructureStrong expertise in enterprise hardware (HPE, Lenovo) and high-performance systemsHands-on experience with GPUs (NVIDIA) and AI/HPC environmentsSolid knowledge of high-speed networking (e.G., InfiniBand, high-throughput Ethernet)Proven experience in Infrastructure as Code (IaC) and automation (Python, Bash, Ansible, or similar)Experience with infrastructure management tools such as HPE OneView and/or Lenovo XClarityGood understanding of the hardware lifecycle (capacity planning, deployment, decommissioning)Strong communication skills in English, with the ability to collaborate with technical and business stakeholdersWHERE AND WHEN?Workplace: Madrid (Hybrid model)Work Schedule: Business HoursWHAT CAN WE OFFER YOU?Permanent contract - We offer indefinite contracts from the first day.Pay and benefits - Competitive salary and a flexible compensation plan adapted to your needs (Ticket restaurant plan, Childcare Ticket, Transport Ticket and Health Insurance).Work from home - We offer you a financial support every month for your working from home expenses.In addition, in the first month we will give you a bonus to help you to set up your workplaceOpportunity knocks - Being a part of a growing company, we want to support your path with a career development plan and annual performance-based compensation reviews .Learn as you grow - Starting with a fantastic onboarding program, TCS has robust learning platforms that will allow you to learn and grow personal as professionally.Bring your buddy - If you have referred a friend for an open position under the BYB Scheme and she/he is hired you'll receive a very attractive cash award.xqysrnhConnect globally - Work with people from all over the world.You can feel the multicultural workforce .Benefit from being a TCSer - By being part of the TCS Spain family you can enjoy benefits, offers and corporate discounts on the best brands .And so on - Appreciations, incentives, Team Building activities, diversity and inclusion programs, sustainability activities, corporative events...This has only just begun!WHO ARE WE?Tata Consultancy Services (TCS) is an Information Technology (IT) company founded in **** as part of the Indian Tata Group and is one of the top 3 technology companies globallyWith a presence in 55 countries and more than 590,000 employees, TCS is considered one of the 10 best companies to work for worldwide in **** according to the Top Employers InstituteTCS Spain started operations in **** and currently has a diverse workforce that collaborates with the main Spanish and multinational companiesTCS Spain has been certified as a Top Employer **** and has also been chosen as one of the 100 Best Companies to Work for in Spain in **** according to ForbesAmong the portfolio of services, TCS has information technology services, asset-based solutions, global consulting, engineering and industrial services, digital solutions and services, application maintenance and development, quality assurance and testing services, IT infrastructure and BPSResponsible for development, TCS Spain is committed to inclusion, diversity and sustainability, and promotes flexibility policies that support wellbeing and work-life balanceWELCOME, WE ARE WAITING FOR YOU!

Requirements

We are looking for a dynamic, proactive and talented person to join our team and perform the following tasks :Manage and optimize high-performance physical infrastructure (servers, GPUs, and advanced networking)Ensure availability, performance, and reliability of HPC and AI environmentsDrive infrastructure automation (IaC) and enable zero-touch provisioningOversee the full hardware lifecycle (capacity planning, deployment, and decommissioning)Work with tools such as HPE OneView, Lenovo XClarity, and ServiceNow CMDBCollaborate with R&D, science, and engineering teams to design optimal infrastructure solutionsOptimize resource utilization (CPU/GPU) and improve overall infrastructure efficiencyWHAT ARE WE LOOKING FOR? 5-7+ years of experience in Data Center Engineering, Bare Metal, or HPC InfrastructureStrong expertise in enterprise hardware (HPE, Lenovo) and high-performance systemsHands-on experience with GPUs (NVIDIA) and AI/HPC environmentsSolid knowledge of high-speed networking (e.G., InfiniBand, high-throughput Ethernet)Proven experience in Infrastructure as Code (IaC) and automation (Python, Bash, Ansible, or similar)Experience with infrastructure management tools such as HPE OneView and/or Lenovo XClarityGood understanding of the hardware lifecycle (capacity planning, deployment, decommissioning)Strong communication skills in English, with the ability to collaborate with technical and business stakeholdersWHERE AND WHEN?

Benefits & conditions

Permanent contract - We offer indefinite contracts from the first day.Pay and benefits - Competitive salary and a flexible compensation plan adapted to your needs (Ticket restaurant plan, Childcare Ticket, Transport Ticket and Health Insurance). Work from home - We offer you a financial support every month for your working from home expenses. In addition, in the first month we will give you a bonus to help you to set up your workplaceOpportunity knocks - Being a part of a growing company, we want to support your path with a career development plan and annual performance-based compensation reviews . Learn as you grow - Starting with a fantastic onboarding program, TCS has robust learning platforms that will allow you to learn and grow personal as professionally.Bring your buddy - If you have referred a friend for an open position under the BYB Scheme and she/he is hired you'll receive a very attractive cash award. xqysrnhConnect globally - Work with people from all over the world. You can feel the multicultural workforce . Benefit from being a TCSer - By being part of the TCS Spain family you can enjoy benefits, offers and corporate discounts on the best brands . And so on - Appreciations, incentives, Team Building activities, diversity and inclusion programs, sustainability activities, corporative events... This has only just begun!

About the company

Tata Consultancy Services (TCS) is an Information Technology (IT) company founded in **** as part of the Indian Tata Group and is one of the top 3 technology companies globallyWith a presence in 55 countries and more than 590,000 employees, TCS is considered one of the 10 best companies to work for worldwide in **** according to the Top Employers InstituteTCS Spain started operations in **** and currently has a diverse workforce that collaborates with the main Spanish and multinational companiesTCS Spain has been certified as a Top Employer **** and has also been chosen as one of the 100 Best Companies to Work for in Spain in **** according to ForbesAmong the portfolio of services, TCS has information technology services, asset-based solutions, global consulting, engineering and industrial services, digital solutions and services, application maintenance and development, quality assurance and testing services, IT infrastructure and BPSResponsible for development, TCS Spain is committed to inclusion, diversity and sustainability, and promotes flexibility policies that support wellbeing and work-life balanceWELCOME, WE ARE WAITING FOR YOU!

Apply for this position