Data Transfer Engineer

Roche
Municipality of Madrid, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Municipality of Madrid, Spain

Tech stack

Artificial Intelligence
Application Layers
AWK (Programming Language)
Bash
Data Transmissions
ETL
File Systems
General Parallel File Systems
Identity and Access Management
InfiniBand
Lightweight Directory Access Protocols (LDAP)
Linux System Administration
OpenID
Performance Tuning
Red Hat Enterprise Linux - RHEL
Ansible
TCP/IP
AI Infrastructure
SSL Certificate Management
Sed (Programming Language)
Scripting (Bash/Python/Go/Ruby)
High Performance Computing
Storage Technologies
Information Technology
Hardware Acceleration
Grep
Virtual Agents
Data Pipelines

Job description

As a Data Transfer Engineer within the Accelerated Compute Engineering (ACE) team, you will be responsible for owning the high-speed data transfer services that feed our advanced compute environments. With the introduction of our industry-leading AI Factory, the ability to move massive, petabyte-scale datasets rapidly and securely is more critical than ever. You will own the deployment, configuration, and ongoing optimization of specialized Data Transfer Appliances that bridge our traditional HPC clusters with our next-generation AI infrastructure. By ensuring seamless, high-bandwidth data mobility and aggressively eliminating network and I/O bottlenecks, you will play a foundational role in ensuring that our researchers can train complex AI models and execute large-scale computational science workloads without friction., Pipeline Architecture & Management

  • Own the end-to-end deployment and lifecycle management of specialized Data Transfer Appliances.
  • Identify and resolve systemic network and I/O bottlenecks to maximize throughput between storage tiers and compute nodes.

Technical Operations & Optimization

  • Manage integration with various storage architectures, including Object Storage, NAS (NFS), and parallel file systems such as GPFS.
  • Implement automation for on-site deployment and configuration of our factory-built and tuned Data Transfer appliances using Ansible, Kickstart, or Red Hat Satellite to ensure scalable and reproducible environments.

Performance & Monitoring

  • Analyze system logs, kernel messages, and network statistics to proactively monitor the health and performance of the data transfer fabric.
  • Define and track success metrics for data mobility, providing insights to leadership on infrastructure utilization and performance trends.

User Enablement & Support Experience .

  • Explore innovative approaches to drive more frictionless consumption of our Data Transfer appliances, e.g, Agentic AI and MCP
  • Develop user-facing documentation, scripts, and tools that simplify complex data movement tasks for the broader scientific community.
  • Provide high-level technical support, isolating complex issues across storage, network, and application layers in a collaborative, cross-functional manner.

Requirements

  • Bachelor's or an advanced degree in Computer Science, Engineering, or a similar technical discipline.
  • Proven experience in managing large-scale Linux environments (RHEL/rebuilds) with deep proficiency in CLI tools (grep, awk, sed, etc.).
  • Demonstrated experience in high-performance computing (HPC) or AI infrastructure environments.

Technical & Business Skills:

  • Scripting & Automation: Strong proficiency in Bash and Python scripting, along with hands-on experience using Ansible for infrastructure as code.
  • Storage & Networking: Deep understanding of parallel file systems (Lustre, GPFS) and high-speed networking (InfiniBand, TCP/IP tuning).
  • Security: Solid grasp of IAM configurations (LDAP, AD, OIDC), JWT tokens, and certificate management.
  • Problem Solving: A diagnostic mindset with the ability to interpret complex logs (system, kernel, network) to isolate performance degradation.
  • Technical Communication: Excellent ability in technical English (reading and writing) to document complex architectures and guide users.

Leadership & Mindset:

  • Lean & Agile Mindset: You focus on automation and efficiency to scale support and operations.
  • Enterprise Mindset: Ability to break down silos and collaborate across organizational boundaries to ensure end-to-end data mobility.
  • Intellectual Curiosity: A passion for staying current with major IT market trends, specifically in AI hardware and high-speed data movement.

About the company

Hosting and Infrastructure (HI) provides mission-critical on-premise infrastructure, cloud hosting, connectivity, and technology products that enable all functions at every Roche site to develop, innovate, connect, and deliver compliant digital products across the Roche Enterprise. The Value Streams - Accelerated Compute Engineering (ACE) Team is focused on driving both customer success and platform success by acting as a center of excellence and delivery for the High Performance Compute and AI Infrastructure supporting AI and HPC use cases across Roche. This team facilitates seamless onboarding and adoption for business vertical customers needing accelerated compute-helping those infrastructure consumers with needs optimized for high availability, seamless data transfer, flexibility, speed, and the rapidly changing needs of AI-helping achieve rapid time-to-value., A healthier future drives us to innovate. Together, more than 100'000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact. Let's build a healthier future, together. Roche is an Equal Opportunity Employer.

Apply for this position