Lab Support Engineer (Linux & Windows)
Role details
Job location
Tech stack
Job description
As a Lab Support Engineer at Graphcore, you'll play a key role in keeping our people and systems running efficiently. You'll provide day-to-day support to internal customers and external collaborators in on-prem Hardware Labs and Silicon development projects in hybrid Linux & Windows environments. You will help colleagues resolve issues quickly while maintaining a strong focus on security and following established guidelines. The Team You'll be joining a multidisciplinary team with strong technical skills and a very encouraging culture. We work closely together and regularly share knowledge, and your skills will make a direct impact on our business. It's an exciting and pivotal moment for us right now, with plenty of new projects ahead. If you're looking to solve interesting problems and see your work deliver real-world results, this is the team for you. Responsibilities and Duties
- Logging and solving support requests face-to-face & via ticketing system, L1&L2 support
- Handling and maintaining Linux-based systems
- Handling and providing support for Windows-based systems
- Managing a fleet of servers, helping Hardware Lab teams with daily activities
- Installing and troubleshooting servers, hardware maintenance, fault finding
- Detailing solutions and maintaining a clear, up-to-date internal knowledge base
Requirements
- Excellent communication and customer service skills
- Good understanding of troubleshooting principles and methodical problem-solving
- Strong Linux administration skills in Debian & RedHat derivatives
- Good Windows administration skills
- Good networking skills such as VLANs, VPNs, Wi-Fi, routing, subnetting
- Familiarity with desktop and server hardware, BMCs, Out-of-Band networks, firmware & BIOS upgrades, PDU, rack mounts
- Managing Infrastructure-as-Code using Puppet, Ansible, or similar
Desirable
- Experience with identifying network/storage/CPU/RAM bottlenecks across complex workloads
- Experience with various monitoring solutions and stack (e.g. Zabbix/Prometheus/Grafana Mimir/Open Telemetry)
- Experience in managing web servers, load-balancers, reverse-proxies (e.g.: ha-proxy, nginx)
- Proficiency with containerisation frameworks and orchestration (e.g.: Docker/containerd/Kubernetes)
- Python programming skills, with the ability to write code to interact with APIs, process data, and build small applications