Site Reliability Engineer New

Sony
Berlin, Germany
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English
Experience level
Senior

Job location

Berlin, Germany

Tech stack

Java
Amazon Web Services (AWS)
Bash
C++
Configuration Management
Data Normalization
Relational Databases
Linux
Distributed Data Store
Elasticsearch
Hadoop
Python
PostgreSQL
Load Testing
MongoDB
MySQL
NoSQL
Package Management Systems
Redis
Reliability Engineering
Ansible
Prometheus
Software Engineering
Web Services
Ceph
Software Distribution
System Availability
Saltstack
Grafana
Technical Debt
Kubernetes
Cassandra
Kafka
Puppet
SDET
Go
Programming Languages

Job description

  • Lead team technical discussions, especially around ongoing improvements in Reliability and Scalability
  • Be involved in creating High Level Designs (HLDs) for new products and platforms
  • Mentor junior SRE staff and enable them for success
  • Lead incident response and post-mortem activities within your assigned service team
  • Work with other Engineers in a cross-functional team to prioritise reliability improvements to address technical debt and toil
  • Contribute to code to improve reliability
  • Implement automation to reduce ongoing toil

Requirements

  • Minimum of 5+ years working experience in Software Development and/or Linux Systems Administration role.
  • Strong interpersonal, written and verbal communication skills.
  • Available to be scheduled in on-call rotation.

Skills & Knowledge:

  • Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
  • Development experience in one or more of the following programming languages:
  • Python (preferred)
  • Bash, Go, Java, C++, or Rust
  • In addition, experience with at least 3 of the following topics:
  • Distributed data storage at scale (Hadoop, Ceph)
  • NoSQL at scale (MongoDB, Redis, Cassandra)
  • Data Aggregation technologies. (ElasticSearch, Kafka)
  • Scaling and running traditional RDBMS (PostgreSQL, MySQL) with High Availability
  • Monitoring & Alerting (Prometheus, Grafana), and Incident Management toolsets
  • Kubernetes and/or AWS (deployment and management)
  • Software Distribution (Package management and distribution at scale)
  • Configuration Management (ansible, saltstack, puppet, chef)
  • S/W Performance analysis and load testing (QA or SDET experience: a plus)

Benefits & conditions

  • Training Provided
  • Regular team and company events
  • Free drinks, fruit or food
  • Flexible working
  • Free Gym or Gym Subsidy
  • Private Medical/Dental healthcare
  • Bonus/Reward Scheme
  • Cycle to work scheme
  • Game Jams

About the company

PlayStation isn't just the Best Place to Play - it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStationreg;5, PlayStationreg;4, PlayStationreg;VR, PlayStationreg;Plus, acclaimed PlayStation software titles from PlayStation Studios, and more. PlayStation also strives to create an inclusive environment that empowers employees and embraces diversity. We welcome and encourage everyone who has a passion and curiosity for innovation, technology, and play to explore our open positions and join our growing global team. The PlayStation brand falls under Sony Interactive Entertainment, a wholly-owned subsidiary of Sony Group Corporation. Site Reliability Engineer As a part of Sony Computer Entertainment, the Future Technology Group (FTG) is leading the cloud gaming revolution, putting console-quality video games on any device, from TVs to consoles to mobile devices and beyond. Our Site Reliability Engineering team plays a significant role in delivering on the promise of a great cloud gaming experience to our customers. We do this by influencing design and operational decisions towards the overall stability of the gaming service. Our SREs focus on three main things: overall ownership of production, production code quality, and deployments. The successful candidate will be self-directed and able to participate in the way we make decisions at different levels. We expect our SREs to have opinions on the state of our service and provide critical feedback during different phases of the operational lifecycle. We are engaged throughout the software development lifecycle, ensuring the operational readiness and stability., Sony Interactive Entertainment pushes the boundaries of entertainment and innovation, starting from the launch of the original PlayStation in Japan in 1994. Today, we continue to deliver innovative and thrilling experiences to a global audience through our PlayStation line of products and services that include generation-defining hardware, pioneering network services, and award-winning games.

Apply for this position