DevOps Platform Engineer - 100% Remote (from anywhere in Europe)
Digistore24 GmbH
15 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English, German Experience level
IntermediateJob location
Remote
Tech stack
Amazon Web Services (AWS)
Continuous Integration
DevOps
Elasticsearch
Monitoring of Systems
Information Technology Operations
Performance Tuning
Cloud Services
Prometheus
Systems Architecture
Grafana
Infrastructure as Code (IaC)
Kubernetes
Terraform
Job description
- Automation and Infrastructure as Code (IaC): You automate repetitive tasks, deployments, and system management to reduce human error and improve efficiency. This might involve creating scripts, CI/CD pipelines, or automating infrastructure provisioning.
- Collaboration with Development and SRE Teams: You work closely with developers and sire reliability engineers to align platform capabilities with application requirements. You provide tools and frameworks that promote best practices in CI/CD, testing and deployment.
- Reliability and Performance Optimization: You continuously improve the system uptime by identifying bottlenecks and optimizing system architecture.
- Capacity Planning and Scaling: You assess and predict system resource requirements (CPU, memory, storage) to ensure the infrastructure can scale with increasing demand. Implement auto-scaling solutions to handle load spikes without human intervention, ensuring systems remain performant under various conditions.
- System Monitoring and Incident Response: Continuously monitor system performance, uptime, and reliability using tools like Prometheus, Grafana, or ElasticSearch. The goal is to detect and respond to issues before they impact users. Manage and respond to incidents, outages, and failures quickly, aiming to minimize downtime. This includes managing incident documentation, communication, and post-incident analysis.
Your benefits at Digistore24
- Work in your home office, as long as you can guarantee uninterrupted internet access
- Regular further education
- The stability of an extremely successful German high-tech company that is funded by its successful product and not by investors
- Outcome focused teams and a culture of direct feedback
- Modern equipment
- International, collaborative team with strong cohesion
- Spectacular team events in various European countries
- Autonomy from day one
- Work in your team on a first-name basis, without a dress code, and at eye level
- Flexible working hours from Mondays to Fridays, * Daily morning video call to talk to your team about yesterday's progress and today's plans.
- You check the latest cloud logs and pull requests and update an IaC module after checking the change-log for breaking changes.
- Then you take a new ticket from the backlog to develop a dashboard solution for a better overview of the system status.
- You discuss different solutions with your colleagues and start writing the code for the solution.
- After your lunch break, a developer needs help with a new CI/CD workflow. You discuss the requirements with him and provide him with an initial prototype.
- You take the ticket to check the resource allocation of an application, check the current utilization and adjust the deployment.
- At the end of the day, you review a colleague's code changes and approve their pull request.
- You find an endpoint that is not yet included in the monitoring. After creating a ticket for this, you immediately write the code in the Terraform project to add it.
Requirements
Do you have experience in Terraform?, * Communication Mastery: You communicate precisely and in a recipient-friendly manner. You diffuse potential conflicts with sensitivity and a solution-oriented approach. You always strike the right tone with stakeholders, developers and your team, even under time pressure, and can seamlessly switch from German to English if necessary.
- Collaboration Wizardry: You collaborate with developers, stakeholders and operations and bring everyone on the same page. You understand the challenges of different teams and find solutions that benefit the entire company.
- Automation Sorcery: You promote automation as a way to save time and reduce errors, and implement tools that improve productivity across the team.
- Problem-Solving Genius: You dive deep into problems, identify root causes and come up with solutions that prevent future incidents.
- Self-organization: You thrive on autonomy and excel at organizing and structuring complex projects while working from home.
- Additional MUST HAVE skills:
- Cloud Services (Preferably Google, but AWS is also okay)
- Very good experience with Terraform / Terragrunt
- Very good experience with Kubernetes / Container Technology
- Excellent spelling and grammar in German, * … you have less than 3 years of experience in IT operations
- … you can't take ownership and need to discuss every detail with your supervisor or colleagues
- … you have difficulty planning and prioritizing your tasks
- … you don't like to find solutions for complex problem
- … you are not confident speaking German AND English