Site Reliability Developer

BINGHAMTOM UNIVERSITY
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 99K

Job location

Tech stack

Java
Artificial Intelligence
Amazon Web Services (AWS)
JIRA
Bash
Unix
Computer Programming
Databases
DevOps
Amazon DynamoDB
Github
Gradle
Groovy
Identity and Access Management
Java Virtual Machine (JVM)
Python
Linux System Administration
Node.js
Scrum
Redis
Reliability Engineering
Data Streaming
Datadog
Data Logging
Scripting (Bash/Python/Go/Ruby)
Cloud Platform System
Grafana
Spring-boot
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Deployment Automation
Amazon Web Services (AWS)
Kafka
Build Tools
Route53
Functional Programming
Cloudwatch
Api Gateway
Kibana
Autodesk Autocad
Dynatrace
Docker
Service Stack
ELK
Jenkins
ServiceNow
Go

Job description

We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will be leading design and development of resilient and scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners., * Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture

  • Independently manage requirement analysis, solution design, implementation, and release planning
  • Ensure high adherence to trust and security compliance, guidelines and standards
  • Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security
  • Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices
  • Implement and maintain configuration management and infrastructure as code (IaC) using Terraform
  • Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and period maintenance activities
  • Contribute to critical vulnerability (CVEs) remediation efforts
  • Promote and document security and best practices across all pillars of DevOps/SRE throughout system design
  • Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues
  • Participate in on-call rotations, providing critical 24×7 support for production systems

Requirements

  • Bachelor's degree or higher in Computer Science, Engineering, or a related field
  • 5+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field
  • Proficiency with managing AWS resources and understanding of networking and security protocols
  • Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation
  • Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory
  • Experience with container-based technologies like Docker and AWS ECS
  • Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch
  • Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment
  • Proficiency in programming languages such as UNIX, Python, Go, Bash, Groovy, and Node.js
  • Technology Stack: Java/SpringBoot, AWS (ECS Fargate, Elastic Cache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, Open Search), Kafka, GoLang, Node.js, Groovy, Python, Jenkins, GitHub, Jira, ServiceNow, and Splunk

Preferred Qualifications

  • Knowledge in applying AI and ML solutions for engineering processes and/or DevOps automation
  • Knowledge of standardized observability frameworks such as OpenTelemetry
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer)
  • Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures
  • Broad knowledge with data streaming pipelines like Kinesis, Firehose, and Kafka
  • Knowledge on core Java and SpringBoot concepts in JVM optimization
  • Knowledge on build tools, e.g. Gradle
  • Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment
  • Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership

#LI-AD1

Benefits & conditions

Salary is one part of Autodesk's competitive compensation package. For Canada-BC based roles, we expect a starting base salary between $98,600 and $144,650. Offers are based on the candidate's experience and geographic location, and may exceed this range. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & Belonging We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

About the company

Welcome to Autodesk! Amazing things are created every day with our software - from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made. We take great pride in our culture here at Autodesk - it's at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world. When you're an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Apply for this position