Site Reliability Engineer - AML Global Recommendation - USDS

TikTok
Culver City, United States of America
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Charing Cross, United Kingdom

Tech stack

Mxnet
C
Artificial Intelligence
Systems Engineering
C++
Program Optimization
Computer Programming
Computer Engineering
Data Structures
Relational Databases
Linux
Distributed Systems
Fault Tolerance
Python
Machine Learning
Recommender Systems
Reliability Engineering
TensorFlow
Software Engineering
PyTorch
Information Technology
Go

Job description

About the Team: Site Reliability Engineering (SRE) of the AML (Applied Machine Learning) team combines system engineering and the art of machine learning to develop and run a massively distributed AI/ML recommendation system for the United States and all around the world.

On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis, and large-scale systems operation. Join us and you'll have the chance to shape the future of AML systems and make a real, tangible impact on TikTok users.

Responsibilities: Design, build, and maintain highly available, scalable, and fault-tolerant systems. Monitor and analyze system performance, identifying and resolving issues before causing user impact. Develop and maintain automated monitoring, alerting, and incident response systems. Collaborate closely with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind. Implement and maintain security best practices and ensure compliance with regulatory requirements. Participate in on-call rotations and respond to issues and incidents within and outside of normal business hours. Conduct root cause analysis of incidents, hold post-mortem reviews with stakeholders, and implement preventative measures to minimize the risk of similar incidents occurring in the future.

Requirements

3 years of experience in a SRE or software engineering role. Expertise in analyzing and troubleshooting Linux-based distributed systems. Bachelor's/Master's degree in Computer Science, Computer Engineering Experience programming with at least one commonly used language (C, C++, Python, Go). Strong understanding of data structures and algorithms. Competent knowledge of relational database systems.

Preferred Qualifications Ability to design and maintain large-scale systems. Strong understanding of code optimization and routine task automation. Proficiency in at least one machine learning framework: TensorFlow, PyTorch, MXNet or PaddlePaddle

About the company

TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.​

Why Join Us

Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.​

We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.​

Diversity & Inclusion​

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.​

Apply for this position