Marek Suppa

Serverless deployment of (large) NLP models

How do you fit a 400MB NLP model into a 250MB serverless function? Learn the model distillation and dependency tricks that make it possible.

#1 · about 9 minutes

Exploring practical NLP applications at Slido

Several NLP-powered features are used to enhance user experience, including keyphrase extraction, sentiment analysis, and similar question detection.
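
Of these, sentiment analysis is the most self-contained to prototype. A minimal sketch with the Hugging Face `transformers` pipeline, using a public checkpoint rather than whatever Slido runs in production:

```python
# Sentiment analysis over audience questions (illustrative sketch).
# The checkpoint below is a public model, not Slido's production one.
from transformers import pipeline

sentiment = pipeline(
    "sentiment-analysis",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

questions = [
    "This talk is fantastic, thank you!",
    "The audio keeps cutting out and it is very frustrating.",
]
for q in questions:
    result = sentiment(q)[0]  # {'label': 'POSITIVE'/'NEGATIVE', 'score': ...}
    print(f"{result['label']:>8}  {result['score']:.3f}  {q}")
```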

#2 · about 4 minutes

Choosing serverless for ML model deployment

Serverless was chosen for its ease of deployment and minimal maintenance, but it introduces challenges like cold starts and strict package size limits.
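
Cold starts are the flip side of paying nothing while idle: the first request after a scale-up has to load the runtime and the model. A common mitigation, sketched below with an illustrative event shape, is to initialize the model at module import time so warm invocations reuse it:

```python
import json

def load_model():
    """Stand-in for an expensive model load (illustrative only)."""
    return lambda text: {"label": "POSITIVE", "score": 0.99}

# Module-level init runs once per cold start; warm invocations reuse MODEL.
MODEL = load_model()

def handler(event, context):
    # Event shape is illustrative (API Gateway proxy integration).
    text = json.loads(event["body"])["text"]
    return {"statusCode": 200, "body": json.dumps(MODEL(text))}
```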

#3 · about 8 minutes

Shrinking large BERT models for sentiment analysis

Knowledge distillation is used to train smaller, faster models like TinyBERT from a large, fine-tuned BERT base model without significant performance loss.
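
A minimal sketch of the classic soft-target distillation loss, assuming teacher and student both emit classification logits. The full TinyBERT recipe also matches intermediate attention maps and hidden states, which this sketch omits; the temperature and mixing weight are illustrative:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    # Soft targets: push the student toward the teacher's tempered
    # output distribution (KL divergence, scaled by T^2).
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # Hard targets: ordinary cross-entropy against the gold labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```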

#4 · about 8 minutes

Building an efficient similar question detection model

Sentence-BERT (SBERT) provides an efficient alternative to standard BERT for semantic similarity, and knowledge distillation helps create smaller, deployable versions.
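
Duplicate detection then reduces to embedding each incoming question once and comparing it against existing ones by cosine similarity. A minimal sketch with the `sentence-transformers` library; the checkpoint and the 0.7 threshold are illustrative, not the distilled model from the talk:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # public SBERT checkpoint

new_question = "When will the recording be available?"
existing = [
    "Will this session be recorded?",
    "What time does the lunch break start?",
]

emb_new = model.encode(new_question, convert_to_tensor=True)
emb_existing = model.encode(existing, convert_to_tensor=True)
scores = util.cos_sim(emb_new, emb_existing)[0]

for question, score in zip(existing, scores):
    flag = "DUPLICATE?" if float(score) > 0.7 else ""
    print(f"{float(score):.2f}  {question}  {flag}")
```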

#5 · about 3 minutes

Using ONNX Runtime for lightweight model inference

The large PyTorch library is replaced with the much smaller ONNX Runtime to fit the model and its dependencies within AWS Lambda's package size limits.
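
At inference time the deployment package then needs only `onnxruntime` and `numpy`, not PyTorch. A minimal sketch, assuming the distilled model was already exported to `model.onnx`; the input names depend on how the export was done and are illustrative here:

```python
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx")

# A lightweight tokenizer (e.g. the `tokenizers` package) would produce
# these; fixed token ids keep the sketch self-contained.
input_ids = np.array([[101, 2023, 2003, 2307, 102]], dtype=np.int64)
attention_mask = np.ones_like(input_ids)

logits = session.run(
    None,  # fetch all outputs
    {"input_ids": input_ids, "attention_mask": attention_mask},
)[0]
print("predicted class:", int(np.argmax(logits, axis=-1)[0]))
```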

#6 · about 3 minutes

Analyzing serverless ML performance and cost-effectiveness

Increasing allocated RAM for a Lambda function improves inference speed, potentially making serverless more cost-effective than a dedicated server for uneven workloads.
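
The intuition: Lambda bills memory times duration (GB-seconds) and allocates CPU in proportion to memory, so if doubling RAM roughly halves inference time, the per-request cost stays flat while latency drops. A back-of-the-envelope sketch with illustrative durations; the rate is the published x86 on-demand price at the time of writing:

```python
PRICE_PER_GB_SECOND = 0.0000166667  # check current AWS Lambda pricing

def cost_per_request(memory_mb, duration_s):
    return (memory_mb / 1024) * duration_s * PRICE_PER_GB_SECOND

# Illustrative: doubling memory roughly halves duration (more CPU).
for memory_mb, duration_s in [(512, 2.0), (1024, 1.0), (2048, 0.5)]:
    print(f"{memory_mb:>5} MB  {duration_s:.1f} s  "
          f"${cost_per_request(memory_mb, duration_s):.7f} per request")
```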

#7 · about 3 minutes

Key takeaways for deploying NLP models serverlessly

Successful serverless deployment of large NLP models requires aggressive model size reduction, lightweight inference libraries, and an understanding of the platform's limitations.
