Ron Dagdag
Making neural networks portable with ONNX
#1about 6 minutes
Understanding ONNX as a portable format for ML models
Machine learning models are made portable across different frameworks and hardware using the ONNX open standard, similar to how PDF works for documents.
#2about 2 minutes
When to use ONNX for your machine learning projects
ONNX is ideal for deploying models across different programming languages, achieving low-latency inferencing, and running on resource-constrained edge or IoT devices.
#3about 12 minutes
Four methods for creating or acquiring ONNX models
Models can be obtained from the ONNX Model Zoo, built with tools like Azure Custom Vision, converted from existing frameworks like PyTorch, or used as an intermediary format.
#4about 7 minutes
Deploying models with the high-performance ONNX Runtime
The ONNX Runtime is a high-performance inference engine for deploying models to the cloud or edge devices, bridging the gap between data science and production software engineering.
#5about 4 minutes
Running an ONNX model in a Node.js application
A practical demonstration shows how to load an ONNX model and perform inference within a server-side Node.js application using the `onnxruntime-node` package.
#6about 9 minutes
Performing inference in the browser with ONNX Runtime Web
An emotion detection model is run directly in the browser using ONNX Runtime Web, showcasing client-side inference with JavaScript for privacy and offline capability.
#7about 3 minutes
Optimizing ONNX models for mobile and React Native
ONNX Runtime Mobile provides a lightweight solution for iOS and Android by converting models to a pre-optimized `.ort` format for smaller binary sizes.
#8about 8 minutes
Q&A on starting a career in machine learning
Advice is given on how software developers can enter the machine learning field by starting with model integration and deployment before diving deep into model creation.
Related jobs
Jobs that call for the skills explored in this talk.
ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
TypeScript
React
+3
Douglas GmbH
Düsseldorf, Germany
Intermediate
Senior
TypeScript
React
Matching moments
01:34 MIN
Understanding the ONNX format for model interoperability
Introduction to Azure Machine Learning
02:50 MIN
Using ONNX Runtime for lightweight model inference
Serverless deployment of (large) NLP models
06:37 MIN
Simplifying development with high-level AI frameworks
Privacy-first in-browser Generative AI web apps: offline-ready, future-proof, standards-based
03:07 MIN
Deploying machine learning models in a JavaScript environment
Shoot for the moon - machine learning for automated online ad detection
24:48 MIN
Consuming an ONNX model in a .NET console application
Machine Learning in ML.NET
04:26 MIN
Deploying ML models as a web service with Ktor
The best of both worlds: Combining Python and Kotlin for Machine Learning
01:02 MIN
Comparing platform-dependent and independent model deployment strategies
Developing an AI.SDK
03:58 MIN
Deploying the trained model as an inference service
From Code to Motion: Building an Autonomous Hat-Hunting Robot with Kubernetes & ML
Featured Partners
Related Videos
From ML to LLM: On-device AI in the Browser
Nico Martin
How AI Models Get Smarter
Ankit Patel
WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA
Ankit Patel
Machine learning in the browser with TensorFlowjs
Håkan Silfvernagel
MLOps - What’s the deal behind it?
Nico Axtmann
The best of both worlds: Combining Python and Kotlin for Machine Learning
Nils Kasseckert
Machine Learning in ML.NET
Marco Zamana
Computer Vision from the Edge to the Cloud done easy
Flo Pachinger
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.





Imec
Azure
Python
PyTorch
TensorFlow
Computer Vision
+1

Norbert Health
Paris, France
Senior
C++
Python
PyTorch
Computer Vision
Machine Learning


Nvidia
Remote
Intermediate
C++
Python
Machine Learning
Software Architecture

Neural Concept
Großmehring, Germany
Fluid
Python
Machine Learning