Bobur Umurzokov
Convert batch code into streaming with Python
#1about 5 minutes
Why Python is ideal for data streaming frameworks
Python-based frameworks unify streaming and processing components, simplifying connections to data sources and allowing focus on business logic.
#2about 2 minutes
Key use cases for Python streaming frameworks
Explore applications for Python streaming frameworks, including event-driven microservices, real-time data pipelines, and ML/LLM applications.
#3about 2 minutes
Introducing Pathway for unified batch and streaming
Pathway is a Python framework that allows you to build a data pipeline once and run it in both batch and streaming modes with a single configuration change.
#4about 3 minutes
Understanding Pathway's internal data handling and connectors
Data is structured into tables with defined schemas that are automatically updated in real-time, and custom connectors can be built for any data source.
#5about 3 minutes
Building real-time AI applications with Pathway
Use Pathway for real-time data indexing in RAG applications and leverage the llm-app project to avoid vector database synchronization issues.
#6about 7 minutes
Showcasing real-time AI application examples
Review several practical AI applications built with Pathway, including a document Q&A tool, a discount finder, and a real-time alerting system.
#7about 5 minutes
Live demo of a real-time Dropbox Q&A application
A walkthrough of a Python application that connects to Dropbox, indexes documents in real-time, and answers questions across multiple files.
#8about 2 minutes
Key takeaways for modern data processing
Python frameworks offer a unified platform for batch and streaming, enable custom data pipelines, and simplify bringing real-time data to LLM applications.
#9about 6 minutes
Q&A on latency, event processing, and migration challenges
Addressing audience questions about how Pathway ensures low latency, handles complex event processing, and the common challenges of migrating from batch to streaming.
#10about 4 minutes
Q&A on performance, parallelism, and organizational impact
Answering questions about handling data skew, load balancing, data parallelism for speed, and how real-time processing impacts organizational decision-making.
#11about 8 minutes
Q&A on future trends and the developer advocate role
Discussing the future evolution of real-time technologies, resource optimization, UX improvements, and the role of a developer advocate in the tech industry.
Related jobs
Jobs that call for the skills explored in this talk.
Featured Partners
Related Videos
Python-Based Data Streaming Pipelines Within Minutes
Bobur Umurzokov
Let's Get Started With Apache Kafka® for Python Developers
Lucia Cerchie
PySpark - Combining Machine Learning & Big Data
Ayon Roy
Tips, Techniques, and Common Pitfalls Debugging Kafka
DeveloperSteve
A beginner’s guide to modern natural language processing
Jodie Burchell
Accelerating Python on GPUs
Paul Graham
Overview of Machine Learning in Python
Adrian Schmitt
Alibaba Big Data and Machine Learning Technology
Dr. Qiyang Duan
From learning to earning
Jobs that call for the skills explored in this talk.
Data Engineer Python PySpark SQL
Client Server
Municipality of Madrid, Spain
€120K
Azure
Spark
Python
Pandas
+2
Senior Data Engineer (Python, Spark, Kafka)
Roku, Inc.
Cambridge, United Kingdom
€53K
Senior
Hive
Spark
Kafka
Python
+2
Data Analyst with Power BI, Python, PL SQL and Azure Synpase
Towards AI, Inc.
Municipality of Bilbao, Spain
ETL
Azure
Python
Agile Methodologies
AI Platform Engineer with Python and Terraform
EPAM Systems
Lleida, Spain
GIT
DevOps
Python
Docker
Terraform
+4
ML & Data Engineer - GenAI & Cloud Infra - Pur Player Data IA ML
WAKE IT UP
Paris, France
Python
Ansible
Terraform
Continuous Integration


