Sign up or log in to watch the video
PySpark - Combining Machine Learning & Big Data
Ayon Roy - 4 years ago
With the ever increasing flow of data, comes the industry focus on how to use those data for driving business & insights; but what about the size of the data these days, we have to deal with ? The more cleaner data you have, its good for training your ML ( Machine Learning ) models, but sadly neither the world feeds you clean data nor the huge amount of data is capable of fast processing using common libraries like Pandas etc. How about using the potential of big data libraries with support in Python to deal with this huge amount of data for deriving business insights using ML techniques? But how can we amalgamate the two? Here comes “ **PySpark : Combining Machine Learning & Big Data** “. Usually people in the ML domain prefer using Python; so combining the potential of Big Data technologies like Spark etc to supplement ML is a matter of ease with pyspark ( A Python package to use the Spark’s capabilities ).
Jobs with related skills
Senior Software Engineer für Datenprojekte
inovex GmbH
·
1 month ago
Munich, Germany
+7
Senior AI and Data Engineer (m/f/d)
CGI
·
10 days ago
Eschborn, Germany
+2
Hybrid
ANWENDUNGSENTWICKLER (M/W/D) MIT SCHWERPUNKT ERP
RENA Technologies GmbH
·
31 days ago
Gütenbach, Germany
Software Developer (m/w/d) – Web Services & Network
Pan Dacom Direkt GmbH
·
31 days ago
Dreieich, Germany
Related videos