Master Databricks and Apache Spark Step by Step: Lesson 21 - PySpark Using RDDs
Автор: Bryan Cafferky
Загружено: 2021-06-22
Просмотров: 12692
In this video, we use PySpark to analyze data with Resilient Distributed Datasets (RDD). RDDs are the foundation of Spark. You learn what RDDs are, what Lazy Evaluation is and why it matters, and how to use Transformations and Actions. Everything is demonstrated using a Databricks notebook.
Video slides and code at:
https://github.com/bcafferky/shared/b...
Apache Spark Transformations Docs
https://spark.apache.org/docs/latest/...
Apache Spark Actions Docs
https://spark.apache.org/docs/latest/...
Apache Spark RDD
https://spark.apache.org/docs/latest/...
For information on how to upload files to Databricks see:
• Видео
Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: