Distributed Computing

Weeks: Week 12
Description: Introduction to Spark
Test Knowledge: Train a model on a free Databricks cluster
Apache Spark is an open-source engine for distributed processing of very large datasets. PySpark is its Python API, which lets you write Spark jobs in Python.
To get familiar with Spark and get started using it from Python, you can take the Introduction to PySpark course on DataCamp.
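
To give a feel for what PySpark code looks like before starting the course, here is a minimal sketch, not part of the course material. It assumes `pyspark` and a Java runtime are installed locally; the sample data, the column names `city` and `temp`, and the helper name `average_by_city` are hypothetical and chosen only for illustration.

```python
# Hypothetical sample data: (city, temperature) pairs.
ROWS = [
    ("Amsterdam", 10.0),
    ("Amsterdam", 14.0),
    ("Utrecht", 8.0),
    ("Utrecht", 10.0),
]


def average_by_city(rows):
    """Return {city: average temperature}, computed with a local Spark session."""
    # Imported inside the function so that loading this file does not
    # require Spark to be installed; only calling the function does.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    # "local[*]" runs Spark on this machine using all available cores,
    # so no cluster is needed for the sketch.
    spark = (
        SparkSession.builder
        .master("local[*]")
        .appName("intro-to-pyspark-sketch")
        .getOrCreate()
    )
    try:
        # Build a DataFrame from plain Python tuples with named columns.
        df = spark.createDataFrame(rows, ["city", "temp"])
        # Group by city and compute the mean temperature per group.
        result = df.groupBy("city").agg(F.avg("temp").alias("avg_temp")).collect()
        return {row["city"]: row["avg_temp"] for row in result}
    finally:
        # Always release the local Spark resources.
        spark.stop()
```

Calling `average_by_city(ROWS)` should return `{"Amsterdam": 12.0, "Utrecht": 9.0}`. The same DataFrame code is expected to work unchanged on a Databricks cluster, where a session is typically already provided as `spark`.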
 