Spark is an open-source platform for processing extremely large data. Pyspark is a python API designed to facilitate use of Spark in Python.
To get familiar with Spark and getting started from Python, you can use the Introduction to Pyspark on DataCamp!