Note that there are separate sets of assignments for CS 451/651 and CS 431/631. Make sure you work on the correct assignments!
This assignment requires you to compute statistics over tokens in the text of Shakespeare's plays, as you did for Assignment 1. However, instead of performing the analysis purely in Python, you will use Spark (with a Python driver program).
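As a rough illustration of what "Spark with a Python driver program" can look like, here is a minimal sketch of a token count over a text file. It is not the assignment's required approach: the `tokenize` function is a hypothetical stand-in (it is not the tokenizer from Assignment 1), and `sc` is assumed to be a `SparkContext` that already exists in your notebook.

```python
import re
from operator import add


def tokenize(line):
    """Hypothetical tokenizer: lowercase the line and keep runs of letters.

    This is an assumption for illustration only, not the course tokenizer.
    """
    return re.findall(r"[a-z]+", line.lower())


def token_counts(sc, path):
    """Return an RDD of (token, count) pairs for the text file at `path`.

    `sc` is assumed to be an existing SparkContext; `path` is any text file.
    """
    lines = sc.textFile(path)              # one RDD element per line of text
    return (lines.flatMap(tokenize)        # split every line into tokens
                 .map(lambda tok: (tok, 1))  # pair each token with a count of 1
                 .reduceByKey(add))        # sum the counts for each token
```

The transformations above are lazy, so nothing is computed until an action such as `take` or `collect` is called on the returned RDD, for example `token_counts(sc, path).take(10)`.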
For this assignment, you should continue to use the Waterloo CS Jupyter hub. Use your password for the
You will need to upload the following files to your working folder on the hub:
When you have finished, submit A2 using the same procedure you used for A1:
That's it! Note that your A2.ipynb should be committed and pushed to origin before the deadline.