Accompanying code for the video.
Please note, that this is an algorithm tutorial, not production-ready code. If you'd like to compress text, use something like Brotli :)
Copy the archive to ./data folder and unpack with unzip archive.zip. Feel free to remove the archive.
The resulting dataset should be ~892MB.