Skip to content
forked from MSeal/kdd2016

Notebook Code following NLP talk at KDD 2016

License

Notifications You must be signed in to change notification settings

buggooo/kdd2016

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository is a dump of many of the code snippets used in the Big Data NLP talk at KDD in 2016. Feel free to reach out to us with questions, improvements, or suggestions.

Here's a list of modules we'll be using in ipython notebooks. This will function with any Operating System.

Install Python 2.7 and some module dependencies 
    pip install ipython nltk networkx zss datasketch agglomcluster
    # This is slow, the wordnet dependency is large consider downloading after talk
    python -e "import nltk; nltk.download(punkt); nltk.download(wordnet)"

Stanford parser (bash commands -- very large, consider downloading after talk):
    wget http://nlp.stanford.edu/software/stanford-corenlp-full-2015-12-09.zip
    unzip stanford-corenlp-full-2015-12-09.zip

    git clone https://github.com/brendano/stanford_corenlp_pywrapper
    cd stanford_corenlp_pywrapper
    pip install .
    cd ..

Code using these modules:
https://github.com/MSeal/kdd2016

About

Notebook Code following NLP talk at KDD 2016

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%