This project is now deprecated. If you would like to complete it, feel free to send PR's.
General natural language processing utilities for Dart. It provides a simple API for getting started with natural language processing (NLP), Artificial Intelligence (AI) and Natural Language Generation (NLG) tasks.
This package will initially support English. In future, it may support other major languages like Spanish, Russian, Chinese (maybe).
- Tokenization
- String distance
- n-grams
- Markov chain
- Classifiers
- Phonetics
- Language identification
- Summarization
- Part-of-speech tagging (POS)
- Sentiment Analysis
- TF-IDF
- Words Inflection and Lemmatization
SpaceTokenizer tokenizer = new SpaceTokenizer();
tokenizer.tokenize('brown fox jumps');
===> ['brown', 'fox', 'jumps']
tokenizer.tokenize('Stand on your head!');
===> ['Stand', '', '', 'on', '', '', 'your', '', '', 'head!']
TabTokenizer tokenizer = new TabTokenizer();
tokenizer.tokenize('brown\tfox\tjumps');
===> ['brown', 'fox', 'jumps']
http://blog.codinghorror.com/markov-and-you/
$ chmod u+x tool/run_tests.sh
$ ./tool/run_tests.sh