- Los Angeles, California, United States
- https://www.linkedin.com/in/jiajieyan/
-
Machine-Learning-Plate Public
Various machine learning algorithm implementation tastes made of Python and Numpy. Enjoy!
-
Jiayan Public
甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon construction, tokenizing, POS tagging, sentence segmentation a…
-
Classical-Chinese Public
Forked from BangBOOM/Classical-Chinese古文现代文翻译平行语料库
-
-
yugioh-card-data Public
A Yu-Gi-Oh card collector based on YGOPRO ADS game data.
-
chinese-rhymer Public
轻量中文押韵神器,100%绝对可用,傻瓜式命令行操作,秒速实现烈焰单押,闪电双押,龙卷三押以及海啸式四押,目前版本 v0.2.6。Search for rhymes for Chinese words, with 1, 2, 3 and 4 characters, released on PyPI with current version of 0.2.6.
-
Java implementation of various data structures and algorithms.
Java GNU General Public License v3.0 UpdatedMay 25, 2018 -
RelationExtraction Public
Implementation of relation extraction between entities in texts, feature engineering with Maximum Entropy template, provided by Mallet.
-
nlp-datasets Public
Forked from niderhoff/nlp-datasetsAlphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
UpdatedMar 18, 2018 -
machine_learning_basics Public
Forked from zotroneneis/machine_learning_basicsPlain python implementations of basic machine learning algorithms
-
-
LeetCode-Cracking Public
Fun with LeetCode puzzles in Java.
Java Apache License 2.0 UpdatedFeb 24, 2018 -
Machine-Learning-Tutorials Public
Forked from ujjwalkarn/Machine-Learning-Tutorialsmachine learning and deep learning tutorials, articles and other resources
-
-
Convolutional-Neural-Network Public
A CNN model performs discourse sense relation between 2 sentences, built with TensorFlow.
-
Operating-Systems Public
Implementation of multi-threading, synchronization and various operating system theories in Java.
-
Information-Retrieval Public
A class to perform various information retrieval techniques. TF-IDF algorithm includes keywords extraction, documents searching by keyword list, similar documents searching and text summarization.
Python Apache License 2.0 UpdatedDec 3, 2017 -
Minimum-Edit-Distance Public
A Python implementation of minimum edit distance, reports minimum cost and use backpointers to alignment between source and target strings.
Python Apache License 2.0 UpdatedNov 22, 2017 -
Hidden-Markov-Model Public
An implementation of HMM with Numpy matrices, Viterbi, Forward, Backward, EM algorithms and Baum-Welch (forward-backward) algorithm involved.
-
Maximum-Entropy-Model Public
An implementation of MaxEnt, with Mini-batch Stochastic Gradient approach to get the optimized parameters.
-
Naive-Bayes-Model Public
A classic naive bayes classifier with add-0.01 smoothing, both Multinomial and Bernoulli are available.
Python GNU General Public License v3.0 UpdatedNov 1, 2017 -
Fun implementations of advanced programming with OOD.
Java MIT License UpdatedOct 30, 2017 -
Plagiarization-Sensor Public
A detector to find all longest repeated language segments among large volumes of texts, multiprocessing is applied.
-
chinese-poetry Public
Forked from chinese-poetry/chinese-poetry最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
-
Courseworks for 114 spring Foundamentals of Computational Linguistics
Python MIT License UpdatedFeb 28, 2017 -
Dig out admission information about American CS master programs to form final query responses.
-
Callibresubmissions Public
Forked from morpheus384/CallibresubmissionsCallibre Submissions
UpdatedApr 23, 2012