Stars
Making large AI models cheaper, faster and more accessible
Chinese Pre-Trained Language Models (CPM-LM) Version-I
[验证码识别-训练] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to realize verification code identification. This project is only for training the model.
Headless chrome/chromium automation library (unofficial port of puppeteer)
Play couplet with seq2seq model. 用深度学习对对联。
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
开源微信爬虫:爬取公众号所有 文章、阅读量、点赞量和评论内容。易部署。持续维护!!!
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
各种nlp 框架(自然语言处理)集成以及使用包括 word2vec nltk textblob crf++ 等