Skip to content
View kaynezhang's full-sized avatar

Block or report kaynezhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LangChain 的中文入门教程

8,908 705 Updated Apr 19, 2025

MindSpore online courses: Step into LLM

Jupyter Notebook 482 127 Updated Dec 22, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,869 32,752 Updated Apr 6, 2026

Fess is very powerful and easily deployable Enterprise Search Server.

Java 1,104 172 Updated Apr 5, 2026

ACHE is a web crawler for domain-specific search.

Java 480 137 Updated Aug 31, 2025
Java 2 1 Updated Apr 17, 2017

专注于解决自然语言处理领域的几个核心问题:词法分析,句法分析,语义分析,语种检测,信息抽取,文本聚类和文本分类. 为相关领域的研发人员提供完整的通用设计与参考实现. 涵盖了多种自然语言处理算法,适配了多个自然语言处理框架. 兼容Lucene/Solr/ElasticSearch插件.

Java 120 29 Updated Apr 12, 2023

thulac analysis plugin for elasticsearch

Java 192 27 Updated Sep 18, 2020

BosonNLP Analysis for ElasticSearch

Java 105 23 Updated Apr 17, 2017

Elasticsearch with T5/Bert/Other models provided by huggingface Transfomers.

Python 14 3 Updated Jun 12, 2023

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 10,194 1,391 Updated Jul 15, 2025

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Python 6,702 985 Updated Nov 5, 2022

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 79,797 15,156 Updated May 10, 2024

100+ Chinese Word Vectors 上百种预训练中文词向量

Python 12,197 2,325 Updated Oct 30, 2023

中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。

4,278 1,010 Updated Nov 9, 2025

It's an image similarity search Engine built on top of Lire. The images can be filtered using a query by keywords [support Chinese]and are afterwards optically ranked. This engine provides an easy …

Java 9 3 Updated Oct 17, 2023
Java 3 4 Updated Jun 4, 2021

ChainSQL: the collaboration of blockchain and database

C++ 213 75 Updated Jan 12, 2023

WebViewer UI built in React

JavaScript 468 389 Updated Apr 2, 2026

Small python-gtk application, which helps the user to merge or split PDF documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface.

Python 5,322 291 Updated Apr 5, 2026

PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages

Java 4,333 393 Updated Apr 6, 2026

公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。

1,292 369 Updated Mar 27, 2024

An open source engine for license management on the Java Virtual Machine.

Java 391 78 Updated Nov 16, 2022

A reverse image search engine powered by elastic search and tensorflow

Python 328 51 Updated Apr 3, 2021

Simple image search engine

Python 781 242 Updated Nov 14, 2021

🎇 Quickly search over billions of images

Python 2,982 404 Updated Dec 6, 2022

Face search engine

Python 200 42 Updated Sep 16, 2016

Convert Word documents to simple and clean HTML

Java 291 58 Updated Mar 13, 2026

Instant Message

Java 3 2 Updated Jun 23, 2021

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典

Java 6,542 2,288 Updated Nov 19, 2023
Next