Skip to content
View xlianghang's full-sized avatar

Block or report xlianghang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

NLP

9 repositories

TensorFlow code and pre-trained models for BERT

Python 39,752 9,709 Updated Jul 23, 2024

结巴中文分词

Python 34,657 6,733 Updated Aug 21, 2024

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,920 423 Updated Nov 30, 2025

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Python 67 7 Updated Jul 14, 2025

“法阿”中文分词:做最好的 Python 法律中文分词组件

Python 34 10 Updated Dec 15, 2020

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 32,971 4,633 Updated Nov 27, 2025

💫 Models for the spaCy Natural Language Processing (NLP) library

Python 1,823 312 Updated May 27, 2025

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 35,999 10,882 Updated Nov 15, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,755 622 Updated Feb 21, 2025