hanzi-pinyin
Here are 6 public repositories matching this topic...
A NLP package for Chinese text:Preprocessing, Tokenization, Chinese Fonts, Word Embeddings, Text Similarity and Sentiment Analysis 轻量级中文自然语言处理软件包
-
Updated
Nov 3, 2024 - Python
将汉字转为拼音。基于 luna_pinyin\pypinyin\clover-pinyin 数据。(共提供50W左右拼音数据)。基于 百度汉语数据(共抓取35W词组拼音数据) 。基于 jieba分词工具。
-
Updated
Oct 20, 2021 - Python
Python sort key methods for UTF-8 encoded Chinese character strings based on either Pinyin (pronunciation) or Bihua (strokes). 为UTF-8编码下的中文文字串提供类似英文字符串的排序功能:可分为以拼音排序和笔画数排序。一个包括4万多中文文字的数据文件为各种简体繁体的文本分析提供前所未有的支持。
-
Updated
Jul 26, 2021 - Python
Create a personal REST API to translate Hanzi Chinese characters to Pinyin
-
Updated
Dec 15, 2023 - Python
Improve this page
Add a description, image, and links to the hanzi-pinyin topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hanzi-pinyin topic, visit your repo's landing page and select "manage topics."