
qData is an open-source data governance and data development platform that integrates ETL, data development, metadata management, data quality, data assets, API services, and AI-powered data Q&A.

PLpgSQL 472 110 Updated Jun 12, 2026

Toutiao (今日头条) Chinese news text classification dataset

Python 407 69 Updated May 19, 2021

Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance

C++ 4,000 590 Updated May 25, 2021

A high-performance MySQL proxy

Go 6,411 1,219 Updated Jun 5, 2026

Language Detection Library for Java

Java 586 162 Updated Jul 23, 2022

A curated list of the most important and useful resources about Elasticsearch: articles, videos, blogs, tips and tricks, use cases. All about Elasticsearch!

5,049 565 Updated May 7, 2025

FirePath is a Firebug extension that adds a development tool to edit, inspect and generate XPath expressions and CSS3 Selectors.

JavaScript 35 14 Updated Feb 23, 2016
JavaScript 5,616 735 Updated Feb 12, 2024

Advanced manager and monitor for Apache Tomcat, forked from Lambda Probe

Java 1,331 384 Updated Jun 12, 2026

Java fake data generator

Java 742 151 Updated May 30, 2026
Java 1 5 Updated Jul 13, 2013

Generate random User-Agent strings in Java

Java 68 42 Updated Mar 16, 2022

Utilities for processing User-Agent strings. Can be used to handle HTTP requests in real time or to analyze log files.

Java 916 401 Updated Mar 10, 2023

A cross-language remote procedure call (RPC) framework for rapid development of high-performance distributed services.

Java 5,878 1,749 Updated Nov 24, 2025

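Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing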

Python 36,397 10,916 Updated Nov 15, 2025

Alibaba interview questions, 2016

714 323 Updated Mar 16, 2016

Powerful Mindmap Editing Tool

JavaScript 3,181 914 Updated Sep 12, 2023

Web admin interface for Elasticsearch

JavaScript 2,390 324 Updated Nov 21, 2019

Java JsonPath implementation

Java 9,413 1,712 Updated Feb 22, 2026

🚌 The IK Analysis plugin integrates the Lucene IK analyzer into Elasticsearch and OpenSearch and supports customized dictionaries.

Java 17,463 3,280 Updated May 11, 2026

A simple, expressive web framework for Java. Spark has a Kotlin DSL: https://github.com/perwendel/spark-kotlin

Java 9,658 1,567 Updated Oct 8, 2023

Open Source Web Crawler for Java

Java 4,624 1,905 Updated Nov 4, 2021

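Sohu TV (搜狐视频) Redis private cloud platform: supports efficient management of multiple Redis architectures (Standalone, Sentinel, Cluster), effectively reduces the operations cost of large-scale Redis deployments, and improves resource control and utilization. The platform provides rapid setup and migration, operations management, elastic scaling, statistics and monitoring, client integration, and other features. (CacheCloud is a Redis cloud management platform. It suppor…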

HTML 8,852 2,023 Updated Jun 8, 2026

A scalable web crawler framework for Java.

Java 11,679 4,128 Updated Dec 20, 2025

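Ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed those of the open-source version of ICT. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries.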

Java 6,529 2,278 Updated Nov 19, 2023

A dashboard for ZooKeeper and Qconf

JavaScript 680 198 Updated Mar 25, 2021