ValueCell is a community-driven, multi-agent platform for financial applications.

Python 10,800 1,808 Updated Mar 9, 2026

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

Python 590 44 Updated Jun 10, 2026

⚡ Open nof1.ai | Autonomous AI Trading Agent (autonomous AI trading system)

TypeScript 670 199 Updated May 15, 2026

An open-source AI trading platform for real markets

TypeScript 803 192 Updated Apr 1, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,383 1,484 Updated Feb 27, 2026

Easy Data Preparation with the latest LLM-based Operators and Pipelines.

Python 4,867 548 Updated Jun 10, 2026

A Survey of Multimodal Retrieval-Augmented Generation

20 2 Updated Nov 3, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,933 550 Updated Mar 31, 2026

T2Ranking: A large-scale Chinese benchmark for passage ranking.

Python 163 9 Updated Jul 3, 2023

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,797 250 Updated Dec 12, 2023

Pre-trained mixed Chinese-English BERT model

Python 1 Updated Feb 6, 2023

:paper: Essay dataset - Part 1

13 5 Updated Apr 9, 2020

ColBERT for dense retrieval, including a multi-view version, with DuReader-retrieval as an example

Python 6 Updated Jun 16, 2022

An Open-Source Package for Information Retrieval

Python 167 20 Updated May 25, 2026

A Semantic Search Engine Built on the arXiv Dataset from Kaggle.

Jupyter Notebook 7 2 Updated May 7, 2021

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing…

MDX 25,559 2,847 Updated Jun 12, 2026

Codebase for RetroMAE and beyond.

Python 274 24 Updated Jun 7, 2024

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 779 68 Updated Apr 7, 2023

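Fengshenbang-LM (the Fengshenbang large model) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.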

Python 4,128 377 Updated Jun 8, 2026

Awesome Machine Unlearning (A Survey of Machine Unlearning)

Jupyter Notebook 955 74 Updated May 7, 2026

Source code and dataset for the ACL 2022 Findings paper "LEVEN: A Large-Scale Chinese Legal Event Detection Dataset"

Python 123 26 Updated Aug 4, 2023

BERT-PLI applied to LeCaRD

Python 18 4 Updated Nov 14, 2021
Jupyter Notebook 4 1 Updated Jun 7, 2022

Source code and checkpoints for legal pre-trained language models.

Python 194 25 Updated May 9, 2021

A Python package that extracts tables from a web page and processes them into high-quality tables

Python 45 2 Updated Aug 30, 2022
Python 379 51 Updated Oct 9, 2023

A Chinese legal case retrieval dataset.

Python 166 24 Updated Jan 2, 2024

KDD'2022: Towards Representation Alignment and Uniformity in Collaborative Filtering

Python 71 6 Updated Oct 27, 2022

Must-read papers on prompt-based tuning for pre-trained language models.

4,315 391 Updated Jul 17, 2023

Towards Explainable Artificial Intelligence

5 Updated Jun 20, 2020