Skip to content
View chongyangtao's full-sized avatar

Block or report chongyangtao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 22,686 2,189 Updated Feb 2, 2026

Kode CLI — Design for post-human workflows. One unit agent for every human & computer task.

TypeScript 4,353 657 Updated Jan 23, 2026

Train transformer language models with reinforcement learning.

Python 17,388 2,495 Updated Feb 17, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,674 1,204 Updated Feb 17, 2026

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 727 123 Updated Jan 26, 2026

Crawl all your citations from Google Scholar

Python 58 11 Updated Aug 7, 2017

Ranking Google Scholar search results based on the number of citations

Python 980 184 Updated Apr 22, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

3,226 217 Updated Feb 1, 2026

🏡 GitHub Pages template for personal academic homepage

HTML 1 Updated Jul 12, 2024

ClinicRealm: Re-evaluating Large Language Models with Conventional Machine Learning for Non-Generative Clinical Prediction Tasks

Python 11 2 Updated Dec 12, 2025

Autonomous agents for everyone

TypeScript 17,524 5,415 Updated Feb 17, 2026

Some tricks of pytorch... ⭐

1,194 125 Updated Jun 20, 2024

This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners.

Python 1,316 78 Updated Feb 3, 2026

The source code of CodeS (SIGMOD 2024).

Python 195 24 Updated Nov 20, 2024

Chinese version of the Stanford's modern information retrieval slides

24 7 Updated May 21, 2022

An official implementation of Pangu-Weather

Python 1,312 233 Updated Jan 12, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,251 70 Updated Mar 9, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 70,104 8,396 Updated Jan 25, 2026

Awesome LLM for NLG Evaluation Papers

25 1 Updated Jan 23, 2024

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,864 473 Updated Sep 26, 2024

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,251 1,947 Updated Nov 19, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,090 122 Updated Jun 1, 2023

[ACL 2023] Reasoning with Language Model Prompting: A Survey

996 68 Updated May 21, 2025

Resource, Evaluation and Detection Papers for ChatGPT

456 25 Updated Mar 21, 2024

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,540 147 Updated Aug 11, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,406 4,778 Updated Jun 2, 2025

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

766 24 Updated Jul 20, 2023

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,468 433 Updated Sep 13, 2024

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,785 135 Updated Mar 13, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,082 523 Updated Jul 1, 2025
Next