- GSAI@RUC
- Beijing
- https://kid-22.github.io/
Lists (9)
Stars
Paper list about hyperbolic embedding, hyperbolic models, and hyperbolic applications
Large language models for document ranking.
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
Paper list for Efficient Reasoning.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Official Repo for Open-Reasoner-Zero
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
A Comprehensive Survey on Long Context Language Modeling
Toolkit for implementing fairness- and diversity-aware algorithms in Information Retrieval
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods".
A curated (most recent) list of resources for Learning with Noisy Labels
TorchTune recipes for ranking using RM: ORPO recipe (single GPU + DDP) + DDP for DPO (to avoid existing bug in FSDP) + ranking evals