Stars
Textbook on reinforcement learning from human feedback
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
[AAAI-2026] Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning
Multimodal Whole Slide Foundation Model for Pathology - Nature Medicine
A Curated List of Awesome Works in Computational Pathology, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in Digital Pathology.
Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions
LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratc…
Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
Building on Anthropic's Circuit Tracer, Neuronpedia, Ameisen et al. (2025) and Lindsey et al. (2025), we attempt to extend the paradigm with adaptive context engineering to enable recursive self-in…
Code for "Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models".
Lightweight coding agent that runs in your terminal
Minimal Implementation of a D3PM in pytorch
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
The official github repo for "Diffusion Language Models are Super Data Learners".
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
😼 优雅地使用基于 clash/mihomo 的代理环境
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)