boluoyu

4 followers · 4 following

Stars

JiangHaoPG11 / LGSID

This repository is for the paper "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".

Python 17 2 Updated Nov 18, 2025

xai-org / x-algorithm

Algorithm powering the For You feed on X

Rust 16,328 2,820 Updated Jan 20, 2026

deepseek-ai / Engram

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,287 315 Updated Jan 14, 2026

selous123 / al_sid

[PyTorch] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 207 20 Updated Feb 9, 2026

facebookresearch / memory

Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense f…

Python 376 32 Updated Dec 12, 2024

IAAR-Shanghai / Awesome-AI-Memory

Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory is a…

Python 750 62 Updated Apr 18, 2026

zjunlp / LightMem

[ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation

Python 761 68 Updated Apr 3, 2026

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch

Python 1,949 204 Updated Feb 9, 2026

mlfoundations / dclm

DataComp for Language Models

HTML 1,435 131 Updated Sep 9, 2025

fabienfrfr / tptt

😊 TPTT: Transforming Pretrained Transformers into Titans

Python 62 Updated Nov 24, 2025

AkaliKong / MiniOneRec

Minimal reproduction of OneRec

Python 1,457 205 Updated Mar 31, 2026

Lauorie / DFT

Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629

Python 23 2 Updated Oct 14, 2025

zhuchichi56 / ASFT

[ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”

Jupyter Notebook 37 4 Updated Feb 12, 2026

DA-southampton / RedGPT

70 17 Updated Apr 14, 2023

kmccleary3301 / nested_learning

A Reproduction of GDM's Nested Learning Paper

Python 674 97 Updated Feb 25, 2026

aakaran / reasoning-with-sampling

Python 418 56 Updated Nov 7, 2025

thu-coai / MiniPLM

[ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models

Python 75 9 Updated Nov 23, 2024

tensorgi / TPA

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 448 37 Updated Jan 26, 2026

SwarmClone / MiniLM2

The core of the project: the large language model

Python 7 3 Updated Feb 17, 2026

jingyaogong / minimind

🚀🚀 [LLM] Train a small 64M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 47,239 5,889 Updated Apr 10, 2026

iflytek / MiniRBT

MiniRBT (a series of small Chinese pre-trained models)

Python 302 19 Updated Jul 15, 2025

charent / Phi2-mini-Chinese

Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for langchain to load a local knowledge base for retrieval-augmented generation (RAG).

Jupyter Notebook 589 67 Updated Jul 11, 2024

anonx3247 / llm-trm

Merge of LLMs and TRMs

TeX 1 Updated Feb 4, 2026

rkomu / WeatherForcastLLMUsingERA5_HRM_Llama

Jupyter Notebook 1 Updated Nov 2, 2025

codewithdark-git / TTV-HRM

Text-to-Video generation model using a Hierarchical Reasoning Model (HRM) optimized for T4 GPUs.

Python 6 1 Updated Apr 9, 2026

SamsungSAILMontreal / TinyRecursiveModels

Python 6,455 1,005 Updated Apr 1, 2026

sapientinc / HRM

Hierarchical Reasoning Model Official Release

Python 12,383 1,801 Updated Mar 31, 2026

sionic-ai / muvera-py

Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)

Python 411 26 Updated Dec 10, 2025

Linear95 / bert-intent-slot-detector

BERT-based intent and slots detector for chatbots.

Python 239 32 Updated Feb 21, 2025

srush / llama2.rs

A fast llama2 decoder in pure Rust.

Rust 1,062 57 Updated Nov 30, 2023