Skip to content
View boluoyu's full-sized avatar

Block or report boluoyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

One Net Client — A cross-platform desktop client for databases, SSH/SFTP, terminals & AI, all in one place.

Rust 443 47 Updated Jun 16, 2026

Physical Phone Experiments (in-app experiment collection)

62 25 Updated Jun 2, 2026

Physical Phone Experiments

C 611 79 Updated Jun 15, 2026

This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".

Python 18 2 Updated Nov 18, 2025

Algorithm powering the For You feed on X

Rust 26,186 4,502 Updated May 15, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,455 340 Updated Jan 14, 2026

[KDD2026] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 231 22 Updated Feb 9, 2026

Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense f…

Python 377 32 Updated Dec 12, 2024

Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory 是一个 集…

Python 993 93 Updated Jun 15, 2026

[ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation

Python 931 88 Updated Jun 16, 2026

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,959 208 Updated Jun 6, 2026

DataComp for Language Models

HTML 1,447 133 Updated Sep 9, 2025

😊 TPTT: Transforming Pretrained Transformers into Titans

Python 64 2 Updated Jun 7, 2026

Minimal reproduction of OneRec

Python 1,647 235 Updated May 14, 2026

Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629

Python 23 2 Updated Oct 14, 2025

[ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”

Jupyter Notebook 44 4 Updated Jun 16, 2026

A Reproduction of GDM's Nested Learning Paper

Python 697 101 Updated Feb 25, 2026

[ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models

Python 78 9 Updated Nov 23, 2024

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (https://arxiv.org/abs/2501.06425)

Python 458 38 Updated Jun 15, 2026

计划的核心——大语言模型

Python 7 3 Updated May 13, 2026

🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h!

Python 51,848 6,665 Updated Jun 1, 2026

MiniRBT (中文小型预训练模型系列)

Python 305 20 Updated Jul 15, 2025

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 592 67 Updated Jul 11, 2024

Merge of LLMs and TRMs

TeX 1 Updated Feb 4, 2026
Jupyter Notebook 1 Updated Apr 27, 2026

Text-to-Video generation model using a Hierarchical Reasoning Model (HRM) optimized for T4 GPUs.

Python 7 1 Updated Apr 9, 2026

Hierarchical Reasoning Model Official Release

Python 12,547 1,829 Updated Mar 31, 2026
Next