Starred repositories
Towards Unifying Sequence Modeling and Feature Interaction for Large-scale Recommendation
Release-Time Verification and Repair for Reliable Recommendation with Large Language Models
A Multi-Agent Recommendation & Optimization Framework for Large-Scale HR Semantic Modeling — Qwen3 family + bidirectional matching + multi-agent coordination. Undergraduate thesis (2026).
This is the implementation of the paper "LLM-ISR: Interest-based Sequential Recommendation via Large Language Models for Long-Tail Users".
Generative recommendation based on semantic IDs and large models
Reciprocal Collaborative and Semantic Fusion with Calibration via Large Language Models for Recommendation Reranking
An index for papers on large language model agents for recommendation and search.
NeurIPS 2025 | P-Law: entropy-guided quantitative scaling law prediction for large recommendation models.
How Stable, Consistent, and Intention-Sensitive Are Life-Advice Recommendations Produced by Modern Large Language Models?
An easy to set up and use "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model" using Docker image on GPU as well as CPU.
Simulator and scenario evaluation notebooks for "Diagnosing ML-Driven Queue Systems in Production" — ICDM 2026 Applied Track
YaFF is a high-performance C++ serialization library that provides a zero-copy wire format for the Protobuf ecosystem.
A PyTorch framework for training transformer language models with Mixture of Experts (MoE) architecture support, Mixture of Depths (MoD), and DeepSpeed integration. Implements models from 70M to 30…
RecStore: High-performance parameter storage for large-scale recommendation models, unifying heterogeneous memory as a scalable embedding pool.
Full fine-tuning project for Qwen3-VL-2B with HF parquet-to-JSON conversion and post-finetune evaluation
Experimental GPT-2 scale (~124M param) LLM trained from scratch. Trained on 22B tokens od Cosmopedia Dataset. Includes full training pipeline, with SFT FineTuning and log analysis tools with backen…
TOML-driven diffusion training on Linux: DeepSpeed, LoRA/LoKr/full finetune (SDXL, Cosmos Predict2), optional web UI — rengu CLI
An easy-to-configure and extensible veRL extension for agent RL training with skill co-evolution.
DPP + Slide Window + GPU. DDP多样性推荐算法,滑窗+GPU加速。
Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x
This Repository includes recent papers (RecSys, SIGIR, WWW, etc.) related to the Recommender Systems