WenBiming

Starred repositories

🚀🚀 [LLM] Train a 26M-parameter GPT completely from scratch in just 2 hours!

Python 36,000 4,248 Updated Dec 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,377 7,805 Updated Dec 23, 2025

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python 4,481 651 Updated Aug 30, 2025

My solutions to the book "A Collection of Data Science Take-Home Challenges"

Jupyter Notebook 985 528 Updated Oct 31, 2022

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 11,449 1,155 Updated Apr 30, 2025

Complete code for the 2019 Tencent Advertising Algorithm Competition (champion)

Python 637 212 Updated Jul 20, 2020

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 50,888 4,222 Updated Dec 23, 2025
Python 15 Updated Jul 13, 2025

GRID: Generative Recommendation with Semantic IDs

Python 507 90 Updated Oct 15, 2025

An implementation of a deep learning recommendation model (DLRM)

Python 4,006 870 Updated Oct 2, 2025

[WSDM'2024 Oral] "SSLRec: A Self-Supervised Learning Framework for Recommendation"

Python 552 76 Updated Mar 21, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,626 324 Updated Dec 19, 2025

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Python 1,234 155 Updated Dec 4, 2025

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,336 217 Updated Jun 16, 2025
Python 278 40 Updated Aug 28, 2024

A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Python 2,023 259 Updated Dec 4, 2025

[PyTorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 662 92 Updated Sep 22, 2025

A list of awesome papers and resources on recommender systems with large language models (LLMs).

2,168 156 Updated Mar 17, 2025

Easy-to-use, modular, and extensible package of deep-learning-based CTR models.

Python 7,966 2,239 Updated Aug 9, 2024

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 12,547 1,225 Updated Nov 29, 2025

A self-hosted dashboard that puts all your feeds in one place

Go 30,540 1,139 Updated Dec 10, 2025

An alternative to the immich-CLI command that doesn't depend on a Node.js installation. It tries its best to import Google Photos takeout archives.

Go 4,982 170 Updated Dec 20, 2025

Golang IM server

Go 1,980 608 Updated Nov 3, 2025

MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.

Go 59,386 6,811 Updated Dec 3, 2025

Proxy UDP traffic over a TCP stream

Rust 518 84 Updated Dec 13, 2025

A Counter-Strike 2 Demo Parser for Go (demoinfo)

Go 901 122 Updated Dec 18, 2025

Fast, secure, efficient backup program

Go 31,355 1,683 Updated Dec 3, 2025

Open Source Continuous File Synchronization

Go 78,452 4,861 Updated Dec 23, 2025

A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.

Rust 55,073 2,310 Updated Dec 22, 2025