Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
The code for 2020 Tencent College Algorithm Contest, and the online result ranks 1st.
DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approach and hierarchical processing.
verl: Volcano Engine Reinforcement Learning for LLMs
[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation
Fully open reproduction of DeepSeek-R1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
100+套大数据可视化炫酷大屏Html5模板;包含行业:社区、物业、政务、交通、金融银行等,全网最新、最多,最全、最酷、最炫大数据可视化模板。陆续更新中
GRID: Generative Recommendation with Semantic IDs
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
该仓库尝试整理推荐系统领域的一些经典算法模型
🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )
A Lighting Pytorch Framework for Recommendation Models (PyTorch推荐算法框架), Easy-to-use and Easy-to-extend. https://datawhalechina.github.io/torch-rechub/
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
(原创)全网最全-币圈区块链各类常用工具与相关信息资料大全-虚拟加密货币-欧易OKX币安Binace芝麻开门Gate-交易所App注册-NFT-Defi-加密钱包-比特币-新手入门教程 -持续更新
A live reading list for LLM data synthesis (Updated to July, 2025).
Our code for ICLR'25 paper "DataMan: Data Manager for Pre-training Large Language Models".