Skip to content
View shenfe's full-sized avatar
🌕
I may be slow to respond.
🌕
I may be slow to respond.
  • ByteDance
  • Beijing

Block or report shenfe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,526 405 Updated Jul 16, 2023

Official implementation of Meta Prompting for AI Systems (https://arxiv.org/abs/2311.11482)

Python 301 35 Updated Dec 23, 2025

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Python 380 19 Updated Aug 25, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,985 623 Updated May 3, 2024

Awesome papers about unifying LLMs and KGs

2,599 179 Updated May 2, 2025

Resources of deep learning for mathematical reasoning (DL4MATH).

372 29 Updated Dec 22, 2023

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,211 289 Updated May 23, 2026

Safety Score for Pre-Trained Language Models

Python 97 5 Updated Oct 18, 2023

DSPy: The framework for programming—not prompting—language models

Python 35,018 2,976 Updated Jun 11, 2026

Industry leading face manipulation platform

Python 28,788 4,692 Updated Jun 14, 2026

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,687 86 Updated Mar 8, 2024

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 594 59 Updated Jun 12, 2026

Multipack distributed sampler for fast padding-free training of LLMs

Python 207 17 Updated Aug 10, 2024

Model API for GALACTICA

Jupyter Notebook 2,737 264 Updated Mar 5, 2023
Jupyter Notebook 44 6 Updated Nov 17, 2024

[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization

Python 29 4 Updated Sep 12, 2024

Pipeline for pulling and processing online language model pretraining data from the web

Python 179 22 Updated Jul 31, 2023

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

JavaScript 61,567 6,703 Updated Jun 13, 2026

🔥Highlighting the top ML papers every week.

12,557 795 Updated Jun 8, 2026

MTEB: Massive Text Embedding Benchmark

Python 3,302 625 Updated Jun 14, 2026

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 16,458 1,301 Updated Jan 18, 2025

Awesome Pruning. ✅ Curated Resources for Neural Network Pruning.

176 14 Updated May 8, 2026

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,395 240 Updated May 11, 2026

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,946 1,852 Updated Apr 19, 2026

Repository for Decomposed Prompting

Python 99 10 Updated Nov 15, 2023

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features…

Python 3,198 446 Updated Jun 2, 2026

Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.

Jupyter Notebook 692 51 Updated Oct 21, 2025

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,604 133 Updated Nov 24, 2025

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,773 144 Updated Aug 4, 2024
Next