Skip to content
View wanli0815's full-sized avatar

Block or report wanli0815

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for "Learning to summarize from human feedback"

Python 1,051 153 Updated Sep 5, 2023

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,374 171 Updated Jul 25, 2023

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 67,095 7,143 Updated Nov 5, 2025

🦜🔗 The platform for reliable agents.

Python 118,909 19,582 Updated Nov 5, 2025

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 54,997 7,334 Updated May 14, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 10,693 1,092 Updated Apr 30, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,722 483 Updated Jan 8, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,439 1,285 Updated Oct 6, 2025

The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,955 479 Updated Sep 12, 2025

A technical report on convolution arithmetic in the context of deep learning

TeX 14,534 2,306 Updated Jun 8, 2023

Utilities intended for use with Llama models.

Python 7,325 1,262 Updated Oct 10, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,795 7,473 Updated Nov 5, 2025

Examples and guides for using the OpenAI API

Jupyter Notebook 69,032 11,539 Updated Nov 4, 2025

The official Python library for the OpenAI API

Python 29,171 4,396 Updated Nov 4, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,962 1,256 Updated Oct 27, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,823 135 Updated Jan 17, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12,914 2,146 Updated Sep 6, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 12,690 1,204 Updated Oct 28, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,973 565 Updated Feb 26, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,369 6,130 Updated Sep 18, 2024

This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube

Python 70 25 Updated Apr 12, 2017

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,408 519 Updated Oct 8, 2025

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Python 936 114 Updated Oct 6, 2022

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Python 950 162 Updated Apr 26, 2024

Code for ALBEF: a new vision-language pre-training method

Python 1,728 220 Updated Sep 20, 2022

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,648 1,640 Updated Sep 30, 2025
Next