Skip to content
View yangfly's full-sized avatar

Block or report yangfly

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official repository for our paper "Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning" published in ICRL 2026.

Python 32 1 Updated Apr 11, 2026

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,382 83 Updated May 16, 2025

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Python 220 13 Updated Feb 28, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,415 371 Updated May 15, 2026
Python 54 2 Updated Sep 11, 2024

This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be …

Jupyter Notebook 63 15 Updated Apr 21, 2026

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,911 234 Updated May 16, 2026

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Makefile 181 6 Updated Oct 27, 2023

NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Python 561 80 Updated Dec 4, 2025

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Python 270 19 Updated Sep 12, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,659 503 Updated Jul 18, 2024

Making large AI models cheaper, faster and more accessible

Python 41,382 4,512 Updated May 11, 2026

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 737 129 Updated May 2, 2026

Yet another elegant Wiz Note Client, which was built with Quasar UI Framework and based on Electron.

JavaScript 368 35 Updated Mar 5, 2023

C++ Parallel Computing and Asynchronous Networking Framework

C++ 14,350 2,563 Updated May 9, 2026

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,548 207 Updated Jul 18, 2025

Fast and Accurate ML in 3 Lines of Code

Python 10,341 1,150 Updated May 15, 2026

DezhouKV的C++版本实现(C++ implementation of DezhouKV database)

Roff 3 2 Updated Mar 21, 2018

Python code for "Probabilistic Machine learning" book by Kevin Murphy

Jupyter Notebook 7,073 1,625 Updated Feb 26, 2026

VS Code in the browser

TypeScript 77,593 6,677 Updated May 8, 2026

Caffe with contrib applications out of box.

Jupyter Notebook 1 1 Updated Aug 2, 2019

Code for 3rd Place Solution in Face Anti-spoofing Attack Detection Challenge @ CVPR2019,model only 0.35M!!! 1.88ms(CPU)

Python 953 280 Updated Oct 6, 2020

《Java 程序员眼中的 Linux》

Shell 8,724 2,445 Updated Jun 11, 2022

Visualizer for neural network, deep learning and machine learning models

JavaScript 32,898 3,118 Updated May 16, 2026

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

276,195 20,871 Updated Aug 22, 2025

This is a mxnet version implementation of SSR-Net for age and gender Estimation

Python 113 40 Updated Oct 11, 2018

Use TensorRT API to implement Caffe-SSD, SSD(channel pruning), Mobilenet-SSD

C++ 250 83 Updated Oct 23, 2018

A hyperparameter optimization framework

Python 14,174 1,326 Updated May 15, 2026

a casual work about retraining to optimize mtcnn Pnet and ONet. it can achieve 100+fps on CPU with minSize 60 (1920x1080) on intel i7 6700k

C++ 203 62 Updated Jul 20, 2018
Next