Skip to content
View XD111ds's full-sized avatar

Block or report XD111ds

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

368 10 Updated Mar 23, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,463 32,628 Updated Mar 27, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,145 244 Updated Mar 10, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,991 668 Updated Mar 26, 2026
Python 143 4 Updated Feb 13, 2026

The CS61A course of UC Berkeley. A python version of SICP.

Python 4 Updated Jan 23, 2025

A basic framework for testing everything in a maching learning model.

Python 2 Updated Mar 4, 2025

Official implementation of FedSub: An efficient subspace algorithm for federated learning on heterogeneous data.

Python 2 Updated Dec 11, 2025

Lightweight data loader with zero-padding sentence packing for LLM training.

Python 6 Updated Nov 11, 2025

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 537 240 Updated Mar 27, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,775 362 Updated Mar 26, 2026

[CVM'26]BoCoR-Seg: Bidirectional Co-Refinement Framework for High-Resolution Remote Sensing Image Segmentation

Python 8 Updated Feb 27, 2026

The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".

Python 22 1 Updated Sep 1, 2022
Jupyter Notebook 23 3 Updated Aug 1, 2024

The source code of Paper "PathQG: Neural Question Generation from Facts".

Python 23 3 Updated Jan 4, 2021

The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text".

Python 48 9 Updated Mar 2, 2023

Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling".

Python 45 3 Updated Jan 21, 2026

A paper list of Awesome Latent Space.

407 19 Updated Mar 26, 2026

[NeurIPS 2025 Spotlight] SparseMVC: Probing Cross-view Sparsity Variations for Multi-view Clustering [Pytorch repository]

Python 42 2 Updated Jan 7, 2026

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 256 18 Updated Aug 2, 2025

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 129 7 Updated Jan 30, 2026

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

77,402 8,949 Updated Feb 5, 2026

A cross-platform bilibili toolbox. 跨平台哔哩哔哩工具箱,支持下载视频、番剧等等各类资源

Rust 4,872 316 Updated Feb 7, 2026

An ASR (Automatic Speech Recognition) adversarial attack repository.

Jupyter Notebook 39 2 Updated Nov 7, 2023

LLM inference in C/C++

C++ 99,505 15,863 Updated Mar 27, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,851 2,228 Updated Mar 26, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,129 8,434 Updated Mar 27, 2026

Enhanced Deep Image Prior for Unsupervised Hyperspectral Image Super-resolution, TGRS. (PyTorch)

Python 34 1 Updated Sep 24, 2025

A repo recording: How I use UNet to solve problems

Jupyter Notebook 23 2 Updated Nov 6, 2024

🚀 Cross attention map tools for huggingface/diffusers

Python 398 28 Updated Feb 2, 2026
Next