XD111ds

Follow

XD111ds

Follow

10 followers · 9 following

Achievements

Achievements

Stars

xhguo7 / PAPO-Eval

Official Evaluation Module For PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning

Python 8 2 Updated Jan 30, 2026

tongjingqi / AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

381 10 Updated Mar 29, 2026

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,745 32,722 Updated Apr 3, 2026

duoan / TorchCode

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,364 274 Updated Mar 27, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,107 688 Updated Apr 3, 2026

AlenjandroWang / UniReason

Python 144 4 Updated Feb 13, 2026

JimmyAwoe / CS61A_24Spring

The CS61A course of UC Berkeley. A python version of SICP.

Python 4 Updated Jan 23, 2025

JimmyAwoe / MLFrameworkForTesting

A basic framework for testing everything in a maching learning model.

Python 2 Updated Mar 4, 2025

JimmyAwoe / FedSub

Official implementation of FedSub: An efficient subspace algorithm for federated learning on heterogeneous data.

Python 2 Updated Dec 11, 2025

JimmyAwoe / PackTron

Lightweight data loader with zero-padding sentence packing for LLM training.

Python 6 Updated Nov 11, 2025

NVIDIA-NeMo / Megatron-Bridge

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 547 247 Updated Apr 3, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,801 365 Updated Mar 26, 2026

ShiJinghao566 / BiCoR-Seg

[CVM'26]BoCoR-Seg: Bidirectional Co-Refinement Framework for High-Resolution Remote Sensing Image Segmentation

Python 8 Updated Feb 27, 2026

SiyuanWangw / StepwiseQA

The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".

Python 22 1 Updated Sep 1, 2022

SiyuanWangw / ULogic

Jupyter Notebook 23 3 Updated Aug 1, 2024

SiyuanWangw / PathQG

The source code of Paper "PathQG: Neural Question Generation from Facts".

Python 23 3 Updated Jan 4, 2021

SiyuanWangw / LReasoner

The code of Paper "Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text".

Python 48 9 Updated Mar 2, 2023

XD111ds / ILVR

Implementation of "Interleaved Latent Visual Reasoning with Selective Perceptual Modeling".

Python 45 3 Updated Jan 21, 2026

YU-deep / Awesome-Latent-Space

A paper list of Awesome Latent Space.

475 20 Updated Apr 3, 2026

cleste-pome / SparseMVC

[NeurIPS 2025 Spotlight] SparseMVC: Probing Cross-view Sparsity Variations for Multi-view Clustering [Pytorch repository]

Python 43 2 Updated Mar 30, 2026

UMass-Embodied-AGI / Mirage

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 260 18 Updated Aug 2, 2025

multimodal-reasoning-lab / Bagel-Zebra-CoT

https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT

Python 131 7 Updated Jan 30, 2026

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

77,714 9,000 Updated Feb 5, 2026

btjawa / BiliTools

A cross-platform bilibili toolbox. 跨平台哔哩哔哩工具箱，支持下载视频、番剧等等各类资源

Rust 4,904 316 Updated Feb 7, 2026

hammaad2002 / ASRAdversarialAttacks

An ASR (Automatic Speech Recognition) adversarial attack repository.

Jupyter Notebook 40 2 Updated Nov 7, 2023

ggml-org / llama.cpp

LLM inference in C/C++

C++ 101,051 16,277 Updated Apr 3, 2026

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,879 2,233 Updated Apr 2, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,463 8,449 Updated Apr 1, 2026

JiaxinLiCAS / EDIP-Net_TGRS

Enhanced Deep Image Prior for Unsupervised Hyperspectral Image Super-resolution, TGRS. (PyTorch)

Python 34 1 Updated Sep 24, 2025

lgy112112 / UNethology

A repo recording: How I use UNet to solve problems

Jupyter Notebook 23 2 Updated Nov 6, 2024