PPO algorithm implementation

Python 34 2 Updated Jun 5, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,100 790 Updated Oct 9, 2025

🌟100+ original LLM / RL principle diagrams📚, presented by the author of the book 《大模型算法》!💥 (100+ LLM/RL Algorithm Maps)

Python 1,556 164 Updated Oct 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,122 2,517 Updated Oct 9, 2025

SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (CVPR'25)

Python 13 1 Updated Jul 1, 2025

Pytorch implementation of 'Clothes-Changing Person Re-identification with RGB Modality Only. In CVPR, 2022.'

Python 156 21 Updated Aug 23, 2022

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,613 253 Updated Feb 13, 2025

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12,646 2,126 Updated Sep 6, 2025

from MHA, MQA, GQA to MLA by 苏剑林, with code

Jupyter Notebook 28 4 Updated Feb 19, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,108 3,408 Updated Sep 25, 2025

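RAG interest group: a RAG application written entirely by hand. Most of LangChain's libraries are convenient, but you may not understand the principles behind them, so the code exposes the basic algorithms as much as possible, with the focus on understanding how RAG works.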

Jupyter Notebook 235 13 Updated Sep 25, 2024

A collection of AWESOME things about mixture-of-experts

1,211 84 Updated Dec 8, 2024

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 1,179 110 Updated Apr 19, 2024

[NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue Liang*, Zhiwen Fan*, Rishov Sarkar, Ziyu Jiang, Tianlong Che…

Python 131 18 Updated Nov 30, 2022

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,941 529 Updated Sep 25, 2024

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Jupyter Notebook 750 88 Updated Oct 30, 2024
Python 33 5 Updated Dec 10, 2024

【IEEE TITS2025】Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification

Python 14 1 Updated Dec 30, 2024

【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

Python 80 3 Updated May 13, 2025

Cross Visual Prompt Tuning [ICCV 2025]

Python 10 1 Updated Aug 3, 2025

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Python 778 64 Updated Jul 24, 2023

BasicIRSTD toolbox

Python 241 37 Updated Dec 20, 2024

Implementation of Patch-wise Adversarial Regularization from "Learning Robust Global Representations by Penalizing Local Predictive Power"

Python 18 1 Updated Oct 27, 2019

Official Implementation for Flare-Aware Cross-modal Enhancement for Multi-spectral Vehicle ReID

Python 11 1 Updated Dec 19, 2024

[NeurIPS 2024] RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification

Python 2 Updated May 22, 2025
Python 6 Updated Oct 8, 2023

Welcome to the Awesome Multi-Modal Object Re-Identification Repository! This repository is dedicated to curating and sharing the latest methods, datasets, and resources focused specifically on the …

78 5 Updated Aug 10, 2025

[CVPR 2023] Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification

Python 131 12 Updated Dec 23, 2023

[CVPR 2024] Day-Night Cross-domain Vehicle Re-identification

Python 45 1 Updated Oct 28, 2024