Repo for Implicit Diffusion Q-Learning

Python 119 14 Updated Dec 5, 2023

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,301 1,250 Updated Aug 4, 2025

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python 2,725 506 Updated Apr 29, 2024

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Python 523 69 Updated Oct 6, 2022

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,554 2,838 Updated Dec 17, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 32,181 9,829 Updated Aug 21, 2024

[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 188 21 Updated Jul 7, 2025

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 813 119 Updated Aug 16, 2024

A programming framework for agentic AI

Python 52,610 7,995 Updated Oct 8, 2025

Official implementation of T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition

Jupyter Notebook 19 2 Updated Oct 23, 2024

[ICLR 2025 Oral🔥] SD-LoRA: Scalable Decoupled Low-Rank Adaptation for Class Incremental Learning

Python 73 11 Updated Jun 27, 2025

PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)

Jupyter Notebook 404 21 Updated Jun 30, 2025

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Full Score, Highlight).

Python 398 33 Updated Oct 10, 2025

Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".

Python 43 2 Updated Aug 4, 2025

This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

Python 45 4 Updated Aug 22, 2025

[ICML 2024] Official implementation of "LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions."

Python 10 2 Updated Nov 12, 2024

Guangneng Hu, Assoc. Prof. @ Xidian Univ, PhD at HKUST, BA/MS at Nanjing Univ.

SCSS 2 Updated Nov 21, 2025
Python 9 2 Updated May 6, 2025

Code that accompanies the paper Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning - Accepted to ICML2024

Python 15 3 Updated May 8, 2025

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 245 7 Updated Jul 1, 2024
Python 6 2 Updated Aug 7, 2024

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,663 1,022 Updated Dec 4, 2025

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Python 991 227 Updated Mar 10, 2024

Build effective agents using Model Context Protocol and simple workflow patterns

Python 7,864 792 Updated Dec 13, 2025

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,540 869 Updated Jun 10, 2025

MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding

Python 75 Updated Feb 27, 2025

The model, data and code for the visual GUI Agent SeeClick

HTML 446 26 Updated Jul 13, 2025

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 772 55 Updated Jul 5, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,616 837 Updated Dec 16, 2025