🏠
Working from home
  • Tencent
  • Shanghai, People's Republic of China


A simple yet powerful agent framework that delivers with open-source models

Python 4,510 464 Updated Mar 21, 2026

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 608 46 Updated May 8, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,480 1,002 Updated Mar 30, 2026

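Build a large language model from scratch with only basic Python; construct GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work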

Jupyter Notebook 4,090 565 Updated Mar 26, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,230 8,598 Updated Apr 12, 2026

📋 A list of open LLMs available for commercial use.

12,724 969 Updated Feb 13, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,144 940 Updated Mar 11, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,971 13,978 Updated Apr 16, 2026

Align Anything: Training All-modality Model with Feedback

Python 4,647 506 Updated Nov 27, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,939 377 Updated Mar 12, 2026

Code for Finetune like you pretrain: Improved finetuning of zero-shot vision models

Python 106 15 Updated Aug 13, 2023
Python 667 55 Updated Nov 28, 2023

Mamba SSM architecture

Python 17,999 1,698 Updated Apr 16, 2026

TaiSu (太素): a large-scale Chinese multimodal dataset (a Chinese vision-language pre-training dataset at the hundred-million scale)

Python 191 13 Updated Nov 17, 2023

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,490 117 Updated Apr 15, 2026

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Python 676 32 Updated Sep 19, 2022

Hopfield Networks is All You Need

Python 1,917 226 Updated Apr 23, 2023
Python 160 11 Updated Jun 13, 2022

A concise but complete implementation of CLIP with various experimental improvements from recent papers

Python 722 49 Updated Oct 16, 2023

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,088 585 Updated Apr 24, 2024

Chinese and English multimodal conversational language model

Python 4,164 423 Updated Aug 23, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,622 490 Updated Aug 7, 2024

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,702 1,448 Updated Jan 4, 2026

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 3,053 232 Updated Feb 9, 2026

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,875 552 Updated Mar 31, 2026

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,365 211 Updated Mar 5, 2024

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Python 1,034 109 Updated Sep 29, 2022
Python 158 9 Updated May 25, 2023

EVA Series: Visual Representation Fantasies from BAAI

Python 2,664 187 Updated Aug 1, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,200 1,105 Updated Nov 18, 2024