🏠
Working from home
  • Tencent
  • Shanghai, People's Republic of China

A simple yet powerful agent framework that delivers with open-source models

Python 4,496 462 Updated Mar 21, 2026

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 609 46 Updated May 8, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,453 1,001 Updated Mar 30, 2026

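Build a large language model from scratch with only basic Python knowledge; construct GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work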

Jupyter Notebook 4,063 559 Updated Mar 26, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,535 8,467 Updated Apr 5, 2026

📋 A list of open LLMs available for commercial use.

12,704 967 Updated Feb 13, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,139 938 Updated Mar 11, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,005 13,773 Updated Apr 4, 2026

Align Anything: Training All-modality Model with Feedback

Python 4,639 509 Updated Nov 27, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,926 378 Updated Mar 12, 2026

Code for Finetune like you pretrain: Improved finetuning of zero-shot vision models

Python 106 15 Updated Aug 13, 2023
Python 668 54 Updated Nov 28, 2023

Mamba SSM architecture

Python 17,864 1,679 Updated Mar 30, 2026

TaiSu (太素): a large-scale Chinese multimodal dataset (a Chinese vision-language pre-training dataset at the hundred-million scale)

Python 191 13 Updated Nov 17, 2023

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,478 116 Updated Oct 9, 2025

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Python 676 32 Updated Sep 19, 2022

Hopfield Networks is All You Need

Python 1,908 225 Updated Apr 23, 2023
Python 159 11 Updated Jun 13, 2022

A concise but complete implementation of CLIP with various experimental improvements from recent papers

Python 722 49 Updated Oct 16, 2023

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,086 585 Updated Apr 24, 2024

Chinese and English bilingual multimodal conversational language model

Python 4,167 423 Updated Aug 23, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,605 489 Updated Aug 7, 2024

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,700 1,453 Updated Jan 4, 2026

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 3,041 233 Updated Feb 9, 2026

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,852 549 Updated Mar 31, 2026

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,358 210 Updated Mar 5, 2024

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Python 1,031 109 Updated Sep 29, 2022
Python 158 9 Updated May 25, 2023

EVA Series: Visual Representation Fantasies from BAAI

Python 2,661 187 Updated Aug 1, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,193 1,103 Updated Nov 18, 2024