Skip to content
View mao1207's full-sized avatar

Organizations

@Tongji-Blockchain-Association

Block or report mao1207

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SWE-agent that can solve Unreal Enging coding problem

Python 1 Updated Dec 22, 2025

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Python 239 21 Updated Dec 23, 2025

OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.

Python 333 47 Updated Nov 14, 2025

Visualizing the attention of vision-language models

Jupyter Notebook 268 22 Updated Feb 28, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,434 1,689 Updated Sep 24, 2025

The code and data for the paper "Evaluating Model Perception of Color Illusions in Photorealistic Scenes"

Python 1 Updated Dec 29, 2024

Biomedical Visual Instruction Tuning with Clinician Preference Alignment

Python 8 2 Updated Nov 18, 2024
Jupyter Notebook 3 Updated Dec 17, 2024

Optimus: the first large-scale pre-trained VAE language model

Python 391 41 Updated Sep 6, 2023

Diffusion-LM

Python 1,210 158 Updated Aug 8, 2024

utilities for decoding deep representations (like sentence embeddings) back to text

Python 1,029 111 Updated Aug 5, 2025

Access a database of word frequencies, in various natural languages.

Python 1,591 109 Updated Jan 4, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,030 395 Updated Dec 24, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,482 980 Updated Aug 12, 2024

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12390 [ECCV 2024]

Python 152 8 Updated Sep 27, 2025

Evaluation of VLMs

Python 1 Updated May 20, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

4,246 250 Updated Dec 9, 2025

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

2,978 135 Updated Dec 20, 2025

Reproduce the fast matrix multiplication method based on Multiplying Matrices Without Multiplying and Bolt: Accelerated Data Mining with Fast Vector Compression , while doing the speedup of the und…

Python 4 1 Updated Jul 6, 2023
Python 1,838 61 Updated Jun 28, 2024

[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Python 91 3 Updated Apr 30, 2024
Jupyter Notebook 230 30 Updated Dec 18, 2023

This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"

Jupyter Notebook 22 2 Updated Apr 27, 2025

纪念碑谷 WebGL版

JavaScript 23 11 Updated May 31, 2018

Monument Valley mechanism in Unity3D

C# 172 59 Updated May 9, 2014

Code for our CVPR'2024 paper "GauHuman: Articulated Gaussian Splatting from Monocular Human Videos"

Python 403 36 Updated Jul 4, 2024

Algorithm for converting a heterogeneous graph to a homogeneous graph

Python 4 Updated Mar 31, 2024

Code for "The One Where They Reconstructed 3D Humans and Environments in TV shows" appearing in ECCV 2022.

Jupyter Notebook 266 17 Updated Sep 9, 2024
Next