Skip to content
View lizaijing's full-sized avatar

Block or report lizaijing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Interactive GUI Client for Optimus-3

Python 2 Updated Jun 16, 2025

Official Implementation for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Python 26 2 Updated Jul 11, 2025

Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.

715 33 Updated Aug 31, 2025

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

1,757 171 Updated Jul 28, 2025

[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

21 2 Updated Jun 17, 2025

A compilation of the best multi-agent papers

TeX 939 73 Updated Sep 30, 2025

Paper List of Minecraft Agents

43 3 Updated Aug 15, 2025

[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Java 2 Updated Oct 21, 2024

[CVPR 2024 Workshop] The Champion Solution for Ego4D EgoSchema Challenge in CVPR 2024

Python 11 Updated Jun 25, 2024

[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Java 85 6 Updated Jun 17, 2025
5 Updated Mar 14, 2024

[ACMMM 2022 Oral] Official Implementation for Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation

Python 11 2 Updated Dec 12, 2022

Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)

Python 27 Updated Jul 11, 2024

Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)

Python 23 Updated Jul 11, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 27,360 2,695 Updated Apr 30, 2025

The official site of paper MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation

Python 201 7 Updated Sep 3, 2023

✨✨Latest Advances on Multimodal Large Language Models

16,420 1,064 Updated Sep 24, 2025

😎 curated list of awesome LMM hallucinations papers, methods & resources.

149 14 Updated Mar 23, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,895 177 Updated May 26, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,034 6,396 Updated Oct 9, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 30,978 3,784 Updated Jul 23, 2024

Official repo for consistency models.

Python 6,415 434 Updated Mar 22, 2024

SAEval: A benchmark for sentiment analysis to evaluate the model's performance on various subtasks.

Python 12 2 Updated Apr 29, 2024

UniSA: Unified Generative Framework for Sentiment Analysis

Python 53 5 Updated Apr 29, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,521 188 Updated Apr 2, 2025

Multimodal datasets.

Python 30 9 Updated Jan 26, 2024