Stars
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
Code for ICCV 2025 paper: "ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation"
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Open-source, accurate, and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
This repo contains the code for the 1D tokenizer and generator.
Open-Sora: Democratizing Efficient Video Production for All
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Solve Visual Understanding with Reinforced VLMs
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
[ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
xLAM: A Family of Large Action Models to Empower AI Agent Systems
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Codebase for Aria - an Open Multimodal Native MoE
Composable building blocks to build Llama Apps
Agentic components of the Llama Stack APIs
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle enables agents to ace any computer task through strong reasoning abilities, self-improvement, and skill curation.
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
A curated list of recent diffusion models for video generation, editing, and various other applications.
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation