Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,420 212 Updated Jan 8, 2026

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,355 210 Updated May 19, 2025

mesolitica / NLP-Models-Tensorflow

Gathers machine learning and TensorFlow deep learning models for NLP problems; requires 1.13 < TensorFlow < 2.0.

Jupyter Notebook 1,787 719 Updated Jul 20, 2020

Glanvery / LLM-Travel

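Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the technologies, principles, and applications related to large models.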

Jupyter Notebook 372 40 Updated Jul 21, 2024

CASIA-IVA-Lab / VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 298 18 Updated Mar 14, 2024

meng-tang / rloss

Regularized Losses (rloss) for Weakly-supervised CNN Segmentation

Jupyter Notebook 213 48 Updated Oct 3, 2023

TianHao Zhang Zth9730

Lists (3)

jax related

linux tools

tts

Stars

suno-ai / bark

CompVis / latent-diffusion

google / flax

QwenLM / Qwen3-Omni

google-research / big_vision

mesolitica / NLP-Models-Tensorflow

Glanvery / LLM-Travel

CASIA-IVA-Lab / VAST

meng-tang / rloss