-
Computer of Science and Technology Beijing
Lists (3)
Sort Name ascending (A-Z)
Stars
🔊 Text-Prompted Generative Audio Model
High-Resolution Image Synthesis with Latent Diffusion Models
Flax is a neural network library for JAX that is designed for flexibility.
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Regularized Losses (rloss) for Weakly-supervised CNN Segmentation