Skip to content
View Sonder-zyz's full-sized avatar
💭
In a daze
💭
In a daze
  • Zhejiang University
  • Hangzhou, Zhejiang Province, China

Block or report Sonder-zyz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best ChatGPT that $100 can buy.

Python 39,077 4,954 Updated Dec 9, 2025

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,584 2,283 Updated Oct 17, 2025
Python 128 21 Updated Jun 27, 2021

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 614 47 Updated Sep 15, 2025

Audio Dataset for training CLAP and other models

Python 723 59 Updated Feb 5, 2024

Contrastive Language-Audio Pretraining

Python 1,943 198 Updated May 15, 2025

RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2

Python 33 2 Updated Aug 29, 2025

Python code for handling the Clotho dataset.

Python 85 15 Updated Nov 24, 2020

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 200 24 Updated Oct 6, 2025

Audio Large Language Models

Python 828 42 Updated Jul 5, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,854 303 Updated Jun 12, 2025
Python 127 5 Updated Sep 4, 2025

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 287 14 Updated Jun 17, 2025

Open source code for supervised learning of bridge bidding.

Python 4 Updated Oct 31, 2023

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

923 74 Updated Dec 15, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,040 1,271 Updated Oct 11, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,437 1,155 Updated Apr 30, 2025
Python 4,461 435 Updated Sep 14, 2025

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 490 15 Updated Nov 18, 2025

🔥🔥First-ever hour scale video understanding models

Python 593 40 Updated Jul 14, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 513 20 Updated Nov 5, 2025

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,782 76 Updated Nov 27, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,456 1,999 Updated Nov 1, 2025

Brief guides for ZJU freshmen. [site](https://zjuers.com/welcome/)

HTML 124 19 Updated Oct 24, 2025

Train transformer language models with reinforcement learning.

Python 16,740 2,372 Updated Dec 22, 2025

An open source implementation of CLIP.

Python 13,148 1,220 Updated Nov 4, 2025

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 10,158 1,394 Updated Jul 15, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,042 3,865 Updated Jul 23, 2024

OpenAI CLIP text encoders for multiple languages!

Jupyter Notebook 823 69 Updated May 15, 2023
Next