Sonder-zyz
💭
In a daze
  • Zhejiang University
  • Hangzhou, Zhejiang Province, China

101 results for source starred repositories

Open-source Jianying (CapCut) assistant | Jianying API | Coze plugin | Open-source CapCut automation toolkit to generate & download draft files.

Python 210 51 Updated Feb 1, 2026

Open-source Jianying (CapCut) assistant client | Open-source CapCut automation toolkit to generate & download draft files.

JavaScript 24 12 Updated Jan 10, 2026

📚 A from-scratch tutorial on the principles and practice of large language models

Jupyter Notebook 25,577 2,372 Updated Jan 29, 2026

A video-editing agent built with Claude Code Skills

JavaScript 828 157 Updated Jan 31, 2026

The best ChatGPT that $100 can buy.

Python 42,371 5,471 Updated Feb 5, 2026

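🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.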

JavaScript 23,017 2,320 Updated Oct 17, 2025
Python 129 21 Updated Jun 27, 2021

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 624 48 Updated Sep 15, 2025

Audio Dataset for training CLAP and other models

Python 729 59 Updated Jan 8, 2026

Contrastive Language-Audio Pretraining

Python 2,024 202 Updated May 15, 2025

RayGen: Multi-Modal Dataset Reinforcement for MobileCLIP and MobileCLIP2

Python 37 2 Updated Aug 29, 2025

Python code for handling the Clotho dataset.

Python 85 15 Updated Nov 24, 2020

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 203 24 Updated Oct 6, 2025

Audio Large Language Models

Python 864 43 Updated Jul 5, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,914 316 Updated Jun 12, 2025
Python 131 6 Updated Jan 30, 2026

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 294 15 Updated Jun 17, 2025

Open source code for supervised learning of bridge bidding.

Python 4 Updated Oct 31, 2023

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

984 83 Updated Dec 15, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,410 1,331 Updated Oct 11, 2025

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 12,279 1,237 Updated Apr 30, 2025
Python 4,549 441 Updated Sep 14, 2025

[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 502 16 Updated Nov 18, 2025

🔥🔥First-ever hour scale video understanding models

Python 611 41 Updated Jul 14, 2025

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 532 21 Updated Jan 4, 2026

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,810 75 Updated Nov 27, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,736 2,031 Updated Jan 13, 2026

Brief guides for ZJU freshmen. [site](https://zjuers.com/welcome/)

HTML 127 20 Updated Oct 24, 2025

Train transformer language models with reinforcement learning.

Python 17,297 2,475 Updated Feb 6, 2026