Skip to content
View Sonder-zyz's full-sized avatar
💭
In a daze
💭
In a daze
  • Zhejiang University
  • Hangzhou, Zhejiang Province, China

Block or report Sonder-zyz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
45 stars written in Python
Clear filter

The world's simplest facial recognition api for Python and the command line

Python 56,238 13,702 Updated Aug 21, 2024

The best ChatGPT that $100 can buy.

Python 50,466 6,617 Updated Mar 26, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,949 2,063 Updated Mar 26, 2026

Train transformer language models with reinforcement learning.

Python 17,813 2,592 Updated Mar 27, 2026

An open source implementation of CLIP.

Python 13,584 1,261 Updated Mar 12, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,714 1,399 Updated Mar 3, 2026

Retrieval and Retrieval-augmented LLMs

Python 11,457 845 Updated Mar 27, 2026

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 10,185 1,392 Updated Jul 15, 2025

Semantic Image Synthesis with SPADE

Python 7,708 973 Updated Aug 7, 2023

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,282 549 Updated May 5, 2025
Python 4,615 452 Updated Sep 14, 2025

Contrastive Language-Audio Pretraining

Python 2,079 206 Updated May 15, 2025
Python 1,864 116 Updated Sep 30, 2025

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,826 76 Updated Nov 27, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,469 116 Updated Oct 9, 2025

FireRed-OpenStoryline is an AI video editing agent that transforms manual editing into intention-driven directing through natural language interaction, LLM-powered planning, and precise tool orches…

Python 1,321 126 Updated Mar 27, 2026

A high-performance topological machine learning toolbox in Python

Python 979 195 Updated Jun 18, 2024

Audio Large Language Models

Python 895 45 Updated Jul 5, 2025

This repository is the notebook of Data Structure and Algorithms of ZJU "数据结构-浙江大学"

Python 752 250 Updated Sep 22, 2019

Audio Dataset for training CLAP and other models

Python 732 59 Updated Jan 8, 2026

Kepler Mapper: A flexible Python implementation of the Mapper algorithm.

Python 648 185 Updated Mar 7, 2026

🔥🔥First-ever hour scale video understanding models

Python 618 41 Updated Jul 14, 2025

开源剪映小助手|剪映API | 扣子插件 | Open-source CapCut automation toolkit to generate & download draft files. | skills

Python 571 109 Updated Mar 22, 2026

Topological Data Analysis for Python🐍

Python 567 56 Updated Mar 9, 2026

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 552 23 Updated Jan 4, 2026

[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 511 17 Updated Nov 18, 2025

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 298 15 Updated Jun 17, 2025

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 207 24 Updated Oct 6, 2025

美赛常用模型

Python 160 21 Updated Jan 23, 2019
Python 136 6 Updated Feb 9, 2026
Next