- Hangzhou, China
-
01:42
(UTC +08:00) - https://vra.github.io/about
- https://www.zhihu.com/people/yunfeng-87
Highlights
Lists (31)
Sort Name ascending (A-Z)
action-recognition
Application-of-AI
audio
Awesome
Body
C++
CG
计算机图形学的东西Computer Vision
传统视觉算法,跟DL无关的CV算法Dataset
deep learning
Detection
检测任务相关,包括YOLO, 检测框架等doc
e-book
Face
Face Detection, Face Alignment, Face 3DGAN
Hand
Large-Language-Models and AIGC
大语言模型, AIGCmac
machine-learning
mcp
misc
nerf
Python
Python库Pytorch
Pytorch相关库Rust
segmentation
shape3d
tts
VIM
wasm
Web
- All languages
- Assembly
- Awk
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Cuda
- Cython
- Dockerfile
- Elixir
- Emacs Lisp
- Go
- HTML
- Haskell
- JSON
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Lua
- MATLAB
- MDX
- Markdown
- Mojo
- Nim
- Objective-C
- PHP
- PureBasic
- Python
- R
- Roff
- Ruby
- Rust
- SCSS
- Sass
- Scala
- Shell
- Svelte
- Swift
- TeX
- Terra
- TypeScript
- Vim Script
- Vue
- Zig
Starred repositories
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
A command-line interface for running Supertonic TTS models using MNN.
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Scaling Spatial Intelligence with Multimodal Foundation Models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
A task runner that works well with poetry or uv.
A lightweight, single-header C++11 Jinja2 template engine for LLM chat templates.
👓 Solve Rubik's Cube in 20 moves using Xiaomi AI Glasses. 用小米 AI 眼镜在 20 步内还原魔方。
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
The official Soundwave repository
A framework for efficient model inference with omni-modality models
SkyRL: A Modular Full-stack RL Library for LLMs
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
The definitive Web UI for local AI, with powerful features and easy setup.
GraphQA: Natural Language Graph Analysis Framework - Ask questions about any graph in natural language
Awesome Literature Graph Learning Challenges
A dataset of complex questions on semi-structured Wikipedia tables
Code for the ICSC 2025 paper "Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering"
[Paper][EMNLP 2025] SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
General technology for enabling AI capabilities w/ LLMs and MLLMs
[ACL 2024] TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Sematic Tasks
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B