Starred repositories
The AI Browser Automation Framework
DeepEP: an efficient expert-parallel communication library
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Best Practices on Recommendation Systems
FlashMLA: Efficient Multi-head Latent Attention Kernels
ModaNet: A large-scale street fashion dataset with polygon annotations
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Align Anything: Training All-modality Model with Feedback
Solve Visual Understanding with Reinforced VLMs
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
openwfm / WRF-SFIRE
Forked from wrf-model/WRFA coupled weather-fire forecasting model built on top of Weather Research and Forecasting (WRF). This is the original https://github.com/openwfm/wrf-fire transitioned to a fork of WRF and selected…
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Muzic: Music Understanding and Generation with Artificial Intelligence
Unified Controllable Visual Generation Model
Official PyTorch implementation of StyleGAN3
支持将Wordpress网站一键转为APP(iPhone和Android两个版本)、微信小程序、百度小程序、支付宝小程序,同时也支持转成风格相似的H5。
Use OpenCV image capture with the powerful Mediapipe library to achieve human movement detection and recognition; The recognition results are synchronized to Unity in real time to realize the recog…
🎨 A powerful multi-end drawing board that brings together a lot of creative brushes to experience a whole new range of drawing effects!
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Label, clean and enrich text datasets with LLMs.
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
团队协作与管理平台,具有在线多人聊天、消息实时推送、协同编辑等功能
Running large language models on a single GPU for throughput-oriented scenarios.
A python interface for interacting with the Ethereum blockchain and ecosystem.
抖音爬虫——采集账号主页、喜欢、收藏、音乐原声、话题、搜索、合集、作品、关注、粉丝等公开数据。