Stars
Production-ready platform for agentic workflow development.
🚀🚀 [LLM] Train a 26M-parameter GPT completely from scratch in just 2 hours!
🚀 [LLM] Train a 26M-parameter vision-language model (VLM) from scratch in just 1 hour!
NanoDet-Plus ⚡ Super fast and lightweight anchor-free object detection model. 🔥 Only 980 KB (int8) / 1.8 MB (fp16) and runs at 97 FPS on a cellphone 🔥
🦜🔗 The platform for reliable agents.
🏡 Open source home automation that puts local control and privacy first.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Deploys YOLOX+ByteTrack object tracking with OpenCV and with ONNXRuntime, including both C++ and Python versions of the program
BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection and pose estimation models
Supports DeepSORT and ByteTrack MOT (multi-object tracking) using YOLOv5 with C++
Swift implementation of Kalman Filter algorithm
Godot Engine – Multi-platform 2D and 3D game engine
Godot template and component-based framework for 2D games.
GPUImage 3 is a BSD-licensed Swift framework for GPU-accelerated video and image processing using Metal.
Examples and resources about SwiftUI, specifically focused on building macOS apps
A SwiftUI system components and interactions demo app
A collaborative list of awesome SwiftUI resources. Feel free to contribute!
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Anonymous automation via selenium with fingerprint replacement technology.
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
This repository contains the source code for the paper First Order Motion Model for Image Animation
AI-generated-character
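The most comprehensive database of Chinese poetry 🧶 Nearly 14,000 poets from the Tang and Song dynasties, with close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci (lyric poems) by 1,564 ci poets of the Northern and Southern Song.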
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
StyleGAN2 - Official TensorFlow Implementation
Official repo for VGen: a holistic video generation ecosystem built on diffusion models