Stars
Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0 license. You combine them with any detection model you alre…
Esp32 Multiple Client RTSP Server with Video, Audio & Subtitles
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
Simple UI for debugging correlations of text embeddings
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
ImageBind One Embedding Space to Bind Them All
Cost-efficient and pluggable Infrastructure components for GenAI inference
A collection of prompts, system prompts and LLM instructions
An implementation of the OPAQUE password-authenticated key exchange protocol
Code for getting started with the FLIR Lepton breakout board
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Spawning-Inc / img2dataset
Forked from rom1504/img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
RSTutorials: A Curated List of Algorithms about Traditional and Social Recommender System.
RSTutorials: A Curated List of Must-read Papers on Recommender System.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Open-source framework for exporting your personal data.
📄 PDF reader in JavaScript only for Expo - Android & iOS capable