Lists (13)
Sort Name ascending (A-Z)
- All languages
- ANTLR
- Assembly
- Astro
- AutoHotkey
- Batchfile
- BibTeX Style
- Bicep
- C
- C#
- C++
- CMake
- CSS
- Circom
- Clojure
- Cuda
- Cython
- D
- Dart
- Dockerfile
- Elixir
- Emacs Lisp
- GLSL
- Go
- Groff
- HLSL
- HTML
- Handlebars
- Haskell
- Inno Setup
- Jai
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Lean
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Mathematica
- Max
- Mojo
- Nim
- Objective-C
- Objective-C++
- OpenEdge ABL
- PHP
- PostScript
- PowerShell
- Processing
- Python
- R
- Ren'Py
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- Sass
- Scala
- Scheme
- Shell
- SourcePawn
- Stylus
- Svelte
- Swift
- SystemVerilog
- Tcl
- TeX
- TypeScript
- Verilog
- Vim Script
- Visual Basic 6.0
- Vue
- XC
- XSLT
- YARA
Starred repositories
A dataset containing synchronized visual, inertial and GNSS raw measurements.
FusionFly is an open-source toolkit for standardizing GNSS (Global Navigation Satellite System) and IMU (Inertial Measurement Unit) data with Factor Graph Optimization (FGO). The system provides a …
A unified evaluation suite for speech-to-text translation, covering SpeechLLMs, SFMs, and cascaded systems across diverse real-world speech phenomena.
DPDFNet: causal single-channel speech enhancement that boosts DeepFilterNet2 with dual-path RNN blocks for stronger long-range temporal and cross-band modeling. Repo includes PyTorch implementation…
LibriVAD - a scalable open-source dataset derived from LibriSpeech and augmented with diverse real-world and synthetic noise sources, in addition to deep learning benchmarks..
Towards Scalable Pre-training of Visual Tokenizers for Generation
Live speech translation application built with Electron 34 and React, using OpenAI's Realtime API.
PyTorch Implementation of "Resource Efficient 3D Convolutional Neural Networks", codes and pretrained models.
EdgeFace: Efficient Face Recognition Model for Edge Devices [TBIOM 2024] the winner of compact track of IJCB 2023 Efficient Face Recognition Competition
RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything
Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.
A library for making PyTorch models streamable
Lightning-YOLOs provides clean, modular YOLO object detection models built on PyTorch Lightning, making it easier to train, extend, and experiment with modern YOLO variants in research and producti…
TPAMI:Frequency-aware Feature Fusion for Dense Image Prediction
Learning in infinite dimension with neural operators.
Official implementation of "Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation".
Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"
Official code of Motus: A Unified Latent Action World Model
Official Implementation of Dynamic erf (Derf).
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
HaMeR: Reconstructing Hands in 3D with Transformers
Pixio: a capable vision encoder dedicated to dense tasks, simply by pixel reconstruction
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module designs and a specially curated dataset.
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…