Lists (32)
Sort Name ascending (A-Z)
AI
Book
C/C++
Colorization
ComfyUI
Computer Vision
Object Detection / Segmentation / Recognition, Optical Character Recognition (OCR), Vision-Language Models (VLM)Dataset
Deep Learning
Emacs
Fonts
Games
i3wm
Image/Video Generation
Generative Adversarial Networks (GAN), Autoregressive Models, Diffusion Models (DM), Latent Diffusion Models (LDM)Image/Video Restoration
Denoising, Super-Resolution, Colorization, InpaintingInpainting
Image and Video InpaintingLanguage Models
Natural Language Processing (NLP), Large Language Models (LLM)Mathematics
Media
Metrics
Multimodal Foundation Models
Obsidian
Programming Languages
Python
PyTorch
Rust
Speech and Audio
Stable Diffusion
Transformer
UI Framework
Utils
ViT
Web
Stars
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Collaborative cheatsheets for console commands 📚.
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. #1 on OpenRouter. 1.5M+ Kilo Coders. 25T+ tokens processed
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
A fixed version of macOS's Unicode Hex Input keyboard layout
Rust GUI components for building fantastic cross-platform desktop application by using GPUI.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Highlight Org-mode table columns and rows using colored overlays
An open-source AI agent that brings the power of Gemini directly into your terminal.
🪐 Markdown with superpowers: from ideas to papers, presentations, websites, books, and knowledge bases.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
MOVED TO CODEBERG - Web-based environment for live coding algorithmic patterns, incorporating a faithful port of TidalCycles to JavaScript
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
chat with private and local large language models
Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022
A Conversational Speech Generation Model
DSPy: The framework for programming—not prompting—language models
A TTS model capable of generating ultra-realistic dialogue in one pass.
🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity