Stars
Code for building ConceptNet from raw data.
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Code for 'LLM2VEC-GEN: Generative Embeddings from Large Language Models'
Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.
A clean, single-file PyTorch implementation of Attention Residuals (Kimi Team, MoonshotAI, 2026), integrated with Grouped Query Attention (GQA), SwiGLU feed-forward networks, and Rotary Position Em…
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Zotero Add-on Market | Browsing, installing, and reviewing plugins within Zotero
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
[SIGGRAPH 2026] PEAR: Pixel-aligned Expressive humAn mesh Recovery
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability f…
Utilities intended for use with Llama models.
An open source implementation of CLIP.
[NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
A comprehensive paper list on Vision Transformers and attention, including papers, code, and related websites
LAVIS - A One-stop Library for Language-Vision Intelligence
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with layout fully preserved, supporting services such as Google/DeepL/Ollama/OpenAI and providing CLI/GUI/MCP/Docker/Zotero
Document (Novel, Thesis, Subtitle) Translation Tool (Supports pdf/word/excel/json/epub/srt...)
AMoE: Agglomerative Mixture-of-Experts Vision Foundation Models
PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
SGLang is a high-performance serving framework for large language models and multimodal models.