Starred repositories
A paper list of recent work on token compression for ViT and VLM
RM-R1: Unleashing the Reasoning Potential of Reward Models
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
The official GitHub page for the survey paper "Self-Supervised learning for Videos: A survey"
Unleashing Reasoning in Medical Large Language Models
PyTorch code and models for V-JEPA self-supervised learning from video.
A simple PyTorch implementation of influence functions.
Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders
[NeurIPS 2023] Code for "DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models"
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Official implementation of MAIA, A Multimodal Automated Interpretability Agent
A comprehensive list of awesome contrastive self-supervised learning papers.
MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.
VFS Appointment Bot - This script automates checking for appointments at VFS Global offices in a specified country.
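This article analyzes in detail the implementation principles of GitHub Copilot, a machine-learning-based code completion tool. Through reverse engineering, the author explores Copilot's core logic in depth, including the entry point for code suggestions, the core method for fetching suggestions, and the related caching strategies and experimental features.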
Compare neural networks by their feature similarity
Collection of papers on state-space models
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"