Skip to content
View erow's full-sized avatar

Highlights

  • Pro

Block or report erow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A paper list of some recent works about Token Compress for Vit and VLM

789 37 Updated Dec 18, 2025

RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 155 14 Updated Jun 26, 2025
Python 50 2 Updated Dec 17, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,132 66 Updated Jul 15, 2025

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 33,856 7,879 Updated Nov 17, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 2,893 90 Updated Dec 20, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,703 3,465 Updated Dec 21, 2025

The official GitHub page for the survey paper "Self-Supervised learning for Videos: A survey"

8 Updated Jul 19, 2023

Unleashing Reasoning in Medical Large Language Models

12 Updated Mar 19, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,336 332 Updated Feb 27, 2025

Official implementation of SimVAE

Python 4 Updated Feb 27, 2025

A simple PyTorch implementation of influence functions.

Python 92 12 Updated Jun 17, 2024

Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders

Python 68 6 Updated Nov 17, 2023

[NeurIPS 2023] code for "DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models

Python 71 7 Updated Oct 20, 2023

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,005 881 Updated Dec 4, 2025

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Python 351 17 Updated Apr 23, 2025

Official implementation of MAIA, A Multimodal Automated Interpretability Agent

Jupyter Notebook 99 18 Updated Oct 22, 2025

O1 Replication Journey

2,003 63 Updated Jan 14, 2025

A comprehensive list of awesome contrastive self-supervised learning papers.

1,306 129 Updated Sep 10, 2024

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.

Python 1,584 97 Updated Apr 24, 2025

VFS Appointment Bot - This script automates checking for appointments at VFS Global offices in a specified country.

Python 345 165 Updated Nov 19, 2024

本文详细分析了 Github Copilot 这个基于机器学习的代码自动补全工具的实现原理。作者通过逆向工程的方式,深入探索了 Copilot 的核心逻辑,包括代码提示的入口、获取提示的核心方法、以及相关的缓存策略、实验特性等。

JavaScript 2,204 256 Updated Jun 30, 2023

Compare neural networks by their feature similarity

Python 377 39 Updated May 17, 2023

Tiny AutoEncoder for Stable Diffusion

Python 840 43 Updated Nov 27, 2025

Mamba SSM architecture

Python 16,775 1,543 Updated Nov 11, 2025

Collection of papers on state-space models

613 22 Updated Nov 4, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,730 266 Updated Feb 13, 2025

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Python 320 14 Updated Jun 3, 2024
Next