Starred repositories

Blog Write multi agent AI is a custom multi-agent system designed to autonomously create high-quality, research-driven blogs. Using LangChain, Gemini 2.0-Flash-EXP, and Serper Web Search Tool, it a…

Jupyter Notebook 46 11 Updated Aug 11, 2025

International academic journals in AI and CV

29 1 Updated Mar 24, 2025

A paper list of recent works on token compression for ViT and VLM

827 38 Updated Feb 6, 2026

RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 158 15 Updated Jun 26, 2025
Python 53 4 Updated Dec 17, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,155 69 Updated Jul 15, 2025

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 34,052 7,897 Updated Nov 17, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 3,060 93 Updated Feb 6, 2026

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,983 3,493 Updated Feb 4, 2026

The official GitHub page for the survey paper "Self-Supervised learning for Videos: A survey"

9 Updated Jul 19, 2023

Unleashing Reasoning in Medical Large Language Models

12 Updated Mar 19, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,513 357 Updated Feb 27, 2025

Official implementation of SimVAE

Python 4 Updated Feb 27, 2025

A simple PyTorch implementation of influence functions.

Python 92 12 Updated Jun 17, 2024

Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders

Python 69 6 Updated Nov 17, 2023

[NeurIPS 2023] code for "DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models"

Python 71 7 Updated Oct 20, 2023

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,635 952 Updated Feb 5, 2026

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Python 352 17 Updated Apr 23, 2025

Official implementation of MAIA, A Multimodal Automated Interpretability Agent

Jupyter Notebook 102 20 Updated Oct 22, 2025

O1 Replication Journey

1,999 61 Updated Jan 14, 2025

A comprehensive list of awesome contrastive self-supervised learning papers.

1,311 128 Updated Sep 10, 2024

MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

Python 1,598 99 Updated Apr 24, 2025

VFS Appointment Bot - This script automates checking for appointments at VFS Global offices in a specified country.

Python 356 167 Updated Nov 19, 2024

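This article analyzes in detail how GitHub Copilot, a machine-learning-based code autocompletion tool, is implemented. Through reverse engineering, the author explores Copilot's core logic in depth, including the entry point for code suggestions, the core method for fetching suggestions, and the related caching strategies, experimental features, and more.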

JavaScript 2,207 256 Updated Jun 30, 2023

Compare neural networks by their feature similarity

Python 377 40 Updated May 17, 2023

Tiny AutoEncoder for Stable Diffusion (and other image models)

Python 880 48 Updated Jan 23, 2026

Mamba SSM architecture

Python 17,169 1,584 Updated Jan 12, 2026