Skip to content
View erow's full-sized avatar

Highlights

  • Pro

Block or report erow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Your AI agent army, commanded from Slack/Discord/Wechat/Lark. Stream Claude Code, OpenCode, or Codex in real-time — from anywhere.

Python 360 46 Updated Mar 29, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 5,803 446 Updated Mar 26, 2026

Official repository of Utonia: Toward One Encoder for All Point Clouds

Python 565 28 Updated Mar 5, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 340,265 67,096 Updated Mar 29, 2026

Blog Write multi agent AI is a custom multi-agent system designed to autonomously create high-quality, research-driven blogs. Using LangChain, Gemini 2.0-Flash-EXP, and Serper Web Search Tool, it a…

Jupyter Notebook 52 11 Updated Aug 11, 2025

AI、CV方向国际学术期刊

32 1 Updated Mar 24, 2025

A paper list of some recent works about Token Compress for Vit and VLM

866 38 Updated Mar 25, 2026

RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 161 15 Updated Jun 26, 2025
Python 58 6 Updated Feb 24, 2026

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,171 75 Updated Jul 15, 2025

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 34,261 7,919 Updated Mar 21, 2026

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 3,160 99 Updated Mar 20, 2026

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,990 3,479 Updated Mar 27, 2026

The official GitHub page for the survey paper "Self-Supervised learning for Videos: A survey"

9 Updated Jul 19, 2023

Unleashing Reasoning in Medical Large Language Models

12 Updated Mar 19, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,650 371 Updated Feb 27, 2025

Official implementation of SimVAE

Python 5 Updated Feb 27, 2025

A simple PyTorch implementation of influence functions.

Python 92 12 Updated Jun 17, 2024

Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders

Python 69 6 Updated Nov 17, 2023

[NeurIPS 2023] code for "DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models

Python 73 7 Updated Oct 20, 2023

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 10,044 1,010 Updated Mar 23, 2026

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Python 354 17 Updated Apr 23, 2025

Official implementation of MAIA, A Multimodal Automated Interpretability Agent

Jupyter Notebook 106 22 Updated Oct 22, 2025

O1 Replication Journey

2,000 61 Updated Jan 14, 2025

A comprehensive list of awesome contrastive self-supervised learning papers.

1,308 126 Updated Sep 10, 2024

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.

Python 1,613 99 Updated Apr 24, 2025

VFS Appointment Bot - This script automates checking for appointments at VFS Global offices in a specified country.

Python 368 172 Updated Nov 19, 2024
Next