StOnEGiggity
🎯 Focusing

Highlights

  • Pro


Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 9,427 1,012 Updated Jun 14, 2026

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 193,839 109,960 Updated Jun 8, 2026

An automation black-box testing framework based on image recognition

C++ 4,250 473 Updated Jun 15, 2026

OpenMMLab Model Deployment Framework

Python 3,125 713 Updated Sep 30, 2024

MV2DFusion

Python 100 16 Updated Dec 26, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,284 1,312 Updated Jun 7, 2026

Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Python 164 13 Updated Jun 23, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,396 273 Updated Sep 12, 2025
Python 6,086 472 Updated Jun 15, 2026

Simulation platform for general-purpose robotics & embodied AI learning.

Python 29,350 2,782 Updated Jun 15, 2026
Python 522 33 Updated Jan 20, 2025

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 643 35 Updated Jun 1, 2026

Prompts for GPT-4V & DALL-E3 to fully utilize their multi-modal abilities. GPT4V Prompts, DALL-E3 Prompts.

286 25 Updated Aug 18, 2025

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,124 357 Updated Mar 13, 2026

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,206 144 Updated Jun 13, 2026

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 2,064 156 Updated Dec 6, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 540 26 Updated Apr 8, 2024

The official repository of "Video assistant towards large language model makes everything easy"

Python 230 15 Updated Dec 24, 2024

Fast and memory-efficient exact attention

Python 24,157 2,831 Updated Jun 10, 2026

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Python 159 19 Updated Dec 9, 2024

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 292 19 Updated Aug 5, 2025

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,504 129 Updated Aug 5, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,180 8,832 Updated Jun 15, 2026

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,491 251 Updated Dec 3, 2024

An Extensible Continual Learning Framework Focused on Language Models (LMs)

Python 290 22 Updated Jan 28, 2024

OpenEQA: Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 365 28 Updated Sep 20, 2024
Python 209 12 Updated Jul 12, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,695 167 Updated Oct 28, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,957 400 Updated Feb 27, 2025

Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay

Python 31 3 Updated Jan 5, 2022