

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Python 42 2 Updated Nov 25, 2025

Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs, LAION-CLAP, MS-CLAP, DeSync

Python 45 3 Updated Aug 20, 2025
Python 8 1 Updated Jul 19, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,379 96 Updated Dec 11, 2025

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,108 65 Updated Nov 25, 2025

[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding

Python 150 11 Updated Dec 19, 2025

**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.

Python 320 7 Updated Nov 3, 2025

A GUI client for Windows, Linux, and macOS that supports Xray, sing-box, and others

C# 92,900 13,882 Updated Dec 22, 2025

Proxy setup adapted for AutoDL platform servers, using Clash as the proxy tool

Shell 557 56 Updated Jul 10, 2025

[KDD'2026] "VideoRAG: Chat with Your Videos"

Python 1,388 207 Updated Nov 22, 2025

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 366 35 Updated Oct 28, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,671 122 Updated Dec 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,709 2,870 Updated Dec 23, 2025

Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…

Python 60 2 Updated Jun 11, 2025

🔥CVPR 2025 Multimodal Large Language Models Paper List

153 4 Updated Mar 12, 2025

Collection of CVPR 2025 papers and open-source projects

21,682 2,772 Updated Jul 2, 2025

Named Entity Recognition (NER) for biomedical research papers using BERT, BioBERT, BiLSTM, and CRF models. Implements deep learning and reinforcement learning to enhance medical text extraction accu…

Jupyter Notebook 2 Updated Mar 18, 2025

Notebook for BERT medical named entity recognition

Jupyter Notebook 43 11 Updated Sep 25, 2022

Repository for Publicly Available Clinical BERT Embeddings

Python 742 150 Updated Aug 25, 2020

BioBERT model fine-tuned for the NER task on the PubMed dataset

Jupyter Notebook 11 Updated Apr 28, 2023

A small training tool for LeetCode weekly contests; feel free to use it 🎈

TypeScript 745 42 Updated Dec 11, 2025

[Lanqiao Cup Python Sprint Course] Video collection: https://space.bilibili.com/398421867/lists?sid=4898042&spm_id_from=333.788.0.0

Python 455 32 Updated Nov 8, 2025

Code accompanying my PyTorch study notes; click to view the online e-book of the PyTorch notes

Python 1,391 294 Updated Dec 5, 2020

✔ (Completed) The most comprehensive deep learning notes [Tudui: PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: LLM Agents]

Jupyter Notebook 15,468 1,799 Updated Dec 18, 2025

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 2,103 267 Updated Jun 4, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,681 76 Updated May 11, 2025

BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays

Python 44 2 Updated May 21, 2025