🚀🚀 [LLM] Train a 64M-parameter GPT from scratch in just 2h! 🌏

Python 46,252 5,690 Updated Apr 9, 2026

🔥 LLM & Agent Interview Preparation Guide

213 11 Updated Feb 28, 2026

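[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AIGC algorithm engineers. Covers practical interview and written-test experience and core knowledge from across the AI industry, including AIGC, LLMs, AI agents, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, reinforcement learning, big data mining, embodied intelligence, the metaverse, and AGI.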

3,437 379 Updated Mar 30, 2026

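LLM algorithm engineer interview questions (with answers): common questions and concept explanations. "LLM interview questions", "algorithm position interviews", "common interview questions", "LLM algorithm interviews", "LLM application basics"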

Jupyter Notebook 1,820 127 Updated Apr 2, 2026

Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers

HTML 13,765 1,355 Updated Apr 30, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 4,016 674 Updated Apr 9, 2026

[ICLR 2026] ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 503 59 Updated Mar 13, 2026

The Source Code for OmniVideoBench @ICLR 2026

Python 70 3 Updated Feb 12, 2026

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 652 51 Updated Feb 26, 2026

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Python 50 3 Updated Mar 20, 2026

Unified Codebase for Advanced World Models.

Python 540 24 Updated Apr 8, 2026

Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs, LAION-CLAP, MS-CLAP, DeSync

Python 65 7 Updated Feb 14, 2026
Python 14 1 Updated Jul 19, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,569 110 Updated Mar 23, 2026

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,299 81 Updated Apr 3, 2026

[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding

Python 200 16 Updated Dec 19, 2025

**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.

Python 371 14 Updated Nov 3, 2025

A GUI client for Windows, Linux and macOS, support Xray and sing-box and others

C# 101,369 14,536 Updated Apr 6, 2026

A proxy setup adapted for AutoDL platform servers, using Clash as the proxy tool

Shell 678 61 Updated Feb 17, 2026

[KDD'2026] "VideoRAG: Chat with Your Videos"

Python 2,848 405 Updated Mar 18, 2026

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 409 40 Updated Jan 14, 2026

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 3,142 247 Updated Apr 9, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,550 3,610 Updated Apr 9, 2026

Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…

Python 72 2 Updated Jun 11, 2025

🔥Awesome Multimodal Large Language Models Paper List

154 4 Updated Mar 12, 2025

A collection of CVPR 2026 papers and open-source projects

22,339 2,789 Updated Mar 8, 2026

Named Entity Recognition (NER) for biomedical research papers using BERT, BioBERT, BiLSTM, and CRF models. Implements deep learning and reinforcement learning to enhance medical text extraction accu…

Jupyter Notebook 2 Updated Mar 18, 2025

Notebook for BERT medical named entity recognition

Jupyter Notebook 44 11 Updated Sep 25, 2022