🚀🚀 [LLM] Train a 64M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 45,319 5,500 Updated Apr 1, 2026

🔥 The Complete LLM & Agent Interview Preparation Guide

169 7 Updated Feb 28, 2026

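[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AIGC algorithm engineers. Covers practical interview and written-test experience and core knowledge across the AI industry: AIGC, LLMs, AI agents, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, reinforcement learning, big data mining, embodied intelligence, the metaverse, AGI, and more.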

3,386 374 Updated Mar 30, 2026

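LLM algorithm-role interview questions (with answers): common questions and concept explanations. "LLM interview questions", "algorithm-role interviews", "common interview questions", "LLM algorithm interviews", "LLM application fundamentals"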

Jupyter Notebook 1,809 126 Updated Apr 1, 2026

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 13,639 1,346 Updated Apr 30, 2025

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,986 663 Updated Mar 27, 2026

[ICLR 2026] ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Python 494 55 Updated Mar 13, 2026

The Source Code for OmniVideoBench @ICLR 2026

Python 70 3 Updated Feb 12, 2026

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 649 51 Updated Feb 26, 2026

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Python 49 2 Updated Mar 20, 2026

Unified Codebase for Advanced World Models.

Python 303 16 Updated Mar 31, 2026

Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs, LAION-CLAP, MS-CLAP, DeSync

Python 64 7 Updated Feb 14, 2026
Python 13 1 Updated Jul 19, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,557 109 Updated Mar 23, 2026

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,282 78 Updated Mar 25, 2026

[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding

Python 195 16 Updated Dec 19, 2025

**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.

Python 367 13 Updated Nov 3, 2025

A GUI client for Windows, Linux, and macOS that supports Xray, sing-box, and others

C# 100,398 14,430 Updated Apr 1, 2026

A proxy setup adapted for AutoDL platform servers, using Clash as the proxy tool

Shell 666 61 Updated Feb 17, 2026

[KDD'2026] "VideoRAG: Chat with Your Videos"

Python 2,827 403 Updated Mar 18, 2026

✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 406 40 Updated Jan 14, 2026

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 3,173 242 Updated Mar 28, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,377 3,552 Updated Apr 1, 2026

Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…

Python 71 2 Updated Jun 11, 2025

🔥Awesome Multimodal Large Language Models Paper List

154 4 Updated Mar 12, 2025

A collection of CVPR 2026 papers and open-source projects

22,296 2,786 Updated Mar 8, 2026

Named Entity Recognition (NER) for biomedical research papers using BERT, BioBERT, BiLSTM, and CRF models. Implements deep learning and reinforcement learning to enhance medical text extraction accu…

Jupyter Notebook 2 Updated Mar 18, 2025

Notebook for BERT medical named entity recognition

Jupyter Notebook 43 11 Updated Sep 25, 2022