HanielF

🎯

Focusing

HanielF HanielF

🎯

Focusing

I am a master of the Institute of Computing Technology, Chinese Academy of Sciences, mainly focusing on multimedia understanding.

35 followers · 43 following

ICT
https://hanielxx.com

Achievements

Starred repositories

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 56,541 9,839 Updated Feb 11, 2026

WeThinkIn / AIGC-Interview-Book

【三年面试五年模拟】AIGC/LLM/AI Agent算法工程师面试秘籍。涵盖AIGC、LLM大模型、AI Agent、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、大数据挖掘、具身智能、元宇宙、AGI等AI行业面试笔试干货经验与核心知识。

3,924 411 Updated Jun 13, 2026

priyammaz / PyTorch-Adventures

This repository contains an exhaustive coverage of a hands on approach to PyTorch along side powerful tools to accelerate model tuning and training

Jupyter Notebook 275 57 Updated May 16, 2026

Kwai-Keye / Keye

Python 789 24 Updated Jun 10, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,631 968 Updated Jun 9, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,953 4,072 Updated Jun 13, 2026

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

4,386 255 Updated May 20, 2026

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

5,014 541 Updated Sep 25, 2024

WLiK / LLM4Rec-Awesome-Papers

A list of awesome papers and resources of recommender system on large language model (LLM).

2,292 164 Updated Mar 17, 2025

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,137 8,826 Updated Jun 13, 2026

huangb23 / VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 296 13 Updated Jun 13, 2024

SkalskiP / top-cvpr-2024-papers

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Python 737 57 Updated Apr 15, 2026

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,298 1,992 Updated Jan 9, 2026

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,203 144 Updated Jun 13, 2026

ollama / ollama

Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 174,072 16,596 Updated Jun 13, 2026

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,054 76 Updated Nov 18, 2024

CrazyBoyM / llama3-Chinese-chat

Llama3-中文后训练版

Python 4,152 334 Updated Feb 21, 2026

LlamaChinese / Llama-Chinese

Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用

Python 14,714 1,301 Updated Apr 6, 2025

OpenBMB / MiniCPM-V

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python 25,612 2,005 Updated Jun 4, 2026

thunlp / LLaVA-UHD

LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs

Python 424 20 Updated Dec 20, 2025

JIA-Lab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,326 275 Updated May 4, 2024

pyecharts / pyecharts

🎨 Python Echarts Plotting Library

Python 15,761 2,855 Updated Jun 12, 2026

yyyujintang / Awesome-Mamba-Papers

Awesome Papers related to Mamba.

1,399 74 Updated Oct 17, 2024

Yuliang-Liu / MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 850 56 Updated Jun 12, 2026

brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,901 1,554 Updated Feb 6, 2026

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,127 592 Updated Apr 24, 2024

yiren-jian / BLIText

[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

Python 26 2 Updated Dec 5, 2023

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,881 562 Updated Jul 11, 2024

luogen1996 / LLaVA-HR

[ICLR2025] LLaVA-HR: High-Resolution Large Language-Vision Assistant

Python 249 12 Updated Aug 14, 2024

LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context

Python 7,417 558 Updated Oct 19, 2024

HanielF HanielF

Starred repositories

video-retrieval

SpaceVim

Algorithm