Stars
Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization
《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Offer!
[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
Simulation platform for general-purpose robotics & embodied AI learning.
[ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate".
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning
Video+code lecture on building nanoGPT from scratch
DSIR large-scale data selection framework for language model training
llama3 implementation one matrix multiplication at a time
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
[ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
DeepSeek-VL: Towards Real-World Vision-Language Understanding
A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Mod…
The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning