Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 714 67 Updated Sep 19, 2024

Official repository for the A-OKVQA dataset

Python 106 14 Updated May 8, 2024

[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models

Python 82 10 Updated Feb 16, 2025

Awesome List for Agentic RL

HTML 661 27 Updated Dec 9, 2025

Latent Collaboration in Multi-Agent Systems

Python 636 95 Updated Dec 18, 2025

Official code implementation for paper "PathAgent: Toward Interpretable Analysis of Whole-slide Pathology Images via Large Language Model-based Agentic Reasoning"

Python 5 1 Updated Dec 8, 2025

The Hugging Face implementation of the Fine-grained Late-interaction Multi-modal Retriever.

Python 104 9 Updated May 30, 2025

Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems"

63 8 Updated Jul 22, 2025

Recurrence Meets Transformers for Universal Multimodal Retrieval

Python 13 Updated Dec 15, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,714 312 Updated Nov 13, 2025
Python 2 Updated Nov 11, 2025

[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".

Python 19 2 Updated Jun 16, 2025

This is the official repository for Retrieval Augmented Visual Question Answering

Python 243 20 Updated Dec 19, 2024

Computational Pathology Toolbox developed by TIA Centre, University of Warwick.

Python 498 102 Updated Dec 25, 2025

The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods".

508 24 Updated Jul 29, 2025

Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"

Python 420 35 Updated Oct 14, 2025

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

Python 246 19 Updated Jun 19, 2025

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 8,032 912 Updated Nov 15, 2025

Implementation Code for "LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination"

Python 12 Updated Apr 25, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,773 382 Updated Aug 13, 2025

Ensemble Learning of Foundation Models

Python 15 1 Updated Aug 29, 2025

[ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic context control to boost factual accuracy in multimodal medical reas…

Python 17 1 Updated Aug 28, 2025

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,207 260 Updated Jul 25, 2025
Jupyter Notebook 41 Updated May 16, 2025

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 2,278 216 Updated May 25, 2024

Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …

Python 4,094 464 Updated Dec 26, 2025

The official implementation of paper "PTCMIL: Multiple Instance Learning via Prompt Token Clustering for Whole Slide Image Analysis" accepted at MICCAI 2025

Python 5 Updated Nov 7, 2025

Toolkit for large-scale whole-slide image processing.

Python 455 99 Updated Nov 19, 2025