A simple, unified multimodal model training engine. Lean, flexible, and built for hacking at scale.

Python 680 26 Updated Dec 17, 2025

[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation modes. The dataset includes extensive contextual desc…

29 Updated Apr 16, 2025

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 319 8 Updated Oct 14, 2025

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World

Python 505 17 Updated Aug 9, 2024

Vision Language Models are Biased

Python 104 2 Updated Dec 12, 2025

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,755 454 Updated Aug 19, 2024
Python 1,732 77 Updated Dec 16, 2025

[NeurIPS '25] Evaluation code for the paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"

Python 32 Updated Oct 19, 2025

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 242 13 Updated Dec 5, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,762 103 Updated Nov 4, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 13,034 1,382 Updated Dec 18, 2025

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Python 86 2 Updated Nov 29, 2025

This is an official repo for fine-tuning SAM on custom medical images.

Python 268 43 Updated Oct 18, 2024

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 181 4 Updated Nov 21, 2025

SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process

Python 203 14 Updated Jan 21, 2024

[ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement

Jupyter Notebook 76 3 Updated Apr 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,199 7,782 Updated Dec 18, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,425 360 Updated Nov 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,609 2,852 Updated Dec 19, 2025

The world's simplest facial recognition API for Python and the command line

Python 55,917 13,713 Updated Aug 21, 2024

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

Python 393 13 Updated Feb 20, 2025

How to create a challenge on EvalAI?

Python 85 186 Updated Oct 3, 2025

DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference

Python 576 34 Updated Nov 24, 2025

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 634 24 Updated May 24, 2024

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,603 83 Updated Oct 29, 2025
Python 195 9 Updated Jul 12, 2024

Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think

Python 648 37 Updated Dec 19, 2025

[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark

Python 251 3 Updated Nov 5, 2025

A SOTA open-source image editing model that aims to deliver performance comparable to closed-source models such as GPT-4o and Gemini 2 Flash.

Python 2,062 88 Updated Dec 15, 2025

Open-source unified multimodal model

Python 5,477 480 Updated Oct 27, 2025