multimodal-benchmark

Here are 3 public repositories matching this topic...

qcri / Almieyar-Oryx-BloomBench

This paper introduces BloomBench, a new bilingual (English-Arabic), cognitively-informed benchmark based on Bloom's Taxonomy to systematically evaluate the reasoning abilities of Vision-Language Models across different hierarchical cognitive levels.

visual-reasoning visual-question-answering vision-language-models multimodal-benchmark cognitive-evaluation bloom-s-taxonomy

Updated Jun 7, 2026
Python

silvia20177 / TIR-Bench

Star

TIR-Bench: Multi-modal image reasoning benchmark interpretation repository, includes dataset introduction, paper parsing, evaluation pipeline and VLM test results for vision-language model benchmark research.

paper-analysis image-reasoning multimodal-benchmark