✨AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs
📝 Summary:
AlignBench is a new benchmark for fine-grained image-text alignment, using detailed synthetic image-caption pairs. It reveals that CLIP-based models struggle with compositional reasoning and shows detector self-preference.
🔹 Publication Date: Published on Nov 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20515
• PDF: https://arxiv.org/pdf/2511.20515
• Project Page: https://dahlian00.github.io/AlignBench/
• Github: https://dahlian00.github.io/AlignBench/
✨ Datasets citing this paper:
• https://huggingface.co/datasets/omron-sinicx/AlignBench
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#ImageTextAlignment #MultimodalAI #ComputerVision #Benchmarking #CLIPModels
📝 Summary:
AlignBench is a new benchmark for fine-grained image-text alignment, using detailed synthetic image-caption pairs. It reveals that CLIP-based models struggle with compositional reasoning and shows detector self-preference.
🔹 Publication Date: Published on Nov 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20515
• PDF: https://arxiv.org/pdf/2511.20515
• Project Page: https://dahlian00.github.io/AlignBench/
• Github: https://dahlian00.github.io/AlignBench/
✨ Datasets citing this paper:
• https://huggingface.co/datasets/omron-sinicx/AlignBench
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#ImageTextAlignment #MultimodalAI #ComputerVision #Benchmarking #CLIPModels