Media is too big
VIEW IN TELEGRAM
✨ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction
📝 Summary:
ENACT is a benchmark evaluating embodied cognition in vision-language models through egocentric world modeling tasks. It reveals a performance gap between VLMs and humans that widens with interaction, and models exhibit anthropocentric biases.
🔹 Publication Date: Published on Nov 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20937
• PDF: https://arxiv.org/pdf/2511.20937
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#EmbodiedCognition #VisionLanguageModels #AIResearch #WorldModeling #CognitiveScience
📝 Summary:
ENACT is a benchmark evaluating embodied cognition in vision-language models through egocentric world modeling tasks. It reveals a performance gap between VLMs and humans that widens with interaction, and models exhibit anthropocentric biases.
🔹 Publication Date: Published on Nov 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20937
• PDF: https://arxiv.org/pdf/2511.20937
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#EmbodiedCognition #VisionLanguageModels #AIResearch #WorldModeling #CognitiveScience
❤1