✨CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving
📝 Summary:
Visual mathematical problem solving remains challenging for multimodal large language models, prompting the development of CogFlow, a cognitive-inspired three-stage framework that enhances perception,...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01874
• PDF: https://arxiv.org/pdf/2601.01874
• Project Page: https://shchen233.github.io/cogflow/
• Github: https://shchen233.github.io/cogflow/
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨NitroGen: An Open Foundation Model for Generalist Gaming Agents
📝 Summary:
NitroGen is a vision-action foundation model trained on extensive gameplay data that demonstrates strong cross-game generalization and effective transfer learning capabilities.
🔹 Publication Date: Published on Jan 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02427
• PDF: https://arxiv.org/pdf/2601.02427
• Project Page: https://nitrogen.minedojo.org/
• Github: https://github.com/MineDojo/NitroGen
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework
📝 Summary:
A novel explainability-guided training framework for hate speech detection in Indic languages that combines large language models with attention-enhancing techniques and provides human-annotated ratio...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03194
• PDF: https://arxiv.org/pdf/2601.03194
• Github: https://github.com/ziarehman30/X-MuTeST
✨ Datasets citing this paper:
• https://huggingface.co/datasets/UVSKKR/X-MuTeST
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Parallel Latent Reasoning for Sequential Recommendation
📝 Summary:
The Parallel Latent Reasoning framework improves sequential recommendation by exploring multiple diverse reasoning trajectories simultaneously through learnable trigger tokens and adaptive aggregation.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03153
• PDF: https://arxiv.org/pdf/2601.03153
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨DreamStyle: A Unified Framework for Video Stylization
📝 Summary:
DreamStyle is a unified video stylization framework that supports multiple style conditions while addressing style inconsistency and temporal flicker through a specialized data curation pipeline and L...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02785
• PDF: https://arxiv.org/pdf/2601.02785
• Project Page: https://lemonsky1995.github.io/dreamstyle/
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨MiMo-V2-Flash Technical Report
📝 Summary:
MiMo-V2-Flash is a sparse Mixture-of-Experts model with a hybrid attention architecture and an efficient distillation technique that achieves strong performance with reduced parameters and improved inferen...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02780
• PDF: https://arxiv.org/pdf/2601.02780
• Project Page: https://mimo.xiaomi.com/blog/mimo-v2-flash
• Github: https://github.com/XiaomiMiMo/MiMo-V2-Flash
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks
📝 Summary:
WebGym presents a large-scale open-source environment for training visual web agents using reinforcement learning with high-throughput asynchronous sampling, achieving superior performance on unseen w...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02439
• PDF: https://arxiv.org/pdf/2601.02439
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
📝 Summary:
UniCorn is a self-improvement framework enhancing multimodal model generation. It uses self-play and cognitive reconstruction, without external data or supervision. UniCorn achieves state-of-the-art text-to-image generation.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03193
• PDF: https://arxiv.org/pdf/2601.03193
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization
📝 Summary:
The audio geo-localization benchmark AGL1K is introduced to advance audio-language models' geospatial reasoning capabilities through curated audio clips and evaluation across multiple models.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03227
• PDF: https://arxiv.org/pdf/2601.03227
• Github: https://github.com/Rising0321/AGL1K
✨ Spaces citing this paper:
• https://huggingface.co/spaces/RisingZhang/AudioGeoLoc
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SOP: A Scalable Online Post-Training System for Vision-Language-Action Models
📝 Summary:
SOP is a scalable online post-training system for VLA models that enables real-world robot policy adaptation. It uses a robot fleet to continuously learn from interaction, improving task proficiency while maintaining generality. SOP significantly boosts VLA model performance within hours.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03044
• PDF: https://arxiv.org/pdf/2601.03044
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence
📝 Summary:
SciEvalKit is an open-source toolkit for evaluating AI models in science. It assesses scientific intelligence across diverse domains and competencies using expert-grade benchmarks and a flexible pipeline. This provides a standardized platform for scientific AI evaluation.
🔹 Publication Date: Published on Dec 26, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.22334
• PDF: https://arxiv.org/pdf/2512.22334
==================================
#AIevaluation #ScientificAI #OpenSource #AIBenchmarks #AIResearch
✨Steerability of Instrumental-Convergence Tendencies in LLMs
📝 Summary:
This research investigates AI system steerability, noting a safety-security dilemma. It demonstrates that a short anti-instrumental prompt suffix dramatically reduces unwanted instrumental behaviors, like self-replication, in large language models. For Qwen3-30B, this reduced the convergence rate...
🔹 Publication Date: Published on Jan 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01584
• PDF: https://arxiv.org/pdf/2601.01584
• Github: https://github.com/j-hoscilowicz/instrumental_steering/
==================================
#AISafety #LLMs #AISteering #PromptEngineering #AIAlignment
✨OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs
📝 Summary:
OpenRT is an open-source framework that unifies and modularizes red-teaming for multimodal LLMs. It exposes significant safety gaps in frontier models, which fail to generalize across diverse attacks, showing attack success rates up to 49.14%.
🔹 Publication Date: Published on Jan 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01592
• PDF: https://arxiv.org/pdf/2601.01592
• Project Page: https://ai45lab.github.io/OpenRT/
• Github: https://github.com/AI45Lab/OpenRT
==================================
#RedTeaming #MultimodalLLMs #AISafety #LLMSecurity #AIResearch
✨AceFF: A State-of-the-Art Machine Learning Potential for Small Molecules
📝 Summary:
AceFF is a new machine learning potential for small molecule drug discovery. It offers DFT-level accuracy with high speed, supporting essential elements and charged states. Validation shows it is state-of-the-art for organic molecules.
🔹 Publication Date: Published on Jan 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.00581
• PDF: https://arxiv.org/pdf/2601.00581
• Github: https://github.com/torchmd/torchmd-net
🔹 Models citing this paper:
• https://huggingface.co/Acellera/AceFF-2.0
==================================
#MachineLearning #DrugDiscovery #ComputationalChemistry #AIforScience #SmallMolecules
✨Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training
📝 Summary:
Muses is a training-free method for generating fantasy 3D creatures. It leverages 3D skeletal structures and graph-constrained reasoning to coherently design, compose, and assemble diverse elements. This approach achieves state-of-the-art visual fidelity and alignment with text descriptions.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03256
• PDF: https://arxiv.org/pdf/2601.03256
• Github: https://github.com/luhexiao/Muses
==================================
#3DGeneration #GenerativeAI #ComputerGraphics #AIArt #TrainingFreeAI
✨U-Net-Like Spiking Neural Networks for Single Image Dehazing
📝 Summary:
DehazeSNN introduces a U-Net-like Spiking Neural Network with an Orthogonal Leaky-Integrate-and-Fire Block for efficient image dehazing. It achieves competitive performance with reduced computational resources and a smaller model size.
🔹 Publication Date: Published on Dec 30, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.23950
• PDF: https://arxiv.org/pdf/2512.23950
• Github: https://github.com/HaoranLiu507/DehazeSNN
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models
📝 Summary:
This paper presents a four-stage framework for AI in digital twins: modeling, mirroring, intervention, and autonomous management. It details how physics-informed AI and large language models empower proactive, self-improving digital twins, acknowledging key challenges.
🔹 Publication Date: Published on Jan 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01321
• PDF: https://arxiv.org/pdf/2601.01321
• Github: https://github.com/rongzhou7/Awesome-Digital-Twin-AI/tree/main
==================================
#DigitalTwin #AI #LLM #WorldModels #PhysicsInformedAI
✨Mechanistic Interpretability of Large-Scale Counting in LLMs through a System-2 Strategy
📝 Summary:
LLMs struggle with large-scale counting because of architectural limits. A System-2-inspired test-time strategy decomposes the task into smaller parts, achieving high accuracy. This approach involves latent count computation, dedicated attention, and aggregation, overcoming model limitations.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02989
• PDF: https://arxiv.org/pdf/2601.02989
==================================
#LLM #MechanisticInterpretability #System2Strategy #AIResearch #NLP
✨ExposeAnyone: Personalized Audio-to-Expression Diffusion Models Are Robust Zero-Shot Face Forgery Detectors
📝 Summary:
ExposeAnyone is a self-supervised diffusion model for deepfake detection that personalizes to subjects and uses reconstruction errors to measure identity distance. It significantly outperforms prior methods on unseen manipulations, including Sora2 videos, and is robust to real-world corruptions.
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02359
• PDF: https://arxiv.org/pdf/2601.02359
• Github: https://mapooon.github.io/ExposeAnyonePage/
✨ Datasets citing this paper:
• https://huggingface.co/datasets/mapooon/S2CFP
==================================
#DeepfakeDetection #DiffusionModels #ComputerVision #AITechnology #ForgeryDetection
✨Unified Thinker: A General Reasoning Modular Core for Image Generation
📝 Summary:
Unified Thinker introduces a modular reasoning core for image generation, decoupling a Thinker from the generator. It uses reinforcement learning to optimize visual correctness, substantially improving image reasoning and generation quality.
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03127
• PDF: https://arxiv.org/pdf/2601.03127
==================================
#ImageGeneration #AIResearch #ReinforcementLearning #DeepLearning #GenerativeAI