✨Can We Predict Before Executing Machine Learning Agents?
📝 Summary:
Autonomous machine learning agents overcome execution bottlenecks by predicting outcomes before physical execution, achieving faster convergence and improved performance through a predict-then-verify ...
🔹 Publication Date: Published on Jan 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05930
• PDF: https://arxiv.org/pdf/2601.05930
• Github: https://github.com/zjunlp/predict-before-execute
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency
📝 Summary:
Large language models exhibit brittle beliefs under contextual perturbations, which are better measured by structural consistency metrics and addressed through structure-aware training methods. AI-gen...
🔹 Publication Date: Published on Jan 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05905
• PDF: https://arxiv.org/pdf/2601.05905
• Github: https://github.com/zjunlp/belief
==================================
✨Orient Anything V2: Unifying Orientation and Rotation Understanding
📝 Summary:
Orient Anything V2 enhances 3D orientation understanding through scalable 3D asset synthesis, symmetry-aware periodic distribution fitting, and multi-frame relative rotation prediction, achieving stat...
🔹 Publication Date: Published on Jan 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05573
• PDF: https://arxiv.org/pdf/2601.05573
• Project Page: https://orient-anythingv2.github.io/
• Github: https://github.com/SpatialVision/Orient-Anything-V2
🔹 Models citing this paper:
• https://huggingface.co/Viglong/OriAnyV2_ckpt
🔹 Datasets citing this paper:
• https://huggingface.co/datasets/Viglong/OriAnyV2_Train_Render
🔹 Spaces citing this paper:
• https://huggingface.co/spaces/Viglong/Orient-Anything-V2
==================================
✨SmartSearch: Process Reward-Guided Query Refinement for Search Agents
📝 Summary:
SmartSearch enhances LLM-based search agents through process rewards and query refinement mechanisms that improve intermediate search query quality via a three-stage curriculum learning approach. AI-g...
🔹 Publication Date: Published on Jan 8
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04888
• PDF: https://arxiv.org/pdf/2601.04888
• Github: https://github.com/MYVAE/SmartSearch?tab=readme-ov-file
==================================
✨Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs
📝 Summary:
Multimodal auto-completion leverages visual and textual context to improve real-time prediction accuracy in conversational interfaces, with a router framework enabling efficient model selection based ...
🔹 Publication Date: Published on Jan 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05851
• PDF: https://arxiv.org/pdf/2601.05851
==================================
✨AgentOCR: Reimagining Agent History via Optical Self-Compression
📝 Summary:
AgentOCR reimagines agent history as visual tokens to reduce token consumption and memory in agentic systems. It leverages optical caching and adaptive self-compression. This framework maintains strong performance while significantly cutting token usage and boosting efficiency.
🔹 Publication Date: Published on Jan 8
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04786
• PDF: https://arxiv.org/pdf/2601.04786
==================================
✨MMFormalizer: Multimodal Autoformalization in the Wild
📝 Summary:
MMFormalizer enables multimodal autoformalization by integrating visual perception with formal mathematical reasoning, supporting complex physical domains from classical mechanics to quantum mechanics...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03017
• PDF: https://arxiv.org/pdf/2601.03017
• Project Page: https://mmformalizer.github.io/
==================================
✨Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
📝 Summary:
The Qwen3-VL-Embedding and Qwen3-VL-Reranker models form an end-to-end multimodal search pipeline, leveraging multi-stage training and cross-attention mechanisms to achieve high-precision retrieval ac...
🔹 Publication Date: Published on Jan 8
🔹 Paper Links:
• arXiv Page: https://www.arxiv.org/abs/2601.04720
• PDF: https://arxiv.org/pdf/2601.04720
• Github: https://github.com/QwenLM/Qwen3-VL-Embedding
==================================
✨AnyDepth: Depth Estimation Made Easy
📝 Summary:
A lightweight monocular depth estimation framework uses DINOv3 as visual encoder and a compact transformer decoder to achieve higher accuracy with reduced computational overhead and improved data qual...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02760
• PDF: https://arxiv.org/pdf/2601.02760
• Project Page: https://aigeeksgroup.github.io/AnyDepth
• Github: https://aigeeksgroup.github.io/AnyDepth
==================================
✨CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature
📝 Summary:
CaricatureGS introduces a 3D caricaturization framework combining Gaussian curvature-based exaggeration with 3D Gaussian Splatting for photorealistic, controllable face avatars. It uses a unique training scheme with synthesized supervision to achieve high fidelity, real-time deformation, and cont...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03319
• PDF: https://arxiv.org/pdf/2601.03319
• Project Page: https://c4ricaturegs.github.io/
• Github: https://c4ricaturegs.github.io/
==================================