ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
🔹 Title: Paper2Web: Let's Make Your Paper Alive!

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15842
• PDF: https://arxiv.org/pdf/2510.15842
• Project Page: https://francischen3.github.io/P2W_Website/
• Github: https://github.com/YuhangChen1/Paper2All

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://t.me/DataScienceT
🔹 Title: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15742
• PDF: https://arxiv.org/pdf/2510.15742
• Project Page: https://ezioby.github.io/Ditto_page
• Github: https://github.com/EzioBy/Ditto

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/QingyanBai/Ditto-1M

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Latent Diffusion Model without Variational Autoencoder

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15301
• PDF: https://arxiv.org/pdf/2510.15301
• Project Page: https://howlin-wang.github.io/svg/
• Github: https://github.com/shiml20/SVG

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15232
• PDF: https://arxiv.org/pdf/2510.15232
• Github: https://github.com/HughieHu/FinTrust/

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/HughieHu/FinTrust

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: VISTA: A Test-Time Self-Improving Video Generation Agent

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15831
• PDF: https://arxiv.org/pdf/2510.15831
• Project Page: https://g-vista.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Robust Layerwise Scaling Rules by Proper Weight Decay Tuning

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15262
• PDF: https://arxiv.org/pdf/2510.15262

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15110
• PDF: https://arxiv.org/pdf/2510.15110

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Train a Unified Multimodal Data Quality Classifier with Synthetic Data

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15162
• PDF: https://arxiv.org/pdf/2510.15162

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15280
• PDF: https://arxiv.org/pdf/2510.15280

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15859
• PDF: https://arxiv.org/pdf/2510.15859

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.12838
• PDF: https://arxiv.org/pdf/2510.12838

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15624
• PDF: https://arxiv.org/pdf/2510.15624
• Project Page: https://freephdlabor.github.io/
• Github: https://github.com/ltjed/freephdlabor

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
• https://huggingface.co/spaces/edli/freephdlabor
==================================

🔹 Title: Language Models Model Language

🔹 Publication Date: Published on Oct 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.12766
• PDF: https://arxiv.org/pdf/2510.12766

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: ERGO: Entropy-guided Resetting for Generation Optimization in Multi-turn Language Models

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14077
• PDF: https://arxiv.org/pdf/2510.14077
• Project Page: https://ergopaper.github.io/ERGO/
• Github: https://github.com/haziq-exe/ERGO

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11288
• PDF: https://arxiv.org/pdf/2510.11288

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15444
• PDF: https://arxiv.org/pdf/2510.15444
• Project Page: https://zhouz.dev/RPC/
• Github: https://github.com/WNJXYK/RPC/

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/WNJXYK/MathOdyssey-Reasoning-Paths
• https://huggingface.co/datasets/WNJXYK/AIME_1983_2024-Reasoning-Paths
• https://huggingface.co/datasets/WNJXYK/OlympiadBench-Reasoning-Paths
• https://huggingface.co/datasets/WNJXYK/MATH-Reasoning-Paths

🔹 Spaces citing this paper:
• https://huggingface.co/spaces/WNJXYK/RPC
==================================

🔹 Title: DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.15264
• PDF: https://arxiv.org/pdf/2510.15264

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🤖🧠 NanoChat: The Best ChatGPT That $100 Can Buy

🗓️ 20 Oct 2025
📚 AI News & Trends

In a world dominated by billion-dollar AI models like GPT-4 and Claude 3, it’s refreshing to see a minimalist, open-source alternative that puts the power of Large Language Models (LLMs) back into the hands of hackers, researchers and enthusiasts. Enter NanoChat – an end-to-end, full-stack implementation of a ChatGPT-style AI chatbot developed by Andrej Karpathy, ...

#NanoChat #ChatGPT #AI #LargeLanguageModels #OpenSource #AndrejKarpathy
🤖🧠 PaddleOCR-VL: Redefining Multilingual Document Parsing with a 0.9B Vision-Language Model

🗓️ 20 Oct 2025
📚 AI News & Trends

In an era where information is predominantly digital, the ability to extract, interpret and organize data from documents is crucial. From invoices and research papers to multilingual contracts and handwritten notes, document parsing stands at the intersection of vision and language. Traditional Optical Character Recognition (OCR) systems have made impressive strides but they often fall ...

#PaddleOCR-VL #Multilingual #DocumentParsing #VisionLanguageModel #OCR #AI
🔹 Title: Do LLMs "Feel"? Emotion Circuits Discovery and Control

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11328
• PDF: https://arxiv.org/pdf/2510.11328
• Github: https://github.com/Aurora-cx/EmotionCircuits-LLM

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🤖🧠 Top 30 More Retro Bollywood Diwali Portrait Prompts for Women Using Gemini AI – Part 2

🗓️ 20 Oct 2025
📚 AI News & Trends

The Diwali celebrations continue and so does the nostalgia! After the huge buzz around our Top 20 Retro Bollywood Diwali Portrait Ideas, we’re back with Part 2 featuring prompts 21 to 50 curated to help you create even more magical, cinematic AI portraits using Google Gemini AI. If you loved the 90s-style Diwali aesthetics shimmering ...