ML Research Hub
32.8K subscribers
4.26K photos
257 videos
23 files
4.6K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
πŸ€–πŸ§  Context Engineering 2.0: Redefining Human–Machine Understanding

πŸ—“οΈ 16 Nov 2025
πŸ“š AI News & Trends

As artificial intelligence advances, machines are becoming increasingly capable of understanding and responding to human language. Yet, one crucial challenge remains how can machines truly understand the context behind human intentions? This question forms the foundation of context engineering, a discipline that focuses on designing, organizing and managing contextual information so that AI systems can ...

#ContextEngineering #AIEducation #HumanMachineUnderstanding #AIContext #NaturalLanguageProcessing #AIModels
✨EmoVid: A Multimodal Emotion Video Dataset for Emotion-Centric Video Understanding and Generation

πŸ“ Summary:
EmoVid is a new multimodal, emotion-annotated video dataset designed for creative media like cartoons and movies. It bridges emotion understanding with video generation, significantly improving emotional expression and quality in generated videos. EmoVid establishes a new benchmark for affective ...

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11002
β€’ PDF: https://arxiv.org/pdf/2511.11002

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#EmoVid #MultimodalAI #EmotionAI #VideoGeneration #VideoUnderstanding
✨Virtual Width Networks

πŸ“ Summary:
Virtual Width Networks VWN enhance model efficiency by expanding representational width without increasing computational cost. VWN accelerates optimization and improves loss reduction, showing a log-linear scaling relation between virtual width and loss.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11238
β€’ PDF: https://arxiv.org/pdf/2511.11238

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#NeuralNetworks #DeepLearning #ModelEfficiency #MachineLearning #AI
This media is not supported in your browser
VIEW IN TELEGRAM
✨GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

πŸ“ Summary:
GGBench is a new benchmark for evaluating geometric generative reasoning in unified multimodal models. It addresses a critical gap by assessing integrated cognitive processes, requiring language comprehension and precise visual generation to actively construct solutions. This sets a rigorous stan...

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11134
β€’ PDF: https://arxiv.org/pdf/2511.11134

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#GGBench #MultimodalAI #GeometricReasoning #GenerativeAI #AIResearch
✨DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains

πŸ“ Summary:
A new benchmark, DiscoX, and evaluation system, Metric-S, are introduced for discourse-level, expert Chinese-English translation. Findings show advanced LLMs still fall short of human performance, underscoring challenges in professional machine translation.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.10984
β€’ PDF: https://arxiv.org/pdf/2511.10984

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#MachineTranslation #NLP #LLM #Benchmarking #AI
✨CATS-V2V: A Real-World Vehicle-to-Vehicle Cooperative Perception Dataset with Complex Adverse Traffic Scenarios

πŸ“ Summary:
CATS-V2V is a new real-world dataset for V2V cooperative perception, focusing on complex adverse traffic scenarios. It provides extensive synchronized sensor data, including LiDAR and cameras, from two vehicles across diverse conditions. This dataset supports autonomous driving research.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11168
β€’ PDF: https://arxiv.org/pdf/2511.11168

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#V2V #AutonomousDriving #CooperativePerception #Dataset #ADAS
✨UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

πŸ“ Summary:
UI2Code^N is a visual language model trained for interactive UI-to-code generation, editing, and polishing. It uses multi-turn feedback to achieve state-of-the-art performance among open-source models, comparable to leading closed-source solutions.

πŸ”Ή Publication Date: Published on Nov 11

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.08195
β€’ PDF: https://arxiv.org/pdf/2511.08195
β€’ Project Page: https://zheny2751-dotcom.github.io/ui2code-n.github.io/
β€’ Github: https://zheny2751-dotcom.github.io/ui2code-n.github.io/

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/zai-org/UI2Code_N

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/zai-org/UI2Code_N-demo-case

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#UI2Code #VisualLanguageModels #CodeGeneration #AI #SoftwareEngineering
✨MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

πŸ“ Summary:
MarsRL enhances multi-agent reasoning systems by jointly optimizing all agents through reinforcement learning and agentic pipeline parallelism. This novel approach significantly boosts open-source LLM accuracy on complex tasks, even outperforming larger models on benchmarks like AIME2025.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11373
β€’ PDF: https://arxiv.org/pdf/2511.11373
β€’ Github: https://github.com/liushulinle/MarsRL

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/forestliutc/MarsRL

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#ReinforcementLearning #MultiAgentSystems #LLM #AIResearch #MachineLearning
✨AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

πŸ“ Summary:
AIonopedia is an LLM agent that orchestrates multimodal learning for Ionic Liquid discovery. It enables accurate property predictions and molecular design through hierarchical search, validated by real-world wet-lab experiments, significantly accelerating IL discovery.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11257
β€’ PDF: https://arxiv.org/pdf/2511.11257

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#LLMAgents #IonicLiquids #MultimodalLearning #MaterialsScience #AIforScience
❀1
✨SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards

πŸ“ Summary:
SpatialThinker is a new 3D-aware MLLM that uses RL and dense spatial rewards to significantly improve spatial understanding. It integrates structured spatial grounding and multi-step reasoning, outperforming existing models and GPT-4o on spatial VQA and real-world benchmarks.

πŸ”Ή Publication Date: Published on Nov 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.07403
β€’ PDF: https://arxiv.org/pdf/2511.07403
β€’ Github: https://github.com/hunarbatra/SpatialThinker

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/OX-PIXL/SpatialThinker-3B
β€’ https://huggingface.co/OX-PIXL/SpatialThinker-7B

✨ Datasets citing this paper:
β€’ https://huggingface.co/datasets/OX-PIXL/STVQA-7K

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#MultimodalLLM #3DReasoning #ReinforcementLearning #AIResearch #ComputerVision
✨DoPE: Denoising Rotary Position Embedding

πŸ“ Summary:
DoPE improves Transformer length generalization by detecting and mitigating noisy frequency bands in positional embeddings. This training-free method enhances retrieval accuracy and reasoning stability across extended contexts up to 64K tokens.

πŸ”Ή Publication Date: Published on Nov 12

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.09146
β€’ PDF: https://arxiv.org/pdf/2511.09146
β€’ Project Page: https://The-physical-picture-of-LLMs.github.io

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#Transformers #PositionalEmbedding #LLMs #DeepLearning #AIResearch
✨LiteAttention: A Temporal Sparse Attention for Diffusion Transformers

πŸ“ Summary:
LiteAttention accelerates video generation by exploiting temporal coherence in diffusion attention. It propagates skip decisions for non-essential attention tiles across denoising steps, eliminating redundant computations. This achieves substantial speedups without quality loss.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11062
β€’ PDF: https://arxiv.org/pdf/2511.11062
β€’ Github: https://github.com/moonmath-ai/LiteAttention

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#DiffusionModels #VideoGeneration #Transformers #SparseAttention #ComputationalEfficiency
✨Large Language Models for Scientific Idea Generation: A Creativity-Centered Survey

πŸ“ Summary:
This survey examines methods for using large language models to generate scientific ideas, categorizing them into five families and aligning them with creativity frameworks to improve scientific sound...

πŸ”Ή Publication Date: Published on Nov 5

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.07448
β€’ PDF: https://arxiv.org/pdf/2511.07448

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Qwen3 Technical Report

πŸ“ Summary:
Qwen3 is a new series of large language models integrating thinking and non-thinking modes for unified performance and efficiency. It achieves state-of-the-art results across diverse tasks and expands multilingual support to 119 languages.

πŸ”Ή Publication Date: Published on May 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxivexplained.com/papers/qwen3-technical-report
β€’ PDF: https://arxiv.org/pdf/2505.09388
β€’ Project Page: https://qwenlm.github.io/blog/qwen3/
β€’ Github: https://github.com/QwenLM/Qwen3

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct
β€’ https://huggingface.co/Qwen/Qwen3-235B-A22B
β€’ https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Instruct

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/modelscope/DocResearch
β€’ https://huggingface.co/spaces/enzostvs/deepsite
β€’ https://huggingface.co/spaces/multimodalart/Eigen-Banana

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#LLM #AI #MultilingualAI #NLP #Qwen3
✨WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

πŸ“ Summary:
WEAVE introduces a suite with a large dataset and benchmark to assess multi-turn context-dependent image generation and editing in multimodal models. It enables new capabilities like visual memory in models while exposing current limitations in these complex tasks.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11434
β€’ PDF: https://arxiv.org/pdf/2511.11434
β€’ Project Page: https://weichow23.github.io/weave/
β€’ Github: https://github.com/weichow23/weave

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#MultimodalAI #ImageGeneration #GenerativeAI #ComputerVision #AIResearch
✨HI-TransPA: Hearing Impairments Translation Personal Assistant

πŸ“ Summary:
HI-TransPA, an instruction-driven audio-visual personal assistant, uses Omni-Model paradigm to translate and dialogue by fusing speech with lip dynamics, achieving state-of-the-art performance in assi...

πŸ”Ή Publication Date: Published on Nov 13

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.09915
β€’ PDF: https://arxiv.org/pdf/2511.09915

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Simulating the Visual World with Artificial Intelligence: A Roadmap

πŸ“ Summary:
Video generation is evolving towards foundation models that integrate world simulation and rendering to produce physically plausible and interactive videos. AI-generated summary The landscape of video...

πŸ”Ή Publication Date: Published on Nov 11

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.08585
β€’ PDF: https://arxiv.org/pdf/2511.08585
β€’ Github: https://github.com/ziqihuangg/Awesome-From-Video-Generation-to-World-Model

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Workload Schedulers -- Genesis, Algorithms and Differences

πŸ“ Summary:
This paper categorizes modern workload schedulers into three classes: OS, Cluster, and Big Data. It details their evolution, algorithms, and differences. The conclusion highlights similarities in scheduling strategy design across both local and distributed systems.

πŸ”Ή Publication Date: Published on Nov 13

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.10258
β€’ PDF: https://arxiv.org/pdf/2511.10258

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#WorkloadScheduling #OperatingSystems #DistributedComputing #SchedulingAlgorithms #ComputerScience
✨MediaPipe: A Framework for Building Perception Pipelines

πŸ“ Summary:
MediaPipe is a framework for building perception applications, offering tools to combine components, prototype, and measure performance across platforms. It helps developers iteratively improve AI models with reproducible results.

πŸ”Ή Publication Date: Published on Jun 14, 2019

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/1906.08172
β€’ PDF: https://arxiv.org/pdf/1906.08172
β€’ Github: https://github.com/google-ai-edge/mediapipe

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Building the Web for Agents: A Declarative Framework for Agent-Web Interaction

πŸ“ Summary:
VOIX is a web framework using declarative HTML tags like tool and context for websites to explicitly define AI agent capabilities. This enables reliable, privacy-preserving, and secure agent interaction with human-oriented interfaces, fostering the Agentic Web.

πŸ”Ή Publication Date: Published on Nov 14

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2511.11287
β€’ PDF: https://arxiv.org/pdf/2511.11287

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AgenticWeb #AIAgents #WebFramework #DeclarativeAI #FutureofWeb