ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
🔹 Title: Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends

🔹 Publication Date: Published on Aug 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11548
• PDF: https://arxiv.org/pdf/2508.11548
• GitHub: https://xuzhenhua55.github.io/awesome-llm-copyright-protection/index.html

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://t.me/DataScienceT
🔹 Title: CorrSteer: Steering Improves Task Performance and Safety in LLMs through Correlation-based Sparse Autoencoder Feature Selection

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12535
• PDF: https://arxiv.org/pdf/2508.12535

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research

🔹 Publication Date: Published on Aug 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04326
• PDF: https://arxiv.org/pdf/2508.04326
• Project Page: https://mediated-reality.github.io/rf4xr/papers/li_tvcg25/
• GitHub: https://github.com/mediated-reality/awesome-rf4xr

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents

🔹 Publication Date: Published on Aug 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04038
• PDF: https://arxiv.org/pdf/2508.04038
• GitHub: https://github.com/zechenli03/ZARA

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge

🔹 Publication Date: Published on Aug 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.08777
• PDF: https://arxiv.org/pdf/2508.08777

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11032
• PDF: https://arxiv.org/pdf/2508.11032
• GitHub: https://github.com/podismine/MedSAMix

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Semantic IDs for Joint Generative Search and Recommendation

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.10478
• PDF: https://arxiv.org/pdf/2508.10478

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations

🔹 Publication Date: Published on Aug 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.09789
• PDF: https://arxiv.org/pdf/2508.09789

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/marcodena/video-recs-describe-what-you-see

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

🔹 Publication Date: Published on Aug 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13998
• PDF: https://arxiv.org/pdf/2508.13998

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Beyond Human Judgment: A Bayesian Evaluation of LLMs' Moral Values Understanding

🔹 Publication Date: Published on Aug 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13804
• PDF: https://arxiv.org/pdf/2508.13804
• Project Page: https://maciejskorski.github.io/moral-foundations-llm-eval
• GitHub: https://github.com/maciejskorski/moral-foundations-llm-eval

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation

🔹 Publication Date: Published on Aug 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12040
• PDF: https://arxiv.org/pdf/2508.12040

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12903
• PDF: https://arxiv.org/pdf/2508.12903

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13186
• PDF: https://arxiv.org/pdf/2508.13186
• GitHub: https://github.com/MMBrowseComp/MM-BrowseComp

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: CAMAR: Continuous Actions Multi-Agent Routing

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12845
• PDF: https://arxiv.org/pdf/2508.12845

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12800
• PDF: https://arxiv.org/pdf/2508.12800

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

🔹 Publication Date: Published on Aug 1

🔹 AI-generated summary: Foundation-Sec-8B-Instruct is a cybersecurity-focused LLM designed for chat-style interactions and instruction-following, outperforming other models in cybersecurity tasks while matching their instruction-following capabilities.

🔹 Abstract: Large language models (LLMs) have shown remarkable success across many domains, yet their integration into cybersecurity applications remains limited due to a lack of general-purpose cybersecurity data, representational complexity, and safety and regulatory concerns. To address this gap, we previously introduced Foundation-Sec-8B, a cybersecurity-focused LLM suitable for fine-tuning on downstream tasks. That model, however, was not designed for chat-style interactions or instruction-following. In this report, we release Foundation-Sec-8B-Instruct: a model specifically trained for general-purpose cybersecurity dialogue. Built on Foundation-Sec-8B, it combines domain-specific knowledge with instruction-following, conversational capabilities, and alignment with human preferences to produce high-quality, relevant responses. Comprehensive evaluations show that Foundation-Sec-8B-Instruct outperforms Llama 3.1-8B-Instruct on a range of cybersecurity tasks while matching its instruction-following performance. It is also competitive with GPT-4o-mini on cyber threat intelligence and instruction-following tasks. We envision Foundation-Sec-8B-Instruct becoming an indispensable assistant in the daily workflows of cybersecurity professionals. We release the model publicly at https://huggingface.co/fdtn-ai/Foundation-Sec-8B-Instruct.

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.01059
• PDF: https://arxiv.org/pdf/2508.01059

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Rapidly Adapting to New Voice Spoofing: Few-Shot Detection of Synthesized Speech Under Distribution Shifts

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13320
• PDF: https://arxiv.org/pdf/2508.13320

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Retrieval-augmented reasoning with lean language models

🔹 Publication Date: Published on Aug 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11386
• PDF: https://arxiv.org/pdf/2508.11386

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: StrandDesigner: Towards Practical Strand Generation with Sketch Guidance

🔹 Publication Date: Published on Aug 3

🔹 AI-generated summary: A sketch-based strand generation model using a learnable upsampling strategy and multi-scale adaptive conditioning mechanism outperforms existing methods in realism and precision for hair strand generation.

🔹 Abstract: Realistic hair strand generation is crucial for applications like computer graphics and virtual reality. While diffusion models can generate hairstyles from text or images, these inputs lack precision and user-friendliness. Instead, we propose the first sketch-based strand generation model, which offers finer control while remaining user-friendly. Our framework tackles key challenges, such as modeling complex strand interactions and diverse sketch patterns, through two main innovations: a learnable strand upsampling strategy that encodes 3D strands into multi-scale latent spaces, and a multi-scale adaptive conditioning mechanism using a transformer with diffusion heads to ensure consistency across granularity levels. Experiments on several benchmark datasets show our method outperforms existing approaches in realism and precision. Qualitative results further confirm its effectiveness. Code will be released at https://github.com/fighting-Zhang/StrandDesigner.

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.01650
• PDF: https://arxiv.org/pdf/2508.01650
• GitHub: https://github.com/fighting-Zhang/StrandDesigner

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

🔹 Publication Date: Published on Aug 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11987
• PDF: https://arxiv.org/pdf/2508.11987
• Project Page: https://futurex-ai.github.io/

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/futurex-ai/Futurex-Online
• https://huggingface.co/datasets/futurex-ai/Futurex-Past

🔹 Spaces citing this paper:
No spaces found
==================================
