ML Research Hub
32.8K subscribers
4.36K photos
267 videos
23 files
4.71K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
πŸ€–πŸ§  Sora: OpenAI’s Breakthrough Text-to-Video Model Transforming Visual Creativity

πŸ—“οΈ 18 Oct 2025
πŸ“š AI News & Trends

Introduction Artificial Intelligence (AI) is rapidly transforming the creative world. From generating realistic images to composing music and writing code, AI has redefined how humans interact with technology. But one of the most revolutionary advancements in this domain is Sora, OpenAI’s text-to-video generative model that converts written prompts into hyper-realistic video clips. Ithas captured global ...

#Sora #OpenAI #TextToVideo #AI #VisualCreativity #GenerativeModel
❀3❀‍πŸ”₯1
✨LongCat-Video Technical Report

πŸ“ Summary:
LongCat-Video is a 13.6B Diffusion Transformer model excelling in efficient, high-quality long video generation. It uses a unified architecture for tasks like Text-to-Video and coarse-to-fine generation for efficiency. This model is a significant step toward developing world models.

πŸ”Ή Publication Date: Published on Oct 25

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2510.22200
β€’ PDF: https://arxiv.org/pdf/2510.22200
β€’ Github: https://github.com/meituan-longcat/LongCat-Video

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/meituan-longcat/LongCat-Video

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/multimodalart/LongCat-Video
β€’ https://huggingface.co/spaces/rahul7star/LongCat-Video
β€’ https://huggingface.co/spaces/armaishere/meituan-longcat-LongCat-Video

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoGeneration #DiffusionModels #Transformers #AI #TextToVideo
✨EasyV2V: A High-quality Instruction-based Video Editing Framework

πŸ“ Summary:
EasyV2V is a framework for instruction-based video editing that combines diverse data sources, leverages pretrained text-to-video models with LoRA fine-tuning, and uses unified spatiotemporal control. This innovative approach achieves state-of-the-art results in video editing.

πŸ”Ή Publication Date: Published on Dec 18

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.16920
β€’ PDF: https://arxiv.org/pdf/2512.16920
β€’ Github: https://snap-research.github.io/easyv2v/

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoEditing #AI #DeepLearning #ComputerVision #TextToVideo
❀2