ML Research Hub

🤖🧠 Skyvern: The Future of Browser Automation Powered by AI and Computer Vision

🗓️ 16 Nov 2025
📚 AI News & Trends

In today’s fast-evolving digital landscape, automation plays a crucial role in enhancing productivity, efficiency and innovation. Yet, traditional browser automation tools often struggle with complexity, maintenance and reliability. They rely heavily on DOM parsing, XPaths and rigid scripts that easily break when websites change their layout. Enter Skyvern, an open-source, AI-driven browser automation platform developed ...

#Skyvern #BrowserAutomation #AIDriven #ComputerVision #OpenSource #WebAutomation

❤‍🔥1❤1👍1

541 views13:44

📖 Read More

📣 BEST TELEGRAM CHANNELS

ML Research Hub

✨P1: Mastering Physics Olympiads with Reinforcement Learning

📝 Summary:
P1 is a family of open-source physics reasoning models trained via reinforcement learning. P1-235B-A22B achieved Gold-medal performance at IPhO 2025 and won 12 other competitions. These models also show strong generalizability on other reasoning tasks.

🔹 Publication Date: Published on Nov 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.13612
• PDF: https://arxiv.org/pdf/2511.13612
• Project Page: https://prime-rl.github.io/P1/
• Github: https://github.com/PRIME-RL/P1

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#ReinforcementLearning #Physics #AI #MachineLearning #OpenSource

261 views06:06

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Instella: Fully Open Language Models with Stellar Performance

📝 Summary:
Instella is a family of fully open language models trained on open data. It achieves state-of-the-art among fully open models and competes with leading open-weight LLMs. Specialized variants for long context and math reasoning are also offered.

🔹 Publication Date: Published on Nov 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.10628
• PDF: https://arxiv.org/pdf/2511.10628
• Github: https://github.com/AMD-AGI/Instella

🔹 Models citing this paper:
• https://huggingface.co/amd/AMD-OLMo
• https://huggingface.co/amd/Instella-3B-Instruct
• https://huggingface.co/amd/Instella-3B

✨ Datasets citing this paper:
• https://huggingface.co/datasets/amd/Instella-Long
• https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic

✨ Spaces citing this paper:
• https://huggingface.co/spaces/DexterSptizu/AMD-OLMo-1B
• https://huggingface.co/spaces/universeofml/DeepFocusTrain

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#LLMs #OpenSource #AI #MachineLearning #NLP

arXiv.org

Instella: Fully Open Language Models with Stellar Performance

Large language models (LLMs) have demonstrated remarkable performance across a wide range of tasks, yet the majority of high-performing models remain closed-source or partially open, limiting...

❤1

372 views11:08

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨OpenUS: A Fully Open-Source Foundation Model for Ultrasound Image Analysis via Self-Adaptive Masked Contrastive Learning

📝 Summary:
OpenUS is an open-source ultrasound foundation model built on a large public dataset. It uses a vision Mamba backbone and a novel self-adaptive masking framework to enhance pre-training, enabling label-efficient fine-tuning for various US tasks.

🔹 Publication Date: Published on Nov 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.11510
• PDF: https://arxiv.org/pdf/2511.11510
• Github: https://github.com/XZheng0427/OpenUS

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#OpenSource #FoundationModel #UltrasoundAI #MachineLearning #MedicalImaging

❤1

231 views22:10

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨HunyuanVideo 1.5 Technical Report

📝 Summary:
HunyuanVideo 1.5 is a lightweight, open-source video generation model achieving state-of-the-art visual quality and motion coherence. It employs an advanced DiT architecture with SSTA and an efficient video super-resolution network, enabling high-quality video creation on consumer GPUs.

🔹 Publication Date: Published on Nov 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.18870
• PDF: https://arxiv.org/pdf/2511.18870
• Github: https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#VideoGeneration #AI #DeepLearning #OpenSource #DiffusionModels

191 views05:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

📝 Summary:
GigaEvo is an open-source framework for LLM-guided evolutionary computation, providing modular tools for complex optimization. It enhances reproducibility of AlphaEvolve-inspired methods with detailed implementations, validated on challenging problems like Heilbronn triangle placement.

🔹 Publication Date: Published on Nov 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.17592
• PDF: https://arxiv.org/pdf/2511.17592
• Project Page: https://airi-institute.github.io/gigaevo-cover/
• Github: https://github.com/FusionBrainLab/gigaevo-core

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#LLM #EvolutionaryAlgorithms #Optimization #OpenSource #AI

264 views09:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications

📝 Summary:
SWE-SQL introduces BIRD-CRITIC, a new benchmark for SQL issue debugging, and Six-Gym, a training environment using f-Plan Boosting. Their open-source Bird-Fixer agent surpasses proprietary LLMs like GPT-4.1 in performance, democratizing advanced SQL-debugging capabilities.

🔹 Publication Date: Published on Jun 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.18951
• PDF: https://arxiv.org/pdf/2506.18951
• Project Page: https://bird-critic.github.io
• Github: https://github.com/bird-bench/BIRD-CRITIC-1

✨ Datasets citing this paper:
• https://huggingface.co/datasets/birdsql/bird-critic-1.0-flash-exp
• https://huggingface.co/datasets/birdsql/bird-critic-1.0-open
• https://huggingface.co/datasets/birdsql/bird-critic-1.0-postgresql

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#SQL #LLM #AI #Debugging #OpenSource

❤1

421 views09:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

📝 Summary:
SWE-Bench++ is an automated framework generating scalable, multilingual, repository-level coding tasks from live GitHub pull requests. It overcomes manual curation limits and static datasets, offering a benchmark to evaluate and improve code generation models across 11 languages.

🔹 Publication Date: Published on Dec 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.17419
• PDF: https://arxiv.org/pdf/2512.17419
• Project Page: https://research.turing.com/swebench
• Github: https://huggingface.co/papers?q=GitHub%20pull%20requests

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#SoftwareEngineering #CodeGeneration #AIBenchmarking #MachineLearning #OpenSource

❤1

207 views03:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems

📝 Summary:
Simulstream is an open-source toolkit for evaluating and demonstrating streaming speech-to-text translation. It supports long-form audio, incremental decoding, and re-translation, plus offers an interactive demo interface.

🔹 Publication Date: Published on Dec 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.17648
• PDF: https://arxiv.org/pdf/2512.17648
• Project Page: https://pypi.org/project/simulstream/

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#SpeechToText #MachineTranslation #NLP #OpenSource #StreamingAI

❤1

374 views11:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨NVIDIA Nemotron 3: Efficient and Open Intelligence

📝 Summary:
NVIDIA introduces Nemotron 3, a family of models with strong agentic, reasoning, and conversational capabilities. They feature a hybrid Mamba-Transformer MoE architecture for high throughput and long context, plus advanced post-training for tool use. The models will be openly released.

🔹 Publication Date: Published on Dec 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20856
• PDF: https://arxiv.org/pdf/2512.20856

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#AI #LLM #DeepLearning #NVIDIA #OpenSource

188 views03:01

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

📝 Summary:
SciEvalKit is an open-source toolkit for evaluating AI models in science. It assesses scientific intelligence across diverse domains and competencies using expert-grade benchmarks and a flexible pipeline. This provides a standardized platform for scientific AI evaluation.

🔹 Publication Date: Published on Dec 26, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.22334
• PDF: https://arxiv.org/pdf/2512.22334

==================================

For more data science resources:
✓ https://t.me/DataScienceT

#AIevaluation #ScientificAI #OpenSource #AIBenchmarks #AIResearch

226 views08:03

✨ Explore Data Science 📝 Write your paper

About

Blog

Apps

Platform