Deep Learning

Relatively new Python tools:
https://docs.astral.sh/uv/ uv, an extremely fast Python package and project manager, written in Rust
https://docs.marimo.io/ marimo is a reactive Python notebook
#Tools

38 views · Vadim, 04:32

Deep Learning

https://github.com/parthsarthi03/raptor #Frameworks

GitHub

GitHub - parthsarthi03/raptor: The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

38 views · ARТ̈EM Zaborskiy, edited 08:12

Deep Learning

https://github.com/illuin-tech/colpali
Efficient Document Retrieval with Vision Language Models #Frameworks #Models

GitHub

GitHub - illuin-tech/colpali: The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

40 views · Vadim, 14:56

Deep Learning

https://arxiv.org/abs/2506.01928 #Paper
Esoteric Language Models ;)
A new family of models that fuses the autoregressive and masked diffusion model paradigms

arXiv.org

Esoteric Language Models

Diffusion-based language models offer a compelling alternative to autoregressive (AR) models by enabling parallel and controllable generation. Among this family of models, Masked Diffusion Models...

38 views · Vadim, edited 03:09

Deep Learning

https://arxiv.org/abs/2503.19108
A plain ViT architecture, without a decoder, that performs fast image segmentation #Paper #Frameworks

arXiv.org

Your ViT is Secretly an Image Segmentation Model

Vision Transformers (ViTs) have shown remarkable performance and scalability across various computer vision tasks. To apply single-scale ViTs to image segmentation, existing methods adopt a...

47 views · Vadim, 21:10

Deep Learning

https://arxiv.org/abs/2412.16334 open-vocabulary segmentation! #Paper #Frameworks

arXiv.org

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level...

Self-supervised visual foundation models produce powerful embeddings that achieve remarkable performance on a wide range of downstream tasks. However, unlike vision-language models such as CLIP,...

37 views · Vadim, 08:38

Deep Learning

https://arxiv.org/abs/2411.04983
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning #Frameworks #Paper

arXiv.org

The ability to predict future outcomes given control actions is fundamental for physical reasoning. However, such predictive models, often called world models, remain challenging to learn and are...

40 views · Vadim, 20:17

Deep Learning

Achieving 10,000x training data reduction with high-fidelity labels https://share.google/PXeW6ut6dkPw4M0zw

#paper

research.google

41 views · Vadim, 10:06

Deep Learning

https://arxiv.org/abs/2508.10104 DINOv3 is out! #Paper #models

arXiv.org

DINOv3

Self-supervised learning holds the promise of eliminating the need for manual data annotation, enabling models to scale effortlessly to massive datasets and larger architectures. By not being...

41 views · Vadim, 11:48

Deep Learning

https://xl0.github.io/lovely-tensors/ Lovely Tensors makes tensors easier to work with by printing them as human-readable summaries.
https://github.com/google/mediapy mediapy makes it easy to display images and videos in Jupyter notebooks.

#library

lovely-tensors

❤️ Lovely Tensors – lovely-tensors

After all, you are only human.

38 views · Vadim, 03:55

Deep Learning

[2509.13351] Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning https://share.google/Po2YXN8rOVrhNXMvz

arXiv.org

Large language models (LLMs) have demonstrated impressive capabilities across diverse tasks, yet their ability to perform structured symbolic planning remains limited, particularly in domains...

23 views · Vadim, 09:29

Deep Learning

https://moondream.ai/blog/moondream-3-preview A small vision language model (VLM) designed for edge and on-device use. #Models

Moondream

A fast & powerful vision model that rocks.

18 views · Vadim, edited 02:06

Deep Learning

https://www.perceptron.inc/blog/introducing-isaac-0-1 Another vision language model (VLM) with similar properties #Models

marketing.perceptron.inc

A layer of intelligence for the physical world.
We are a research company building the future of Physical AGI.

19 views · Vadim, 02:11

Deep Learning

https://arxiv.org/abs/2510.05949v1 JEPA architectures such as DINOv3 can be effectively used for data curation, outlier detection and similar tasks. #Paper

arXiv.org

Gaussian Embeddings: How JEPAs Secretly Learn Your Data Density

Joint Embedding Predictive Architectures (JEPAs) learn representations able to solve numerous downstream tasks out-of-the-box. JEPAs combine two objectives: (i) a latent-space prediction term,...

23 views · Vadim, 19:53

Deep Learning

https://github.com/microsoft/markitdown
Converts all major document formats to Markdown and can also run as an MCP server.

GitHub

GitHub - microsoft/markitdown: Python tool for converting files and office documents to Markdown.

9 views · Vadim, 19:40
