SOTA in unsupervised semantic segmentation:
1. STEGO: Unsupervised Semantic Segmentation by Distilling Feature Correspondences - 2022 https://arxiv.org/abs/2203.08414
2. HP: Leveraging Hidden Positives for Unsupervised Semantic Segmentation - 2023 https://arxiv.org/abs/2303.15014
3. CAUSE: Causal Unsupervised Semantic Segmentation - 2023 https://arxiv.org/abs/2310.07379
#Paper
https://arxiv.org/pdf/2408.04840v1
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
#Paper
https://encord.com/blog/dimentionality-reduction-techniques-machine-learning/
Dimensionality reduction techniques in one place #FYI #Tips
NVIDIA: traditional machine learning on the GPU (clustering, UMAP, t-SNE, PCA, etc.); a usage sketch follows the links. #FYI #library
https://github.com/rapidsai/cuml
https://docs.rapids.ai/api/cuml/stable/
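A minimal usage sketch, assuming a CUDA-capable GPU and a RAPIDS/cuML install; the API deliberately mirrors scikit-learn, and the data here is synthetic:

```python
# Minimal sketch: cuML mirrors the scikit-learn API but runs on the GPU.
import numpy as np
from cuml import PCA, UMAP
from cuml.cluster import KMeans

X = np.random.rand(10_000, 128).astype(np.float32)  # stand-in feature matrix

X_pca = PCA(n_components=50).fit_transform(X)        # GPU PCA
X_2d = UMAP(n_components=2).fit_transform(X_pca)     # GPU UMAP embedding
labels = KMeans(n_clusters=10).fit_predict(X_2d)     # GPU k-means clustering
```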
Numba is an open source JIT compiler that translates a subset of Python and NumPy code into fast machine code.
https://numba.pydata.org/ #Frameworks #library
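A tiny hedged example of what the JIT buys you; `pairwise_l2` is an illustrative function, not part of Numba:

```python
import numpy as np
from numba import njit

@njit  # compiles the NumPy-style loops below to machine code on first call
def pairwise_l2(X):
    n, d = X.shape
    out = np.empty((n, n), dtype=np.float64)
    for i in range(n):
        for j in range(n):
            s = 0.0
            for k in range(d):
                diff = X[i, k] - X[j, k]
                s += diff * diff
            out[i, j] = np.sqrt(s)
    return out

X = np.random.rand(500, 16)
D = pairwise_l2(X)  # first call triggers compilation; later calls run at native speed
```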
DINO/DINOv2 explained: self-distillation with no labels, and more; a short sketch of the core DINO objective follows the links below. #FYI #Tips #Explained #Tutorial
1. https://medium.com/@anuj.dutt9/emerging-properties-in-self-supervised-vision-transformers-dino-paper-summary-4c7a6ed68161 Original Dino
2. https://encord.com/blog/dinov2-self-supervised-learning-explained/
3. https://www.picsellia.com/post/dinov2-steps-by-steps-explanations-picsellia
4. https://www.ai-bites.net/dino-v2-learning-robust-visual-features-without-supervision-model-explained/
5. https://blog.marvik.ai/2023/05/16/dinov2-exploring-self-supervised-vision-transformers/
Original papers:
1. https://arxiv.org/abs/2104.14294 Emerging Properties in Self-Supervised Vision Transformers (Dino)
2. https://arxiv.org/abs/2304.07193 DINOv2: Learning Robust Visual Features without Supervision
3. https://arxiv.org/abs/2309.16588 Vision Transformers Need Registers
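A hedged, minimal sketch of the DINO objective from the original paper: a student network is trained to match an EMA "teacher" on differently augmented views, with centering and sharpening of the teacher outputs to avoid collapse (in the paper, `center` is itself an EMA of teacher outputs). This is illustrative, not the authors' training code:

```python
import torch
import torch.nn.functional as F

def dino_loss(student_out, teacher_out, center, tau_s=0.1, tau_t=0.04):
    """Cross-entropy between the centered, sharpened teacher and the student."""
    t = F.softmax((teacher_out - center) / tau_t, dim=-1).detach()  # centering + sharpening
    log_s = F.log_softmax(student_out / tau_s, dim=-1)
    return -(t * log_s).sum(dim=-1).mean()

@torch.no_grad()
def ema_update(student, teacher, m=0.996):
    """Teacher weights track the student as an exponential moving average."""
    for p_s, p_t in zip(student.parameters(), teacher.parameters()):
        p_t.mul_(m).add_(p_s, alpha=1 - m)
```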
https://arxiv.org/html/2405.18886v1 Compressing Large Language Models using Low Rank and Low Precision Decomposition #paper
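As a hedged illustration of just the low-rank half of the idea (not the paper's algorithm, which also adds a low-precision component), a truncated SVD of a weight matrix:

```python
import numpy as np

def low_rank_approx(W, rank):
    """Keep only the top-`rank` singular components of W."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    return (U[:, :rank] * S[:rank]) @ Vt[:rank]

W = np.random.randn(1024, 1024).astype(np.float32)  # stand-in weight matrix
W_r = low_rank_approx(W, rank=64)                    # ~8x fewer values to store
print("relative error:", np.linalg.norm(W - W_r) / np.linalg.norm(W))
```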
https://github.com/staghado/vit.cpp Inference Vision Transformer (ViT) in plain C/C++ with ggml
https://github.com/ggerganov/ggml Tensor library for machine learning with Low-level cross-platform implementation
#Frameworks
https://arxiv.org/abs/2412.11768 No More Adam: Learning Rate Scaling at Initialization is All You Need (SGD-SaI)
https://github.com/AnonymousAlethiometer/SGD_SaI/
#Paper #Frameworks
https://www.samarkhanna.com/ExPLoRA/ Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts
#Paper #Framework
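ExPLoRA extends pre-training on the target domain with parameter-efficient adapters instead of full fine-tuning (see the paper for the exact recipe). As a hedged sketch of the parameter-efficient part only, generic LoRA on a ViT via Hugging Face `peft`; the model id and target module names are assumptions for the `transformers` ViT implementation, not the authors' code:

```python
from transformers import ViTModel
from peft import LoraConfig, get_peft_model

backbone = ViTModel.from_pretrained("google/vit-base-patch16-224-in21k")

lora_cfg = LoraConfig(
    r=8,                                # adapter rank
    lora_alpha=16,
    target_modules=["query", "value"],  # attention projections in the HF ViT blocks
    lora_dropout=0.0,
)
model = get_peft_model(backbone, lora_cfg)
model.print_trainable_parameters()      # only the LoRA adapters are trainable
```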
https://medium.com/version-1/the-rise-of-large-action-models-lams-how-ai-can-understand-and-execute-human-intentions-f59c8e78bc09
Large Action Models
https://arxiv.org/pdf/2411.07975
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation
#Paper
Finally, multimodality on both input and output!