Deep Learning – Telegram

Deep Learning

@deep_learning_prog

37 subscribers

5 photos

136 links

Deep Learning: programming, tools & resources.
#DeepLearning #Python

https://www.anyscale.com/blog/continuous-batching-llm-inference

LLM inference acceleration #Frameworks

Achieve 23x LLM Inference Throughput & Reduce p50 Latency

In this blog, we discuss continuous batching, a critical systems-level optimization that improves both throughput and latency under load for LLMs.
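A toy sketch of why continuous batching helps, assuming each request needs a fixed number of decode steps and the server has a fixed number of batch slots. The function names and the workload are hypothetical and only illustrate the scheduling idea; this is not code from the linked post or from any serving framework.

```python
from collections import deque


def static_batching_steps(lengths, batch_size):
    """Decode steps when each batch runs until its longest request finishes."""
    steps = 0
    for i in range(0, len(lengths), batch_size):
        steps += max(lengths[i:i + batch_size])
    return steps


def continuous_batching_steps(lengths, batch_size):
    """Decode steps when a finished request's slot is refilled right away."""
    waiting = deque(lengths)  # tokens still to generate, per queued request
    running = []              # remaining tokens for each in-flight request
    steps = 0
    while waiting or running:
        # Fill free slots from the queue before each decode step.
        while waiting and len(running) < batch_size:
            running.append(waiting.popleft())
        # One decode step produces one token per running request;
        # requests that reach zero remaining tokens leave the batch.
        running = [r - 1 for r in running if r - 1 > 0]
        steps += 1
    return steps


# Hypothetical workload: short and long generations mixed (all lengths > 0).
lengths = [2, 8, 2, 8, 2, 8, 2, 8]
static = static_batching_steps(lengths, batch_size=4)
continuous = continuous_batching_steps(lengths, batch_size=4)
print(static, continuous)
```

With static batching, short requests sit idle until the longest one in their batch completes. The continuous variant reuses those slots, so the same workload finishes in fewer decode steps.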

❤1

81 views · Vadim ✨, 16:42

https://llava-vl.github.io/blog/2024-01-30-llava-next/

#Frameworks #Models

LLaVA-NeXT: Improved reasoning, OCR, and world knowledge

The LLaVA team presents LLaVA-NeXT, with improved reasoning, OCR, and world knowledge. LLaVA-NeXT even exceeds Gemini Pro on several benchmarks.

75 views · Vadim ✨, 18:50

https://github.com/arcee-ai/mergekit
Model Merging: toolkit
#Frameworks

Tools for merging pretrained large language models. - arcee-ai/mergekit
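Weighted averaging of parameters is one of the simplest ways to merge models that share an architecture. The sketch below shows only that idea on plain Python lists. `linear_merge` and the toy "state dicts" are hypothetical; this is not mergekit's API or configuration format.

```python
def linear_merge(state_dicts, weights):
    """Weighted average of same-shaped parameters across several models."""
    total = sum(weights)
    merged = {}
    for name in state_dicts[0]:
        size = len(state_dicts[0][name])
        merged[name] = [
            sum(w * sd[name][i] for sd, w in zip(state_dicts, weights)) / total
            for i in range(size)
        ]
    return merged


# Two toy "models" with identical parameter names and shapes (assumed).
model_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
model_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}

merged = linear_merge([model_a, model_b], weights=[0.5, 0.5])
print(merged)
```

Real checkpoints hold tensors rather than lists, and practical merging toolkits offer more elaborate methods than a plain average.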

82 views · Vadim ✨, 05:34

https://github.com/SeldonIO/alibi-detect Algorithms for outlier, adversarial and drift detection
https://github.com/SeldonIO/alibi Algorithms for explaining machine learning models
#Frameworks #library #anomaly #drift

Algorithms for outlier, adversarial and drift detection - SeldonIO/alibi-detect

72 views · Vadim ✨, edited 00:40

Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and many other libraries.
https://mars-project.readthedocs.io/
#Frameworks

75 views · Vadim, edited 02:00

RAG #Frameworks
https://github.com/infiniflow/ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs - infiniflow/ragflow

75 views · Vadim, 23:18

Numba is an open source JIT compiler that translates a subset of Python and NumPy code into fast machine code.
https://numba.pydata.org/ #Frameworks #library
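A minimal sketch of the usual Numba pattern: decorate a numeric, loop-heavy function with `njit` so it is compiled on first call. The function `sum_of_squares` is a made-up example, and the fallback decorator is an assumption added here only so the snippet still runs where Numba is not installed.

```python
try:
    from numba import njit  # Numba's nopython-mode JIT decorator
except ImportError:
    # Fallback (assumption for portability): run as plain Python.
    def njit(func):
        return func


@njit
def sum_of_squares(n):
    """Sum i*i for i in [0, n) with an explicit loop, which Numba can compile."""
    total = 0
    for i in range(n):
        total += i * i
    return total


result = sum_of_squares(10)
print(result)
```

The first call includes compilation time when Numba is present, so any speedup shows up on later calls and larger inputs.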

❤1👏1

64 views · Vadim, 04:14

https://github.com/staghado/vit.cpp Inference Vision Transformer (ViT) in plain C/C++ with ggml
https://github.com/ggerganov/ggml Tensor library for machine learning with a low-level, cross-platform implementation
#Frameworks

Inference Vision Transformer (ViT) in plain C/C++ with ggml - staghado/vit.cpp

53 views · Vadim, edited 15:07

https://arxiv.org/abs/2412.11768
https://github.com/AnonymousAlethiometer/SGD_SaI/
#Paper #Frameworks

No More Adam: Learning Rate Scaling at Initialization is All You Need

In this work, we question the necessity of adaptive gradient methods for training deep neural networks. SGD-SaI is a simple yet effective enhancement to stochastic gradient descent with momentum...

59 views · Vadim, edited 22:47

https://github.com/exo-explore/exo #LLMserving #Frameworks

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ - exo-explore/exo

48 views · Vadim, edited 11:58

https://github.com/matatonic/openedai-vision #Frameworks #LLMserving Another LLM server

An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. - matatonic/openedai-vision

56 views · Vadim, edited 10:30

https://docs.vllm.ai/en/latest/
#Frameworks #LLMserving

53 views · Vadim, edited 10:44

https://arxiv.org/pdf/2411.17525
https://huggingface.co/docs/transformers/main/en/quantization/higgs
https://github.com/HanGuo97/flute

Large Language Model Quantization
#Frameworks #Paper #Tips

40 views · Vadim, 21:21

https://github.com/parthsarthi03/raptor #Frameworks

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval - parthsarthi03/raptor

38 views · ARТ̈EM Zaborskiy, edited 08:12

https://github.com/illuin-tech/colpali
Efficient Document Retrieval with Vision Language Models #Frameworks #Models

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. - illuin-tech/colpali

40 views · Vadim, 14:56

https://arxiv.org/abs/2503.19108
A plain ViT architecture, without a decoder, for fast image segmentation #Paper #Frameworks

Your ViT is Secretly an Image Segmentation Model

Vision Transformers (ViTs) have shown remarkable performance and scalability across various computer vision tasks. To apply single-scale ViTs to image segmentation, existing methods adopt a...

47 views · Vadim, 21:10

https://arxiv.org/abs/2412.16334 open-vocabulary segmentation! #Paper #Frameworks

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level...

Self-supervised visual foundation models produce powerful embeddings that achieve remarkable performance on a wide range of downstream tasks. However, unlike vision-language models such as CLIP,...

37 views · Vadim, 08:38

https://arxiv.org/abs/2411.04983
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning #Frameworks #Paper

The ability to predict future outcomes given control actions is fundamental for physical reasoning. However, such predictive models, often called world models, remain challenging to learn and are...

40 views · Vadim, 20:17