GitHub repos – Telegram

26K subscribers

18 photos

2 videos

11.4K links

Welcome to GitHub repos. This channel posts trending GitHub projects, each with its description, primary language, topic tags, and star, issue, and fork counts. Subscribe to follow new projects as they appear.

PaulPauls/llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
Language: Python
#feature_extraction #feature_steering #llama3 #llm_interpretability #open_research #pytorch #sparse_autoencoder
Stars: 285 Issues: 0 Forks: 13
https://github.com/PaulPauls/llama3_interpretability_sae

1.79K views · 17:00

zhihu/ZhiLight
A highly optimized LLM inference acceleration engine for Llama and its variants.
Language: C++
#cpm #cuda #gpt #inference_engine #llama #llm #llm_serving #minicpm #pytorch #qwen
Stars: 192 Issues: 1 Forks: 16
https://github.com/zhihu/ZhiLight

1.78K views · 17:00

facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
Language: Python
#language_models #nlp #pytorch #seq2seq #sequence_to_sequence
Stars: 340 Issues: 0 Forks: 27
https://github.com/facebookresearch/large_concept_model

1.87K views · 17:00

MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA

1.67K views · 11:00

babycommando/neuralgraffiti
Live-bending a foundation model’s output at the neural network level.
Language: Jupyter Notebook
#finetuning #liquid_neural_networks #llm #neural_network #pytorch #self_attention #transformers
Stars: 217 Issues: 0 Forks: 16
https://github.com/babycommando/neuralgraffiti

1.58K views · 22:00