PaulPauls/llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
Language: Python
#feature_extraction #feature_steering #llama3 #llm_interpretability #open_research #pytorch #sparse_autoencoder
Stars: 285 Issues: 0 Forks: 13
https://github.com/PaulPauls/llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
Language: Python
#feature_extraction #feature_steering #llama3 #llm_interpretability #open_research #pytorch #sparse_autoencoder
Stars: 285 Issues: 0 Forks: 13
https://github.com/PaulPauls/llama3_interpretability_sae
GitHub
GitHub - PaulPauls/llama3_interpretability_sae: A complete end-to-end pipeline for LLM interpretability with sparse autoencoders…
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible. - PaulPauls/llama3_interpretability_sae
zhihu/ZhiLight
A highly optimized inference acceleration engine for Llama and its variants.
Language: C++
#cpm #cuda #gpt #inference_engine #llama #llm #llm_serving #minicpm #pytorch #qwen
Stars: 192 Issues: 1 Forks: 16
https://github.com/zhihu/ZhiLight
A highly optimized inference acceleration engine for Llama and its variants.
Language: C++
#cpm #cuda #gpt #inference_engine #llama #llm #llm_serving #minicpm #pytorch #qwen
Stars: 192 Issues: 1 Forks: 16
https://github.com/zhihu/ZhiLight
GitHub
GitHub - zhihu/ZhiLight: A highly optimized LLM inference acceleration engine for Llama and its variants.
A highly optimized LLM inference acceleration engine for Llama and its variants. - zhihu/ZhiLight
👍1
facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
Language: Python
#language_models #nlp #pytorch #seq2seq #sequence_to_sequence
Stars: 340 Issues: 0 Forks: 27
https://github.com/facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
Language: Python
#language_models #nlp #pytorch #seq2seq #sequence_to_sequence
Stars: 340 Issues: 0 Forks: 27
https://github.com/facebookresearch/large_concept_model
GitHub
GitHub - facebookresearch/large_concept_model: Large Concept Models: Language modeling in a sentence representation space
Large Concept Models: Language modeling in a sentence representation space - facebookresearch/large_concept_model
🔥2
MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
GitHub
GitHub - MoonshotAI/MoBA: MoBA: Mixture of Block Attention for Long-Context LLMs
MoBA: Mixture of Block Attention for Long-Context LLMs - MoonshotAI/MoBA
babycommando/neuralgraffiti
Live-bending a foundation model’s output at neural network level.
Language: Jupyter Notebook
#finetuning #liquid_neural_networks #llm #neural_network #pytorch #self_attention #transformers
Stars: 217 Issues: 0 Forks: 16
https://github.com/babycommando/neuralgraffiti
Live-bending a foundation model’s output at neural network level.
Language: Jupyter Notebook
#finetuning #liquid_neural_networks #llm #neural_network #pytorch #self_attention #transformers
Stars: 217 Issues: 0 Forks: 16
https://github.com/babycommando/neuralgraffiti
GitHub
GitHub - babycommando/neuralgraffiti: Live-bending a foundation model’s output at neural network level.
Live-bending a foundation model’s output at neural network level. - babycommando/neuralgraffiti