facebookresearch/large_concept_model
Large Concept Models: Language modeling in a sentence representation space
Language: Python
#language_models #nlp #pytorch #seq2seq #sequence_to_sequence
Stars: 340 Issues: 0 Forks: 27
https://github.com/facebookresearch/large_concept_model
MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
babycommando/neuralgraffiti
Live-bending a foundation model’s output at neural network level.
Language: Jupyter Notebook
#finetuning #liquid_neural_networks #llm #neural_network #pytorch #self_attention #transformers
Stars: 217 Issues: 0 Forks: 16
https://github.com/babycommando/neuralgraffiti