open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
Language: Python
#llm #mistral #moe
Stars: 251 Issues: 6 Forks: 21
https://github.com/open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
Language: Python
#llm #mistral #moe
Stars: 251 Issues: 6 Forks: 21
https://github.com/open-compass/MixtralKit
GitHub
GitHub - open-compass/MixtralKit: A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI - open-compass/MixtralKit
MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python
#flash_attention #llm #llm_serving #llm_training #moe #pytorch #transformer
Stars: 521 Issues: 2 Forks: 16
https://github.com/MoonshotAI/MoBA
GitHub
GitHub - MoonshotAI/MoBA: MoBA: Mixture of Block Attention for Long-Context LLMs
MoBA: Mixture of Block Attention for Long-Context LLMs - MoonshotAI/MoBA