lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #deep_learning #long_context #memory #recurrence #transformers
Stars: 223 Issues: 0 Forks: 4
https://github.com/lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #deep_learning #long_context #memory #recurrence #transformers
Stars: 223 Issues: 0 Forks: 4
https://github.com/lucidrains/recurrent-memory-transformer-pytorch
GitHub
GitHub - lucidrains/recurrent-memory-transformer-pytorch: Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in…
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch - lucidrains/recurrent-memory-transformer-pytorch
lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #deep_learning #learned_tokenization #long_context #transformers
Stars: 204 Issues: 0 Forks: 10
https://github.com/lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #deep_learning #learned_tokenization #long_context #transformers
Stars: 204 Issues: 0 Forks: 10
https://github.com/lucidrains/MEGABYTE-pytorch
GitHub
GitHub - lucidrains/MEGABYTE-pytorch: Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers…
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch - lucidrains/MEGABYTE-pytorch
dvlab-research/LongLoRA
Efficient long-context fine-tuning, supervised fine-tuning, LongQA dataset.
Language: Python
#fine_tuning_llm #large_language_models #long_context
Stars: 334 Issues: 5 Forks: 13
https://github.com/dvlab-research/LongLoRA
Efficient long-context fine-tuning, supervised fine-tuning, LongQA dataset.
Language: Python
#fine_tuning_llm #large_language_models #long_context
Stars: 334 Issues: 5 Forks: 13
https://github.com/dvlab-research/LongLoRA
GitHub
GitHub - dvlab-research/LongLoRA: Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral) - dvlab-research/LongLoRA
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language: Python
#fine_tuning #llm #long_context #long_text
Stars: 266 Issues: 7 Forks: 19
https://github.com/THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language: Python
#fine_tuning #llm #long_context #long_text
Stars: 266 Issues: 7 Forks: 19
https://github.com/THUDM/LongWriter
GitHub
GitHub - THUDM/LongWriter: LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs - THUDM/LongWriter