codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
Language: Python
#bert #language_model #nlp #pytorch #transformer
Stars: 245 Issues: 3 Forks: 46
https://github.com/codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
Language: Python
#bert #language_model #nlp #pytorch #transformer
Stars: 245 Issues: 3 Forks: 46
https://github.com/codertimo/BERT-pytorch
GitHub
GitHub - codertimo/BERT-pytorch: Google AI 2018 BERT pytorch implementation
Google AI 2018 BERT pytorch implementation. Contribute to codertimo/BERT-pytorch development by creating an account on GitHub.
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language: Python
#artificial_intelligence #data_science #language_model #natural_language_processing #nlp #open #python #text_mining
Stars: 371 Issues: 2 Forks: 31
https://github.com/chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language: Python
#artificial_intelligence #data_science #language_model #natural_language_processing #nlp #open #python #text_mining
Stars: 371 Issues: 2 Forks: 31
https://github.com/chiphuyen/lazynlp
GitHub
GitHub - chiphuyen/lazynlp: Library to scrape and clean web pages to create massive datasets.
Library to scrape and clean web pages to create massive datasets. - chiphuyen/lazynlp
pingpong-ai/xlnet-pytorch
2019 Google Brain's XLNet Pytorch Implementation
Language: Python
#language_model #pytorch #transformer_xl #xlnet #xlnet_pytorch
Stars: 110 Issues: 2 Forks: 9
https://github.com/pingpong-ai/xlnet-pytorch
2019 Google Brain's XLNet Pytorch Implementation
Language: Python
#language_model #pytorch #transformer_xl #xlnet #xlnet_pytorch
Stars: 110 Issues: 2 Forks: 9
https://github.com/pingpong-ai/xlnet-pytorch
SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
Language: Python
#korean_nlp #language_model
Stars: 132 Issues: 0 Forks: 17
https://github.com/SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
Language: Python
#korean_nlp #language_model
Stars: 132 Issues: 0 Forks: 17
https://github.com/SKTBrain/KoBERT
GitHub
GitHub - SKTBrain/KoBERT: Korean BERT pre-trained cased (KoBERT)
Korean BERT pre-trained cased (KoBERT). Contribute to SKTBrain/KoBERT development by creating an account on GitHub.
maraoz/gpt-scrolls
A collaborative collection of open-source safe GPT-3 prompts that work well
#generator #gpt_3 #language_model #openai #safety #transformer
Stars: 123 Issues: 4 Forks: 7
https://github.com/maraoz/gpt-scrolls
A collaborative collection of open-source safe GPT-3 prompts that work well
#generator #gpt_3 #language_model #openai #safety #transformer
Stars: 123 Issues: 4 Forks: 7
https://github.com/maraoz/gpt-scrolls
GitHub
GitHub - maraoz/gpt-scrolls: A collaborative collection of open-source safe GPT-3 prompts that work well
A collaborative collection of open-source safe GPT-3 prompts that work well - GitHub - maraoz/gpt-scrolls: A collaborative collection of open-source safe GPT-3 prompts that work well
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python
#adapters #artificial_intelligence #deeplearning #dependency_parsing #language_model #lemmatization #machine_learning #morphological_tagging #multilingual #natural_language_processing #nlp #part_of_speech_tagging #pytorch #sentence_segmentation #tokenization #universal_dependencies #xlm_roberta
Stars: 120 Issues: 0 Forks: 8
https://github.com/nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python
#adapters #artificial_intelligence #deeplearning #dependency_parsing #language_model #lemmatization #machine_learning #morphological_tagging #multilingual #natural_language_processing #nlp #part_of_speech_tagging #pytorch #sentence_segmentation #tokenization #universal_dependencies #xlm_roberta
Stars: 120 Issues: 0 Forks: 8
https://github.com/nlp-uoregon/trankit
GitHub
GitHub - nlp-uoregon/trankit: Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing - nlp-uoregon/trankit
will-thompson-k/tldr-transformers
The "tl;dr" on a few notable transformer papers.
#nlp #deep_learning #notes #transformers #attention #transfer_learning #language_models #language_model #bert #open_ai #huggingface #huggingface_transformer #gpt_3
Stars: 90 Issues: 2 Forks: 2
https://github.com/will-thompson-k/tldr-transformers
The "tl;dr" on a few notable transformer papers.
#nlp #deep_learning #notes #transformers #attention #transfer_learning #language_models #language_model #bert #open_ai #huggingface #huggingface_transformer #gpt_3
Stars: 90 Issues: 2 Forks: 2
https://github.com/will-thompson-k/tldr-transformers
GitHub
GitHub - will-thompson-k/tldr-transformers: The "tl;dr" on a few notable transformer papers (pre-2022).
The "tl;dr" on a few notable transformer papers (pre-2022). - will-thompson-k/tldr-transformers
DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C
#asr #c_plus_plus #command_line_tool #debian #debian_package #deep_learning #german_speech_recognition #kenlm #language_model #machine_learning #no_cloud #offline #pretrained_models #private #speech #speech_recognition #speech_to_text #tensorflow #tensorflow_lite #wave
Stars: 250 Issues: 0 Forks: 9
https://github.com/DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C
#asr #c_plus_plus #command_line_tool #debian #debian_package #deep_learning #german_speech_recognition #kenlm #language_model #machine_learning #no_cloud #offline #pretrained_models #private #speech #speech_recognition #speech_to_text #tensorflow #tensorflow_lite #wave
Stars: 250 Issues: 0 Forks: 9
https://github.com/DeutscheKI/tevr-asr-tool
GitHub
GitHub - DeutscheKI/tevr-asr-tool: State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is…
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool. - DeutscheKI/tevr-asr-tool
extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
GitHub
GitHub - extreme-bert/extreme-bert: ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on…
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Custom...
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
GitHub
GitHub - JonasGeiping/cramming: Cramming the training of a (BERT-type) language model into limited compute.
Cramming the training of a (BERT-type) language model into limited compute. - JonasGeiping/cramming
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
GitHub
GitHub - BlinkDL/ChatRWKV: ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. - BlinkDL/ChatRWKV
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
GitHub
GitHub - NVlabs/prismer: The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". - NVlabs/prismer
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python
#deep_learning #instruction_following #language_model
Stars: 2633 Issues: 3 Forks: 169
https://github.com/tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python
#deep_learning #instruction_following #language_model
Stars: 2633 Issues: 3 Forks: 169
https://github.com/tatsu-lab/stanford_alpaca
GitHub
GitHub - tatsu-lab/stanford_alpaca: Code and documentation to train Stanford's Alpaca models, and generate the data.
Code and documentation to train Stanford's Alpaca models, and generate the data. - tatsu-lab/stanford_alpaca
context-labs/autodoc
Experimental toolkit for auto-generating codebase documentation using LLMs
Language: TypeScript
#cli_tool #documentation_generator #language_model #typescript
Stars: 568 Issues: 7 Forks: 18
https://github.com/context-labs/autodoc
Experimental toolkit for auto-generating codebase documentation using LLMs
Language: TypeScript
#cli_tool #documentation_generator #language_model #typescript
Stars: 568 Issues: 7 Forks: 18
https://github.com/context-labs/autodoc
GitHub
GitHub - context-labs/autodoc: Experimental toolkit for auto-generating codebase documentation using LLMs
Experimental toolkit for auto-generating codebase documentation using LLMs - context-labs/autodoc
mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Language: Python
#chatgpt #deep_learning #language_model #llm #tvm #webgpu #webml
Stars: 1009 Issues: 1 Forks: 41
https://github.com/mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Language: Python
#chatgpt #deep_learning #language_model #llm #tvm #webgpu #webml
Stars: 1009 Issues: 1 Forks: 41
https://github.com/mlc-ai/web-llm
GitHub
GitHub - mlc-ai/web-llm: High-performance In-browser LLM Inference Engine
High-performance In-browser LLM Inference Engine . Contribute to mlc-ai/web-llm development by creating an account on GitHub.