chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language: Python
#artificial_intelligence #data_science #language_model #natural_language_processing #nlp #open #python #text_mining
Stars: 371 Issues: 2 Forks: 31
https://github.com/chiphuyen/lazynlp
  
pingpong-ai/xlnet-pytorch
2019 Google Brain's XLNet Pytorch Implementation
Language: Python
#language_model #pytorch #transformer_xl #xlnet #xlnet_pytorch
Stars: 110 Issues: 2 Forks: 9
https://github.com/pingpong-ai/xlnet-pytorch
SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
Language: Python
#korean_nlp #language_model
Stars: 132 Issues: 0 Forks: 17
https://github.com/SKTBrain/KoBERT
  
maraoz/gpt-scrolls
A collaborative collection of open-source safe GPT-3 prompts that work well
#generator #gpt_3 #language_model #openai #safety #transformer
Stars: 123 Issues: 4 Forks: 7
https://github.com/maraoz/gpt-scrolls
  
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python
#adapters #artificial_intelligence #deeplearning #dependency_parsing #language_model #lemmatization #machine_learning #morphological_tagging #multilingual #natural_language_processing #nlp #part_of_speech_tagging #pytorch #sentence_segmentation #tokenization #universal_dependencies #xlm_roberta
Stars: 120 Issues: 0 Forks: 8
https://github.com/nlp-uoregon/trankit
  
will-thompson-k/tldr-transformers
The "tl;dr" on a few notable transformer papers (pre-2022).
#nlp #deep_learning #notes #transformers #attention #transfer_learning #language_models #language_model #bert #open_ai #huggingface #huggingface_transformer #gpt_3
Stars: 90 Issues: 2 Forks: 2
https://github.com/will-thompson-k/tldr-transformers
  
DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C
#asr #c_plus_plus #command_line_tool #debian #debian_package #deep_learning #german_speech_recognition #kenlm #language_model #machine_learning #no_cloud #offline #pretrained_models #private #speech #speech_recognition #speech_to_text #tensorflow #tensorflow_lite #wave
Stars: 250 Issues: 0 Forks: 9
https://github.com/DeutscheKI/tevr-asr-tool
  
extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
  
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
  
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
  
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
  
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python
#deep_learning #instruction_following #language_model
Stars: 2633 Issues: 3 Forks: 169
https://github.com/tatsu-lab/stanford_alpaca
  