ianzhao05/textshot
Python tool for grabbing text via screenshot
Language: Python
#ocr #ocr_recognition #python #python_3 #python_script #python3 #screenshot #script #tesseract #tesseract_ocr
Stars: 176 Issues: 0 Forks: 21
https://github.com/ianzhao05/textshot
open-mmlab/mmocr
OpenMMLab Text Detection and Recognition Toolbox
Language: Python
#crnn #db #dbnet #deep_learning #key_information_extraction #maskrcnn #ocr #pan #panet #psenet #pytorch #robustscanner #sar #sdmg_r #segmentation_based_text_recognition #text_detection #text_recognition #textsnake #transformer
Stars: 141 Issues: 0 Forks: 11
https://github.com/open-mmlab/mmocr
LBH1024/CAN
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
Language: Python
#counting #hmer #ocr #recognition
Stars: 109 Issues: 0 Forks: 1
https://github.com/LBH1024/CAN
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python
#computer_vision #document_ai #eccv_2022 #multimodal_pre_trained_model #nlp #ocr
Stars: 98 Issues: 2 Forks: 5
https://github.com/clovaai/donut
OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
Danily07/Translumo
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text, etc.
Language: C#
#autotranslate #easyocr #game_translation #mlnet #ocr #translation
Stars: 239 Issues: 5 Forks: 4
https://github.com/Danily07/Translumo
junhoyeo/BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract) with 🧠 LLM.
Language: Python
#ai #chatgpt #chatgpt_api #easyocr #llm #ocr #openai #openai_api #tesseract #tesseract_ocr
Stars: 154 Issues: 4 Forks: 7
https://github.com/junhoyeo/BetterOCR
reworkd/tarsier
Vision utilities for web interaction agents 👀
Language: Jupyter Notebook
#gpt4v #llms #ocr #playwright #pypi_package #python #selenium #webscraping
Stars: 236 Issues: 3 Forks: 14
https://github.com/reworkd/tarsier
VikParuchuri/texify
OCR model for math that outputs LaTeX and markdown
Language: Python
#deep_learning #latex #markdown #ocr
Stars: 142 Issues: 0 Forks: 7
https://github.com/VikParuchuri/texify
robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs