Github Top Repositories
12.9K subscribers
380 photos
57 videos
9 files
1.33K links
Top GitHub repositories in one place ๐Ÿš€
Explore the best projects in programming, AI, data science, and more.
Download Telegram
๐Ÿš€ New Tutorial: Automatic Number Plate Recognition (ANPR) with YOLOv11 + GPT-4o-mini!


This hands-on tutorial shows you how to combine the real-time detection power of YOLOv11 with the language understanding of GPT-4o-mini to build a smart, high-accuracy ANPR system! From setup to smart prompt engineering, everything is covered step-by-step. ๐Ÿš—๐Ÿ’ก

๐ŸŽฏ Key Highlights:
โœ… YOLOv11 + GPT-4o-mini = High-precision number plate recognition
โœ… Real-time video processing in Google Colab
โœ… Smart prompt engineering for enhanced OCR performance

๐Ÿ“ข A must-watch if you're into computer vision, deep learning, or OpenAI integrations!


๐Ÿ”— Colab Notebook
โ–ถ๏ธ Watch on YouTube


#YOLOv11 #GPT4o #OpenAI #ANPR #OCR #ComputerVision #DeepLearning #AI #DataScience #Python #Ultralytics #MachineLearning #Colab #NumberPlateRecognition

๐Ÿ” By : https://t.me/DataScienceN
๐Ÿ‘2โค1๐Ÿ”ฅ1
๐Ÿ“š JaidedAI/EasyOCR โ€” an open-source Python library for Optical Character Recognition (OCR) that's easy to use and supports over 80 languages out of the box.

### ๐Ÿ” Key Features:

๐Ÿ”ธ Extracts text from images and scanned documents โ€” including handwritten notes and unusual fonts
๐Ÿ”ธ Supports a wide range of languages like English, Russian, Chinese, Arabic, and more
๐Ÿ”ธ Built on PyTorch โ€” uses modern deep learning models (not the old-school Tesseract)
๐Ÿ”ธ Simple to integrate into your Python projects

### โœ… Example Usage:

import easyocr

reader = easyocr.Reader(['en', 'ru']) # Choose supported languages
result = reader.readtext('image.png')


### ๐Ÿ“Œ Ideal For:

โœ… Text extraction from photos, scans, and documents
โœ… Embedding OCR capabilities in apps (e.g. automated data entry)

๐Ÿ”— GitHub: https://github.com/JaidedAI/EasyOCR

๐Ÿ‘‰ Follow us for more: @DataScienceN

#Python #OCR #MachineLearning #ComputerVision #EasyOCR
โค2๐Ÿ”ฅ1
๐Ÿ”ฅ Trending Repository: awesome-deep-text-detection-recognition

๐Ÿ“ Description: A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

๐Ÿ”— Repository URL: https://github.com/hwalsuklee/awesome-deep-text-detection-recognition

๐Ÿ“– Readme: https://github.com/hwalsuklee/awesome-deep-text-detection-recognition#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 2.5K stars
๐Ÿ‘€ Watchers: 148
๐Ÿด Forks: 508 forks

๐Ÿ’ป Programming Languages: Not available

๐Ÿท๏ธ Related Topics:
#ocr #deep_learning #text_recognition #awesome_list #text_detection #ocr_recognition #awesome_lists #text_detection_recognition #ocr_detection #ocr_papers #ocr_paper #ocr_paper_list


==================================
๐Ÿง  By: https://t.me/DataScienceN
โค1
๐Ÿ”ฅ Trending Repository: Umi-OCR

๐Ÿ“ Description: OCR software, free and offline. ๅผ€ๆบใ€ๅ…่ดน็š„็ฆป็บฟOCR่ฝฏไปถใ€‚ๆ”ฏๆŒๆˆชๅฑ/ๆ‰น้‡ๅฏผๅ…ฅๅ›พ็‰‡๏ผŒPDFๆ–‡ๆกฃ่ฏ†ๅˆซ๏ผŒๆŽ’้™คๆฐดๅฐ/้กต็œ‰้กต่„š๏ผŒๆ‰ซๆ/็”ŸๆˆไบŒ็ปด็ ใ€‚ๅ†…็ฝฎๅคšๅ›ฝ่ฏญ่จ€ๅบ“ใ€‚

๐Ÿ”— Repository URL: https://github.com/hiroi-sora/Umi-OCR

๐Ÿ“– Readme: https://github.com/hiroi-sora/Umi-OCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 36.7K stars
๐Ÿ‘€ Watchers: 186
๐Ÿด Forks: 3.6K forks

๐Ÿ’ป Programming Languages: Python - QML

๐Ÿท๏ธ Related Topics:
#screenshot #qt #ocr #qml #ocr_python #paddleocr #umi_ocr


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: tesseract

๐Ÿ“ Description: Tesseract Open Source OCR Engine (main repository)

๐Ÿ”— Repository URL: https://github.com/tesseract-ocr/tesseract

๐ŸŒ Website: https://tesseract-ocr.github.io/

๐Ÿ“– Readme: https://github.com/tesseract-ocr/tesseract#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 69.4K stars
๐Ÿ‘€ Watchers: 1.7k
๐Ÿด Forks: 10.2K forks

๐Ÿ’ป Programming Languages: C++ - CMake - Java - Makefile - NSIS - C

๐Ÿท๏ธ Related Topics:
#machine_learning #ocr #tesseract #lstm #tesseract_ocr #hacktoberfest #ocr_engine


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: PaddleOCR

๐Ÿ“ Description: Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

๐Ÿ”— Repository URL: https://github.com/PaddlePaddle/PaddleOCR

๐ŸŒ Website: https://www.paddleocr.ai

๐Ÿ“– Readme: https://github.com/PaddlePaddle/PaddleOCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 53.9K stars
๐Ÿ‘€ Watchers: 470
๐Ÿด Forks: 8.6K forks

๐Ÿ’ป Programming Languages: Python - C++ - Shell - Java - CMake - Cuda

๐Ÿท๏ธ Related Topics:
#ocr #db #kie #crnn #document_translation #ocrlite #chineseocr #pp_ocr #document_parsing #pp_structure #pdf2markdown #chatocr


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: Dolphin

๐Ÿ“ Description: The official repo for โ€œDolphin: Document Image Parsing via Heterogeneous Anchor Promptingโ€, ACL, 2025.

๐Ÿ”— Repository URL: https://github.com/bytedance/Dolphin

๐Ÿ“– Readme: https://github.com/bytedance/Dolphin#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 6.3K stars
๐Ÿ‘€ Watchers: 53
๐Ÿด Forks: 516 forks

๐Ÿ’ป Programming Languages: Python - Shell

๐Ÿท๏ธ Related Topics:
#python #pdf #parser #ocr #pdf_converter #document_analysis #pdf_parser #layout_analysis #vlm_ocr


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: siyuan

๐Ÿ“ Description: A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

๐Ÿ”— Repository URL: https://github.com/siyuan-note/siyuan

๐ŸŒ Website: https://b3log.org/siyuan

๐Ÿ“– Readme: https://github.com/siyuan-note/siyuan#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 37.6K stars
๐Ÿ‘€ Watchers: 159
๐Ÿด Forks: 2.3K forks

๐Ÿ’ป Programming Languages: TypeScript - Go - JavaScript - SCSS - HTML - CSS

๐Ÿท๏ธ Related Topics:
#electron #markdown #pdf #ocr #s3 #webdav #self_hosted #openai #note_taking #evernote #anki #knowledge_base #obsidian #notion #notes_app #local_first #chatgpt #ollama #deepseek


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: LaTeX-OCR

๐Ÿ“ Description: pix2tex: Using a ViT to convert images of equations into LaTeX code.

๐Ÿ”— Repository URL: https://github.com/lukas-blecher/LaTeX-OCR

๐ŸŒ Website: https://lukas-blecher.github.io/LaTeX-OCR/

๐Ÿ“– Readme: https://github.com/lukas-blecher/LaTeX-OCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 15.4K stars
๐Ÿ‘€ Watchers: 85
๐Ÿด Forks: 1.2K forks

๐Ÿ’ป Programming Languages: Python - JavaScript - Jupyter Notebook

๐Ÿท๏ธ Related Topics:
#python #machine_learning #ocr #latex #deep_learning #image_processing #pytorch #dataset #transformer #vit #image2text #im2text #im2latex #im2markup #math_ocr #vision_transformer #latex_ocr


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: MinerU

๐Ÿ“ Description: Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

๐Ÿ”— Repository URL: https://github.com/opendatalab/MinerU

๐ŸŒ Website: https://opendatalab.github.io/MinerU/

๐Ÿ“– Readme: https://github.com/opendatalab/MinerU#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 45.7K stars
๐Ÿ‘€ Watchers: 183
๐Ÿด Forks: 3.8K forks

๐Ÿ’ป Programming Languages: Python - Dockerfile

๐Ÿท๏ธ Related Topics:
#python #pdf #parser #ocr #pdf_converter #extract_data #document_analysis #pdf_parser #layout_analysis #ai4science #pdf_extractor_rag #pdf_extractor_llm #pdf_extractor_pretrain


==================================
๐Ÿง  By: https://t.me/DataScienceM
โค1
๐Ÿ”ฅ Trending Repository: ruvector

๐Ÿ“ Description: RuVector is a high performance vector and graph database built in Rust for AI, agentic systems, and real time analytics. It combines HNSW search, dynamic minimum cut coherence, graph intelligence, and self learning memory into one unified engine for scalable, low latency reasoning and structured retrieval.

๐Ÿ”— Repository URL: https://github.com/ruvnet/ruvector

๐ŸŒ Website: https://ruv.io

๐Ÿ“– Readme: https://github.com/ruvnet/ruvector#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 498 stars
๐Ÿ‘€ Watchers: 14
๐Ÿด Forks: 115 forks

๐Ÿ’ป Programming Languages: Rust - TypeScript - JavaScript - Shell - Metal - PLpgSQL

๐Ÿท๏ธ Related Topics:
#rust #ocr #ai #neo4j #graph #vector #mincut #wasm #low_latency #attention_mechanism #onnx #gnns #graph_neural_networks #gnn #gnn_model #llm_inference #ai_ocr


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: opendataloader-pdf

๐Ÿ“ Description: PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

๐Ÿ”— Repository URL: https://github.com/opendataloader-project/opendataloader-pdf

๐ŸŒ Website: https://opendataloader.org

๐Ÿ“– Readme: https://github.com/opendataloader-project/opendataloader-pdf#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 4.7k
๐Ÿ‘€ Watchers: 18
๐Ÿด Forks: 355

๐Ÿ’ป Programming Languages: Java - Python - MDX - JavaScript - TypeScript - Shell

๐Ÿท๏ธ Related Topics:
#html #markdown #pdf #json #ocr #ai #accessibility #a11y #pdf_converter #tables #ocr_recognition #pdf_parser #rag #bounding_box #eaa #pdf_extraction #tagged_pdf #document_parsing #pdf_accessibility #pdf_ua


==================================
๐Ÿง  By: https://t.me/DataScienceM
โค2
๐Ÿ”ฅ Trending Repository: chandra

๐Ÿ“ Description: OCR model that handles complex tables, forms, handwriting with full layout.

๐Ÿ”— Repository URL: https://github.com/datalab-to/chandra

๐ŸŒ Website: https://www.datalab.to

๐Ÿ“– Readme: https://github.com/datalab-to/chandra#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 5.7k
๐Ÿ‘€ Watchers: 50
๐Ÿด Forks: 637

๐Ÿ’ป Programming Languages: Python - HTML

๐Ÿท๏ธ Related Topics:
#ocr #ai


==================================
๐Ÿง  By: https://t.me/DataScienceM
๐Ÿ”ฅ Trending Repository: GLM-OCR

๐Ÿ“ Description: GLM-OCR: Accurate ร— Fast ร— Comprehensive

๐Ÿ”— Repository URL: https://github.com/zai-org/GLM-OCR

๐Ÿ“– Readme: https://github.com/zai-org/GLM-OCR#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 5.1k
๐Ÿ‘€ Watchers: 21
๐Ÿด Forks: 449

๐Ÿ’ป Programming Languages: Python - TypeScript - CSS

๐Ÿท๏ธ Related Topics:
#ocr #glm #image2text


==================================
๐Ÿง  By: https://t.me/DataScienceM
โค2
๐Ÿ”ฅ Trending Repository: ShareX

๐Ÿ“ Description: ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

๐Ÿ”— Repository URL: https://github.com/ShareX/ShareX

๐ŸŒ Website: https://getsharex.com

๐Ÿ“– Readme: https://github.com/ShareX/ShareX#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 36.5k
๐Ÿ‘€ Watchers: 539
๐Ÿด Forks: 3.7k

๐Ÿ’ป Programming Languages: C# - HTML

๐Ÿท๏ธ Related Topics:
#productivity #screenshot #share #ocr #csharp #image_annotation #dropbox #color_picker #ftp #file_upload #file_sharing #url_shortener #screen_recorder #gif #avalonia #capture #screen_capture #region_capture #gif_recorder #sharex


==================================
๐Ÿง  By: https://t.me/DataScienceM
โค2