Github Top Repositories
12.9K subscribers
370 photos
57 videos
9 files
1.32K links
Top GitHub repositories in one place πŸš€
Explore the best projects in programming, AI, data science, and more.
Download Telegram
html-to-markdown

A modern, fully typed Python library for converting HTML to Markdown. This library is a completely rewritten fork of markdownify with a modernized codebase, strict type safety and support for Python 3.9+.

Features:
⭐️ Full HTML5 Support: Comprehensive support for all modern HTML5 elements including semantic, form, table, ruby, interactive, structural, SVG, and math elements
⭐️ Enhanced Table Support: Advanced handling of merged cells with rowspan/colspan support for better table representation
⭐️ Type Safety: Strict MyPy adherence with comprehensive type hints
Metadata Extraction: Automatic extraction of document metadata (title, meta tags) as comment headers
⭐️ Streaming Support: Memory-efficient processing for large documents with progress callbacks
⭐️ Highlight Support: Multiple styles for highlighted text (<mark> elements)
⭐️ Task List Support: Converts HTML checkboxes to GitHub-compatible task list syntax

nstallation
pip install html-to-markdown

Optional lxml Parser
For improved performance, you can install with the optional lxml parser:
pip install html-to-markdown[lxml]

The lxml parser offers:

πŸ†˜ ~30% faster HTML parsing compared to the default html.parser
πŸ†˜ Better handling of malformed HTML
πŸ†˜ More robust parsing for complex documents

Quick Start
Convert HTML to Markdown with a single function call:
from html_to_markdown import convert_to_markdown

html = """
<!DOCTYPE html>
<html>
<head>
<title>Sample Document</title>
<meta name="description" content="A sample HTML document">
</head>
<body>
<article>
<h1>Welcome</h1>
<p>This is a <strong>sample</strong> with a <a href="https://example.com">link</a>.</p>
<p>Here's some <mark>highlighted text</mark> and a task list:</p>
<ul>
<li><input type="checkbox" checked> Completed task</li>
<li><input type="checkbox"> Pending task</li>
</ul>
</article>
</body>
</html>
"""

markdown = convert_to_markdown(html)
print(markdown)


Working with BeautifulSoup:

If you need more control over HTML parsing, you can pass a pre-configured BeautifulSoup instance:
from bs4 import BeautifulSoup
from html_to_markdown import convert_to_markdown

# Configure BeautifulSoup with your preferred parser
soup = BeautifulSoup(html, "lxml") # Note: lxml requires additional installation
markdown = convert_to_markdown(soup)


Github: https://github.com/Goldziher/html-to-markdown

https://t.me/DataScienceN ⭐️
Please open Telegram to view this post
VIEW IN TELEGRAM
❀4πŸ‘1
❀4
This media is not supported in your browser
VIEW IN TELEGRAM
LangExtract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

GitHub: https://github.com/google/langextract

https://t.me/DataScience4 πŸ–•
Please open Telegram to view this post
VIEW IN TELEGRAM
πŸ‘2❀1
This channels is for Programmers, Coders, Software Engineers.

0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ programming Languages

βœ… https://t.me/addlist/8_rRW2scgfRhOTc0

βœ… https://t.me/Codeprogrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
βœ… open-source alternative to Perplexity.

βœ… Real-time web search with Firecrawl API
βœ… Advanced answers with GPT-4o-mini
βœ… Every sentence with reference and source
βœ… Automatic stock display with TradingView


β”Œ πŸ” Fireplexity
β”œ
πŸ₯΅ Website
β”” 🐱 GitHub-Repos

https://t.me/DataScienceN 🌟
Please open Telegram to view this post
VIEW IN TELEGRAM
❀2
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 AI now generates worlds in the style of Minecraft β€” presenting the GameFactory model

Researchers trained the model on 70 hours of Minecraft gameplay and achieved impressive results: 
GameFactory can create procedural game worlds β€” from volcanoes to cherry blossom forests, just like in the iconic simulator.

πŸ”₯ Want your own endless world? Just set the parameters.

🟠 Examples and code β€” at the link: https://yujiwen.github.io/gamefactory/

🟠Github: https://github.com/KwaiVGI/GameFactory

https://t.me/DataScienceN 🌟
Please open Telegram to view this post
VIEW IN TELEGRAM
❀3
python-docx: Create and Modify Word Documents #python

python-docx is a Python library for reading, creating, and updating Microsoft Word 2007+ (.docx) files.

Installation
pip install python-docx

Example
from docx import Document

document = Document()
document.add_paragraph("It was a dark and stormy night.")
<docx.text.paragraph.Paragraph object at 0x10f19e760>
document.save("dark-and-stormy.docx")

document = Document("dark-and-stormy.docx")
document.paragraphs[0].text
'It was a dark and stormy night.'

https://t.me/DataScienceN πŸš—
Please open Telegram to view this post
VIEW IN TELEGRAM
❀2πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
Data scientists, this is for you β€” I dug up LeetCode for DS

DataLemur β€” a powerful platform that collects real interview problems from Tesla, Facebook, Twitter, Microsoft, and other top companies

Inside: practical tasks on SQL, statistics, Python, and ML. You can filter by difficulty level and company

Top-notch for those preparing for interviews for Data Scientist / Data Analyst roles. Get it here 🍯

πŸ‘‰ https://t.me/DataScienceN πŸ‘
Please open Telegram to view this post
VIEW IN TELEGRAM
❀2
πŸ”₯ Trending Repository: Deep-Learning-Roadmap

πŸ“ Description: :satellite: Organized Resources for Deep Learning Researchers and Developers

πŸ”— Repository URL: https://github.com/astorfi/Deep-Learning-Roadmap

🌐 Website: https://machinelearningmindset.com/deep-learning-resources/

πŸ“– Readme: https://github.com/astorfi/Deep-Learning-Roadmap#readme

πŸ“Š Statistics:
🌟 Stars: 3.2K stars
πŸ‘€ Watchers: 144
🍴 Forks: 314 forks

πŸ’» Programming Languages: Python

🏷️ Related Topics:
#reinforcement_learning #deep_learning


==================================
🧠 By: https://t.me/DataScienceM
❀1
πŸ”₯ Trending Repository: awesome-transformer-nlp

πŸ“ Description: A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

πŸ”— Repository URL: https://github.com/cedrickchee/awesome-transformer-nlp

πŸ“– Readme: https://github.com/cedrickchee/awesome-transformer-nlp#readme

πŸ“Š Statistics:
🌟 Stars: 1.1K stars
πŸ‘€ Watchers: 41
🍴 Forks: 131 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#nlp #natural_language_processing #awesome #transformer #neural_networks #awesome_list #llama #transfer_learning #language_model #attention_mechanism #bert #gpt_2 #xlnet #pre_trained_language_models #gpt_3 #gpt_4 #chatgpt


==================================
🧠 By: https://t.me/DataScienceM
πŸ”₯ Trending Repository: SemanticSegmentation_DL

πŸ“ Description: Resources of semantic segmantation based on Deep Learning model

πŸ”— Repository URL: https://github.com/tangzhenyu/SemanticSegmentation_DL

πŸ“– Readme: https://github.com/tangzhenyu/SemanticSegmentation_DL#readme

πŸ“Š Statistics:
🌟 Stars: 1.1K stars
πŸ‘€ Watchers: 77
🍴 Forks: 315 forks

πŸ’» Programming Languages: Jupyter Notebook - Python - Shell - sed

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.me/DataScienceM
πŸ”₯ Trending Repository: awesome-jetpack-compose-learning-resources

πŸ“ Description: πŸ‘“ A continuously updated list of learning Jetpack Compose for Android apps.

πŸ”— Repository URL: https://github.com/androiddevnotes/awesome-jetpack-compose-learning-resources

πŸ“– Readme: https://github.com/androiddevnotes/awesome-jetpack-compose-learning-resources#readme

πŸ“Š Statistics:
🌟 Stars: 1.4K stars
πŸ‘€ Watchers: 41
🍴 Forks: 140 forks

πŸ’» Programming Languages: Kotlin

🏷️ Related Topics:
#android #kotlin #awesome #mvvm #android_architecture #compose #beginner_friendly #android_apps #hacktoberfest #coroutines_android #mvvm_android #android_jetpack #first_issue #jetpack_android #learn_android #jetpack_compose #hacktoberfest2020 #android_compose #awesome_android


==================================
🧠 By: https://t.me/DataScienceM
❀1
πŸ”₯ Trending Repository: awesome-learning

πŸ“ Description: A curated list for DevOps learning resources. Join the slack channel to discuss more.

πŸ”— Repository URL: https://github.com/Lets-DevOps/awesome-learning

πŸ“– Readme: https://github.com/Lets-DevOps/awesome-learning#readme

πŸ“Š Statistics:
🌟 Stars: 920 stars
πŸ‘€ Watchers: 43
🍴 Forks: 310 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#infrastructure #learning #devops


==================================
🧠 By: https://t.me/DataScienceN
πŸ”₯ Trending Repository: Machine-Learning-Tutorials

πŸ“ Description: machine learning and deep learning tutorials, articles and other resources

πŸ”— Repository URL: https://github.com/ujjwalkarn/Machine-Learning-Tutorials

🌐 Website: http://ujjwalkarn.github.io/Machine-Learning-Tutorials

πŸ“– Readme: https://github.com/ujjwalkarn/Machine-Learning-Tutorials#readme

πŸ“Š Statistics:
🌟 Stars: 16.6K stars
πŸ‘€ Watchers: 797
🍴 Forks: 3.9K forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#list #machine_learning #awesome #deep_neural_networks #deep_learning #neural_network #neural_networks #awesome_list #machinelearning #deeplearning #deep_learning_tutorial


==================================
🧠 By: https://t.me/DataScienceN
❀2
πŸ”₯ Trending Repository: awesome-recursion-schemes

πŸ“ Description: Resources for learning and using recursion schemes.

πŸ”— Repository URL: https://github.com/passy/awesome-recursion-schemes

πŸ“– Readme: https://github.com/passy/awesome-recursion-schemes#readme

πŸ“Š Statistics:
🌟 Stars: 1.3K stars
πŸ‘€ Watchers: 44
🍴 Forks: 56 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#awesome #recursion_schemes #catamorphisms


==================================
🧠 By: https://t.me/DataScienceN
❀1
πŸ”₯ Trending Repository: awesome-deeplearning-resources

πŸ“ Description: Deep Learning and deep reinforcement learning research papers and some codes

πŸ”— Repository URL: https://github.com/endymecy/awesome-deeplearning-resources

πŸ“– Readme: https://github.com/endymecy/awesome-deeplearning-resources#readme

πŸ“Š Statistics:
🌟 Stars: 2.9K stars
πŸ‘€ Watchers: 221
🍴 Forks: 666 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#nlp #video #reinforcement_learning #deep_learning #neural_network #code #paper #corpus #modelzoo


==================================
🧠 By: https://t.me/DataScienceN
πŸ”₯ Trending Repository: Machine_Learning_Resources

πŸ“ Description: :fish::fish::fish: ζœΊε™¨ε­¦δΉ ι’θ―•ε€δΉ θ΅„ζΊ

πŸ”— Repository URL: https://github.com/wangyuGithub01/Machine_Learning_Resources

πŸ“– Readme: https://github.com/wangyuGithub01/Machine_Learning_Resources#readme

πŸ“Š Statistics:
🌟 Stars: 1.2K stars
πŸ‘€ Watchers: 10
🍴 Forks: 179 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics: Not available

==================================
🧠 By: https://t.me/DataScienceN
πŸ”₯ Trending Repository: Awesome-Meta-Learning

πŸ“ Description: A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.

πŸ”— Repository URL: https://github.com/sudharsan13296/Awesome-Meta-Learning

πŸ“– Readme: https://github.com/sudharsan13296/Awesome-Meta-Learning#readme

πŸ“Š Statistics:
🌟 Stars: 1.5K stars
πŸ‘€ Watchers: 68
🍴 Forks: 298 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#one_shot_learning #zero_shot_learning #metalearning #few_shot_learning #deep_meta_learning #meta_reinforcement


==================================
🧠 By: https://t.me/DataScienceN
πŸ”₯ Trending Repository: programming-math-science

πŸ“ Description: This is a list of links to different freely available learning resources about computer programming, math, and science.

πŸ”— Repository URL: https://github.com/bobeff/programming-math-science

πŸ“– Readme: https://github.com/bobeff/programming-math-science#readme

πŸ“Š Statistics:
🌟 Stars: 1.8K stars
πŸ‘€ Watchers: 26
🍴 Forks: 129 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#science #programming #math #awesome_list


==================================
🧠 By: https://t.me/DataScienceN
πŸ”₯ Trending Repository: awesome-knowledge-graph

πŸ“ Description: A curated list of Knowledge Graph related learning materials, databases, tools and other resources

πŸ”— Repository URL: https://github.com/totogo/awesome-knowledge-graph

πŸ“– Readme: https://github.com/totogo/awesome-knowledge-graph#readme

πŸ“Š Statistics:
🌟 Stars: 1.7K stars
πŸ‘€ Watchers: 41
🍴 Forks: 147 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#nlp #graph #knowledge_graph #graph_database #awesome_list


==================================
🧠 By: https://t.me/DataScienceN
πŸ”₯ Trending Repository: mlhub123

πŸ“ Description: ζœΊε™¨ε­¦δΉ &ζ·±εΊ¦ε­¦δΉ η½‘η«™θ΅„ζΊζ±‡ζ€»οΌˆMachine Learning ResourcesοΌ‰

πŸ”— Repository URL: https://github.com/howie6879/mlhub123

🌐 Website: https://www.mlhub123.com/

πŸ“– Readme: https://github.com/howie6879/mlhub123#readme

πŸ“Š Statistics:
🌟 Stars: 1.1K stars
πŸ‘€ Watchers: 30
🍴 Forks: 238 forks

πŸ’» Programming Languages: Not available

🏷️ Related Topics:
#machine_learning #deep_learning


==================================
🧠 By: https://t.me/DataScienceN
❀1