Python | Machine Learning | Coding | R
67K subscribers
1.24K photos
89 videos
152 files
891 links
Help and ads: @hussein_sheikho

Discover powerful insights with Python, Machine Learning, Coding, and R—your essential toolkit for data-driven solutions, smart alg

List of our channels:
https://t.me/addlist/8_rRW2scgfRhOTc0

https://telega.io/?r=nikapsOH
👨🏻‍💻 This Python library helps you extract usable data for language models from complex files like tables, images, charts, or multi-page documents.

📝 Unlike common methods such as OCR, which only read the text, Agentic Document Extraction also understands the structure of a document and the relationships between its parts. For example, it knows which title belongs to which table or image.


Works with PDFs, images, and website links.

☑️ Can chunk and process very large documents (up to 1000 pages) by itself.

✔️ Outputs both JSON and Markdown formats.

☑️ Even specifies the exact location of each section on the page.

✔️ Supports parallel and batch processing.

pip install agentic-doc
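
A minimal usage sketch, assuming the `parse` entry point and the `markdown` / `chunks` fields shown in the project's README. The `extract` helper and the file name are made up for illustration, and the service needs an API key (the README names a `VISION_AGENT_API_KEY` environment variable; check it for the current setup).

```python
def extract(path: str):
    """Parse one document and return (markdown, chunks)."""
    # Imported lazily so this file loads even where agentic-doc is not installed.
    from agentic_doc.parse import parse

    results = parse(path)   # assumed to return one parsed document per input
    doc = results[0]
    return doc.markdown, doc.chunks

# Usage (hypothetical file name, needs the package and an API key):
# markdown, chunks = extract("report.pdf")
# print(markdown)
```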


🥵 Agentic Document Extraction
🌎 Website
🐱 GitHub Repos

🌐 #DataScience

https://t.me/CodeProgrammer
7👍2🔥1
Forwarded from Thor data
🚀 Thordata Proxy: Bypass Anti-Scraping for Data Projects

Facing these issues in data collection?
🔴 IP blocks interrupting workflows
🟡 CAPTCHAs breaking automation
🟢 Geo-restrictions limiting data access
Thordata Proxy provides high-performance proxy solutions for ML/DS professionals:

🔥 Key Features
Seamless Integration: Native support for Python (Requests/Scrapy/Selenium), R, Spark
Global Coverage: 200+ countries with city-level targeting
Anti-Blocking: Residential/ISP proxies mimic real users
Low Latency: <0.8s average response time, 99.9% uptime
Compliant: GDPR/CCPA compliant for public data only

📊 Perfect For:
Training data collection for ML models, competitive pricing monitoring, cross-region social media analysis, ad verification testing

🌟 Community Offer
🔗 Start now: https://www.thordata.com/?ls=DhthVzyG&lk=Data
20% off with code: IsyGLO5o

Official Channel : https://t.me/thordataproxy
7
1. What is the output of the following code?
x = [1, 2, 3]
y = x
y.append(4)
print(x)


2. Which of the following data types is immutable in Python?
A) List
B) Dictionary
C) Set
D) Tuple

3. Write a Python program to reverse a string without using built-in functions.

4. What will be printed by this code?
def func(a, b=[]):
    b.append(a)
    return b

print(func(1))
print(func(2))


5. Explain the difference between == and is operators in Python.

6. How do you handle exceptions in Python? Provide an example.

7. What is the output of:
print(2 ** 3 ** 2)


8. Which keyword is used to define a function in Python?
A) def
B) function
C) func
D) define

9. Write a program to find the factorial of a number using recursion.

10. What does the *args parameter do in a function?

11. What will be the output of:
list1 = [1, 2, 3]
list2 = list1.copy()
list2[0] = 10
print(list1)


12. Explain the concept of list comprehension with an example.

13. What is the purpose of the __init__ method in a Python class?

14. Write a program to check if a given string is a palindrome.

15. What is the output of:
a = [1, 2, 3]
b = a[:]
b[0] = 10
print(a)


16. Describe how Python manages memory (garbage collection).

17. What will be printed by:
x = "hello"
y = "world"
print(x + y)


18. Write a Python program to generate the first n Fibonacci numbers.

19. What is the difference between range() and xrange() in Python 2?

20. What is the use of the lambda function in Python? Give an example.
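
The "what is the output" questions above (1, 4, 7, 11, 15) can be checked by running them. A small sketch, with a few extra variable names (`first`, `second`, `power`) added only so the results can be inspected:

```python
# Q1: y is another name for the same list, so appending through y changes x.
x = [1, 2, 3]
y = x
y.append(4)
print(x)  # [1, 2, 3, 4]

# Q4: the default list is created once, when the function is defined,
# and is shared by every call that omits b.
def func(a, b=[]):
    b.append(a)
    return b

first = list(func(1))    # snapshot, because the shared list keeps growing
second = list(func(2))
print(first, second)  # [1] [1, 2]

# Q7: ** is right-associative, so this is 2 ** (3 ** 2) = 2 ** 9.
power = 2 ** 3 ** 2
print(power)  # 512

# Q11 and Q15: list.copy() and a[:] both make a shallow copy,
# so reassigning an element of the copy leaves the original alone.
list1 = [1, 2, 3]
list2 = list1.copy()
list2[0] = 10
a = [1, 2, 3]
b = a[:]
b[0] = 10
print(list1, a)  # [1, 2, 3] [1, 2, 3]
```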

#PythonQuiz #CodingTest #ProgrammingExam #MultipleChoice #CodeOutput #PythonBasics #InterviewPrep #CodingChallenge #BeginnerPython #TechAssessment #PythonQuestions #SkillCheck #ProgrammingSkills #CodePractice #PythonLearning #MCQ #ShortAnswer #TechnicalTest #PythonSyntax #Algorithm #DataStructures #PythonProgramming

By: @DataScienceQ 🚀
9
📌 How To Learn AI (Roadmap)

🗂 Category: ARTIFICIAL INTELLIGENCE

🕒 Date: 2024-08-05 | ⏱️ Read time: 11 min read

A full breakdown of how you can learn AI this year effectively
4👍1
1. What is the output of the following code?
x = [1, 2, 3]
y = x
y[0] = 4
print(x)

2. Which of the following is NOT a valid way to create a dictionary in Python?
A) dict(a=1, b=2)
B) {a: 1, b: 2}
C) dict([('a', 1), ('b', 2)])
D) {1: 'a', 2: 'b'}

3. Write a function that takes a list of integers and returns a new list containing only even numbers.

4. What will be printed by this code?
def func(a, b=[]):
    b.append(a)
    return b
print(func(1))
print(func(2))

5. What is the purpose of the __slots__ attribute in a Python class?

6. Which built-in function can be used to remove duplicates from a list while preserving order?

7. Explain the difference between map(), filter(), and reduce() with examples.

8. What does the @staticmethod decorator do in Python?

9. Write a generator function that yields Fibonacci numbers up to a given limit.

10. What is the output of this code?
import copy
a = [1, 2, [3, 4]]
b = copy.deepcopy(a)
b[2][0] = 5
print(a[2][0])

11. Which of the following is true about Python’s GIL (Global Interpreter Lock)?
A) It allows multiple threads to execute Python bytecode simultaneously.
B) It prevents race conditions in multithreaded programs.
C) It limits CPU-bound multi-threaded performance.
D) It is disabled in PyPy.

12. How would you implement a context manager using a class?

13. What is the result of bool([]) and why?

14. Write a recursive function to calculate the factorial of a number.

15. What is the difference between is and == in Python?

16. Explain how Python handles memory management for objects.

17. What is the output of this code?
class A:
    def __init__(self):
        self.x = 1

class B(A):
    def __init__(self):
        super().__init__()
        self.y = 2

obj = B()
print(hasattr(obj, 'x') and hasattr(obj, 'y'))

18. Describe the use of *args and **kwargs in function definitions.

19. Write a program that reads a text file and counts the frequency of each word.

20. What is monkey patching in Python and when might it be useful?
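
A few of the questions above (6, 10, 13, 17) are quick to verify by running them. A minimal sketch; the `dedupe` helper is a made-up name, and `dict.fromkeys` is one common answer to question 6 rather than the only one:

```python
import copy

# Q10: deepcopy also copies the nested list, so the original is untouched.
a = [1, 2, [3, 4]]
b = copy.deepcopy(a)
b[2][0] = 5
print(a[2][0])  # 3

# Q13: an empty list is falsy.
print(bool([]))  # False

# Q6: dict keys keep insertion order (Python 3.7+), so dict.fromkeys
# drops duplicates while preserving order. Items must be hashable.
def dedupe(items):
    return list(dict.fromkeys(items))

print(dedupe([3, 1, 3, 2, 1]))  # [3, 1, 2]

# Q17: super().__init__() runs A's initializer, so obj has both attributes.
class A:
    def __init__(self):
        self.x = 1

class B(A):
    def __init__(self):
        super().__init__()
        self.y = 2

obj = B()
print(hasattr(obj, 'x') and hasattr(obj, 'y'))  # True
```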

#Python #AdvancedPython #ProgrammingTest #CodingChallenge #PythonInterview #PythonDeveloper #CodeQuiz #HighLevelPython #LearnPython #PythonSkills #PythonExpert

By: @DataScienceQ 🚀
9
Andrew Ng launches a free course on AI agents 😮

The course covers four key patterns:

Reflection: the agent reviews and improves its own responses

Tool use: the agent calls external tools

Planning: the agent plans its actions step by step

Multi-agent collaboration: multiple agents work together on one task

Everything is implemented in pure Python. Andrew emphasizes that creating AI agents is one of the most in-demand skills in the market.

Available here: https://www.deeplearning.ai/courses/agentic-ai/
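
To make the first pattern concrete, here is a rough pure-Python sketch of a reflection loop. It is not taken from the course: `reflect`, `toy_generate`, and `toy_critique` are hypothetical names, and the two toy functions stand in for real model calls so the example runs offline.

```python
from typing import Callable

def reflect(task: str,
            generate: Callable[[str], str],
            critique: Callable[[str, str], str],
            max_rounds: int = 3) -> str:
    """Draft an answer, then revise it until the critic has no feedback."""
    draft = generate(task)
    for _ in range(max_rounds):
        feedback = critique(task, draft)
        if not feedback:          # empty feedback means "good enough"
            break
        # Feed the critique back in so the next draft can address it.
        draft = generate(f"{task}\nPrevious draft: {draft}\nFeedback: {feedback}")
    return draft

# Stand-ins for model calls (hypothetical, for illustration only).
def toy_generate(prompt: str) -> str:
    return "hello world." if "Feedback:" in prompt else "hello world"

def toy_critique(task: str, draft: str) -> str:
    return "" if draft.endswith(".") else "End the sentence with a period."

answer = reflect("Write a greeting.", toy_generate, toy_critique)
print(answer)  # hello world.
```

In a real agent, `generate` and `critique` would each wrap a call to a language model, and `max_rounds` keeps the loop from running forever.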

👉  @codeprogrammer
4
🤖🧠 Join the 5-Day AI Agents Intensive Course with Google

🗓️ 07 Oct 2025
📚 AI News & Trends

Artificial Intelligence is rapidly evolving beyond chatbots and text generation. The next frontier is AI agents — intelligent, autonomous systems that can reason, take action and collaborate with tools and other agents. To help developers and practitioners build these next-generation systems, Google is launching the 5-Day AI Agents Intensive, a no-cost, online program running from ...

#aiagents #dayai #googleartificial #agentsintelligent #ai #evolvingchatbots
6👍1
🤖🧠 The Little Book of Deep Learning – A Complete Summary and Chapter-Wise Overview

🗓️ 08 Oct 2025
📚 AI News & Trends

In the ever-evolving world of Artificial Intelligence, deep learning continues to be the driving force behind breakthroughs in computer vision, speech recognition and natural language processing. For those seeking a clear, structured and accessible guide to understanding how deep learning really works, “The Little Book of Deep Learning” by François Fleuret is a gem. This ...

#DeepLearning #ArtificialIntelligence #MachineLearning #NeuralNetworks #AIGuides #FrancoisFleuret
5
📌 Understanding Positional Embeddings in Transformers: From Absolute to Rotary

🗂 Category: DEEP LEARNING

🕒 Date: 2024-07-20 | ⏱️ Read time: 19 min read

A deep dive into absolute, relative, and rotary positional embeddings with code examples
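
As a taste of the topic, here is a small standard-library sketch of two of the schemes named above: sinusoidal absolute encodings and rotary embeddings. It is not the article's code; the function names, the base of 10000, and the test vectors are assumptions for illustration, and `dim` is assumed to be even.

```python
import math

def sinusoidal(pos: int, dim: int) -> list:
    """Absolute encoding: a (sin, cos) pair per frequency."""
    out = []
    for i in range(0, dim, 2):
        angle = pos / (10000 ** (i / dim))
        out += [math.sin(angle), math.cos(angle)]
    return out

def rotary(vec: list, pos: int) -> list:
    """Rotate each consecutive pair of vec by an angle proportional to pos."""
    dim = len(vec)
    out = []
    for i in range(0, dim, 2):
        theta = pos / (10000 ** (i / dim))
        c, s = math.cos(theta), math.sin(theta)
        out += [vec[i] * c - vec[i + 1] * s, vec[i] * s + vec[i + 1] * c]
    return out

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

q, k = [1.0, 0.5, -0.3, 2.0], [0.2, -1.0, 0.7, 0.4]
# Both position pairs are 3 apart, so the rotary attention scores should match.
score_a = dot(rotary(q, 5), rotary(k, 2))
score_b = dot(rotary(q, 13), rotary(k, 10))
print(math.isclose(score_a, score_b, abs_tol=1e-9))
```

The last check shows the property that makes rotary embeddings attractive: the query-key score depends only on the distance between the two positions, not on where they sit in the sequence.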
5
🤖🧠 Build a Large Language Model From Scratch: A Step-by-Step Guide to Understanding and Creating LLMs

🗓️ 08 Oct 2025
📚 AI News & Trends

In recent years, Large Language Models (LLMs) have revolutionized the world of Artificial Intelligence (AI). From ChatGPT and Claude to Llama and Mistral, these models power the conversational systems, copilots, and generative tools that dominate today’s AI landscape. However, for most developers and learners, the inner workings of these systems remain a mystery until now. ...

#LargeLanguageModels #LLM #ArtificialIntelligence #DeepLearning #MachineLearning #AIGuides
3
📌 Learn Transformer Fine-Tuning and Segment Anything

🗂 Category: MACHINE LEARNING

🕒 Date: 2024-06-30 | ⏱️ Read time: 13 min read

Train Meta’s SAM to segment high fidelity masks for any domain
6
💠 The Best Tool for Extracting Data from PDF Files!

👩🏻‍💻 PDF files such as financial reports, scientific articles, or data analyses are usually full of tables, formulas, and complex text.

⬅️ Most tools extract only the text and destroy the data structure, so important information is lost.

But Docling uses AI to preserve those structures (text, tables, formulas) as they appear in the file, then converts the data into a structured format that AI models can work with.

The interesting point is that with just three lines of Python code, you can convert any PDF into searchable data!
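
A sketch of what those few lines can look like, assuming Docling's `DocumentConverter` API as described in its documentation. The `pdf_to_markdown` wrapper and the file name are made up for illustration.

```python
def pdf_to_markdown(source: str) -> str:
    """Convert a PDF (local path or URL) to Markdown with Docling."""
    # Imported lazily so this file loads even where docling is not installed.
    from docling.document_converter import DocumentConverter

    converter = DocumentConverter()
    result = converter.convert(source)
    return result.document.export_to_markdown()

# Usage (hypothetical file name, needs `pip install docling`):
# print(pdf_to_markdown("report.pdf"))
```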

🥵 Docling
🔎 Article
📄 Documentation
🐱 GitHub-Repos

🌐 #Data_Science #DataScience
4👍1
🔥 Trending Repository: Prompt-Engineering-Guide

📝 Description: 🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

🔗 Repository URL: https://github.com/dair-ai/Prompt-Engineering-Guide

🌐 Website: https://www.promptingguide.ai/

📖 Readme: https://github.com/dair-ai/Prompt-Engineering-Guide#readme

📊 Statistics:
🌟 Stars: 63K stars
👀 Watchers: 668
🍴 Forks: 6.6K forks

💻 Programming Languages: MDX - Jupyter Notebook

🏷️ Related Topics:
#deep_learning #openai #language_model #prompt_engineering #generative_ai #chatgpt


==================================
🧠 By: https://t.me/DataScienceM
7👍1
🔰 Email automation using Python

Why type emails when Python can do it for you? Work smarter, not harder... unless you’re debugging. 😅💻
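
A minimal sketch of the idea using only the standard library. The addresses, subject, and server details are placeholders, and `build_message` / `send` are hypothetical helper names; many providers also require an app password or a different port, so treat `send` as a starting point.

```python
import smtplib
from email.message import EmailMessage

def build_message(sender, recipient, subject, body):
    """Assemble a plain-text email."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(body)
    return msg

def send(msg, host, port, user, password):
    """Send over SMTP with STARTTLS; host, port, and credentials are placeholders."""
    with smtplib.SMTP(host, port) as server:
        server.starttls()
        server.login(user, password)
        server.send_message(msg)

msg = build_message("me@example.com", "you@example.com",
                    "Weekly report", "Report attached soon.")
print(msg["Subject"])  # Weekly report

# Usage (placeholder server and credentials):
# send(msg, "smtp.example.com", 587, "me@example.com", "app-password")
```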
9👍1
📌 Mastering Object Counting in Videos

🗂 Category:

🕒 Date: 2024-06-25 | ⏱️ Read time: 8 min read

Step-by-step guide to counting strolling ants on a tree using detection and tracking techniques.
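
The counting half of that pipeline can be sketched without any vision library. The example below is not the article's method: it assumes a detector has already produced (x, y) centroids for each frame, and it uses a simple greedy nearest-neighbour tracker with a made-up `max_dist` threshold. An object that goes undetected for a frame is counted again when it reappears.

```python
import math

def count_objects(frames, max_dist=20.0):
    """Count distinct objects from per-frame centroid detections.

    A detection continues the closest unmatched track from the previous
    frame if it lies within max_dist; otherwise it starts a new track.
    """
    prev = {}        # track id -> last known (x, y)
    next_id = 0
    for detections in frames:
        current = {}
        free = dict(prev)   # tracks not yet matched in this frame
        for point in detections:
            best = min(free, key=lambda t: math.dist(free[t], point), default=None)
            if best is not None and math.dist(free[best], point) <= max_dist:
                current[best] = point
                del free[best]
            else:
                current[next_id] = point
                next_id += 1
        prev = current
    return next_id

# Two objects drift slowly; a third appears in the last frame.
frames = [
    [(0, 0), (100, 100)],
    [(5, 3), (104, 98)],
    [(9, 7), (108, 95), (300, 300)],
]
print(count_objects(frames))  # 3
```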
6
⚙️ This tool is turning the world of Web Scraping upside down!

👨🏻‍💻 A new tool called Crawl4AI makes web scraping and data extraction from websites much easier, faster, and smarter! It is designed especially for feeding data to AI models such as ChatGPT.

1⃣ Its special features:

🔹 Completely free and open-source. That means you can use it however you want without any cost.

🔹 Works much faster than paid tools.

🔹 Its outputs are AI-friendly, such as JSON, HTML, or Markdown.

🔹 Can extract data from multiple websites simultaneously.

🔹 Collects images, videos, and audio from pages as well.

🔹 Extracts all internal and external links for you.
                  

🔢 More advanced features:

🔹 Takes screenshots of pages and collects metadata (like title, description, tags).

🔹 You can write custom code or special settings like auth and headers.

🔹 You can even set a custom browser User-Agent so requests look like they come from a regular browser.

🔹 Before starting extraction, it can run your custom JavaScript code.
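
A minimal sketch of a single-page crawl, assuming the `AsyncWebCrawler` interface from the Crawl4AI README. The `fetch_markdown` helper and the URL are made up for illustration.

```python
import asyncio

async def fetch_markdown(url: str) -> str:
    """Crawl one page and return its content as Markdown."""
    # Imported lazily so this file loads even where crawl4ai is not installed.
    from crawl4ai import AsyncWebCrawler

    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(url=url)
        return str(result.markdown)

# Usage (needs `pip install crawl4ai`, its browser setup, and network access):
# print(asyncio.run(fetch_markdown("https://example.com")))
```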

♦️ Crawl4AI
🐱 GitHub Repos

🌐 #DataScience

https://t.me/CodeProgrammer
7
🤖🧠 Diffusion Transformers with Representation Autoencoders (RAE): The Next Leap in Generative AI

🗓️ 14 Oct 2025
📚 AI News & Trends

Diffusion Transformers (DiTs) have revolutionized image and video generation, enabling stunningly realistic outputs in systems like Stable Diffusion and Imagen. However, despite innovations in transformer architectures and training methods, one crucial element of the diffusion pipeline has remained largely stagnant: the autoencoder that defines the latent space. Most current diffusion models still depend on Variational ...

#DiffusionTransformers #RAE #GenerativeAI #StableDiffusion #Imagen #LatentSpace
1