Epython Lab
6.45K subscribers
660 photos
31 videos
104 files
1.22K links
Welcome to Epython Lab, where you can get resources to learn, one-on-one trainings on machine learning, business analytics, and Python, and solutions for business problems.

Buy ads: https://telega.io/c/epythonlab
Download Telegram
The get_feature_names() method returns our feature names for this dataset, which is the set of words that make up our vocabulary for 'documents'.
Instructions: Create a matrix with each row representing one of the 4 documents, and each column representing a word (feature name). Each value in the matrix will represent the frequency of the word in that column occurring in the particular document in that row. You can do this using the transform() method of CountVectorizer, passing in the document data set as the argument. The transform() method returns a matrix of NumPy integers, which you can convert to an array using toarray(). Call the array 'doc_array'.
πŸ‘3
Now we have a clean representation of the documents in terms of the frequency distribution of the words in them. To make it easier to understand our next step is to convert this array into a dataframe and name the columns appropriately.

Instructions: Convert the 'doc_array' we created into a dataframe, with the column names as the words (feature names). Call the dataframe 'frequency_matrix'.
πŸ‘3
Epython Lab
Project Idea: Building a spam classifier Introduction Spam detection is one of the major applications of Machine Learning in the interwebs today. Pretty much all of the major email service providers have spam detection systems built in and automatically…
Congratulations! We have successfully implemented a Bag of Words problem for a document dataset that we created. https://t.me/epythonlab/689

One potential issue that can arise from using this method is that if our dataset of text is extremely large (say if we have a large collection of news articles or email data), there will be certain values that are more common than others simply due to the structure of the language itself. For example, words like 'is', 'the', 'an', pronouns, grammatical constructs, etc., could skew our matrix and affect our analysis.

There are a couple of ways to mitigate this. One way is to use the stop_words parameter and set its value to english. This will automatically ignore all the words in our input text that are found in a built-in list of English stop words in scikit-learn.

Another way of mitigating this is by using the tfidf(Term frequency index document frequency) method.
πŸ‘4
Question: We have implemented bag of words with and without scikit-learn. Let you understand each solutions of the problem posted here https://t.me/epythonlab/689 and write short summary which method(without scikit-learn or using python code and with using scikit-learn library) do you think is best option to implement bag of words?
Send your summary to @asibehtenager. Your summary will be posted on the channel that help others to learn from you.
πŸ‘3
Best Selling Python Book on Amazon for free. Who want this book? you can find the pdf at the next post. Like or Dislike?
#pythonbooks @epythonlab
πŸ‘4❀1
Python_Crash_Course,_2nd_Edition:_A_Hands_On,_Project_Based_Introduction.pdf
5.4 MB
Best Selling  Python Book on Amazon for free. Who want this book? you can find the pdf at https://t.me/epythonlab/709/

Python Crash Course, 2nd Edition: A Hands-On, Project-Based Introduction to Programming 2nd Edition
❀3πŸ‘1
We have plan to demonstrate how to build a simple blog app using Flask. We want to know that how many of you are absolute beginner in Flask Framework? We will send you zoom link via your telegram account on Friday @2:00 PM(GMT+3). Replay to this poll.
Final Results
56%
I know HTML, CSS, JS, and Python but not Flask
8%
I know Flask
35%
I don't know all
Auto Scraper Bot.pdf
136.2 KB
Telegram shilling bot for Crypto traders, msg to @asibehtenager
πŸ‘1
What is the output?
Guess what?

print(list("πŸ‡±πŸ‡·"))
Final Results
49%
["πŸ‡±πŸ‡·"]
12%
["US"]
8%
["U", "S"]
32%
Type errror
πŸ‘4πŸŽ‰2😱1
Machine Learning with Python for Everyone

πŸ“– Book https://mariapilot.noblogs.org/files/2020/10/Machine-Learning-With-Python-For-Everyone-Pearson-2020.pdf

@epythonlab Help us growing our channel to make it better learning platform by forwarding this post to at least for one friend.
πŸ‘6πŸ”₯2
Best Python book for you: Python for you and me 2022

πŸ“– Book : https://buildmedia.readthedocs.org/media/pdf/pymbook/latest/pymbook.pdf

@epythonlab #share #pythonbooks
πŸ”₯5
Flask MySQL connection using SQLAlchemy session
Date: Tuesday @2:00 PM( UTC )
Venue: Zoom
Fee: Free
Reserve your seat @asibehtenager
Max time: 40 minutes.
πŸ‘4