Epython Lab
6.41K subscribers
665 photos
31 videos
104 files
1.23K links
Welcome to Epython Lab, where you can get resources to learn, one-on-one trainings on machine learning, business analytics, and Python, and solutions for business problems.

Buy ads: https://telega.io/c/epythonlab
Download Telegram
#Keynote #DataScience #NLP #Python @epythonlab

Natural Language Processing

Natural language processing (NLP) is the field devoted to methods and algorithms for processing human (natural) languages for computers. NLP is a vast discipline that is actively being researched. Some examples of machine learning applications using NLP include sentiment analysis, topic modeling, and language translation. In NLP, the following terms have specific meanings:

- Corpus: The body/collection of text being investigated.
- Document: The unit of analysis, what is considered a single observation.

Examples of corpora include a collection of reviews and tweets, the text of the Iliad, and Wikipedia articles. Documents can be whatever you decided, it is what your model will consider an observation. For the example when the corpus is a collection of reviews or tweets, it is logical to make the document a single review or tweet. For the example of the text of the Iliad, we can set the document size to a sentence or a paragraph. The choice of document size will be influenced by the size of our corpus. If it is large, it may make sense to call each paragraph a document. As is usually the case, some design choices that need to be made.
๐Ÿ“š 5 Free Books on Natural Language Processing to Read in 2023

1. Speech and Language Processing
Authors: Dan Jurafsky and James H. Martin

2. Foundations of Statistical Natural Language Processing
Authors: Christopher D. Manning and Hinrich Schรผtze


3. Pattern Recognition and Machine Learning
Author: Christopher M. Bishop

4. Neural Network Methods in Natural Language Processing
Author: Yoav Goldberg

5. Practical Natural Language Processing

https://epythonlab.t.me/

#books #python #machinelearning #nlp
โค5๐Ÿ‘2
๐Ÿ“š 5 Free Books on Natural Language Processing to Read in 2024

1. Speech and Language Processing
Authors: Dan Jurafsky and James H. Martin

2. Foundations of Statistical Natural Language Processing
Authors: Christopher D. Manning and Hinrich Schรผtze


3. Pattern Recognition and Machine Learning
Author: Christopher M. Bishop

4. Neural Network Methods in Natural Language Processing
Author: Yoav Goldberg

5. Practical Natural Language Processing

https://epythonlab.t.me/

#books #python #machinelearning #nlp
โค3๐Ÿ‘3
๐Ÿ“ขDay 16/100: Tackling Amharic NLP Challenges

Amharic presents unique challenges in natural language processing (NLP), from its complex script to a lack of annotated datasets.



My approach: Fine-tune Large Language Models (LLMs) for Amharic Named Entity Recognition (NER) to extract product names, prices, and locations from Telegram messages.



๐Ÿ’ก Discussion: What strategies can we adopt to make NLP more accessible for low-resource languages like Amharic?

#NLP #AI #Amharic #FintechEthiopia
๐Ÿ“ขDay 19/100: Choosing the Right Language Model

For Amharic Named Entity Recognition, we fine-tuned three models:

1๏ธโƒฃ XLM-Roberta: Best for multilingual NLP.

2๏ธโƒฃ mBERT: Balanced performance.

3๏ธโƒฃ DistilBERT: Lightweight but slightly less accurate.

๐Ÿ’ก Insight: XLM-Roberta outperformed others in accuracy and entity recognition for Amharic e-commerce data.

๐Ÿ’ก Question: Whatโ€™s your experience with fine-tuning NLP models for underrepresented languages?

#AI #NLP #ModelSelection #FintechAfrica
๐Ÿ“ข๐——๐—ฎ๐˜† ๐Ÿฎ๐Ÿญ/๐Ÿญ๐Ÿฌ๐Ÿฌ: ๐—ง๐—ฟ๐—ฎ๐—ถ๐—ป๐—ถ๐—ป๐—ด ๐—”๐—บ๐—ต๐—ฎ๐—ฟ๐—ถ๐—ฐ ๐—ก๐—˜๐—ฅ ๐— ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€

I fine-tuned models on 27,989 labeled examples, optimizing key parameters:

- Learning rate: Experimented to find the sweet spot.

- Batch size: Limited to 16 to manage memory constraints.

- Metrics: Focused on precision, recall, and F1-score.



๐Ÿ’ก Finding: Smaller batches helped balance performance and computational efficiency.

๐Ÿ’ก Question: How do you optimize parameters for low-resource NLP tasks?

#AI #ModelTraining #Ethiopia #NLP