Bag of Word (BoW) Model is used to extract the features from text so that the text can be used in modeling in machine learning algorithms.Now the question arises that why we need to extract the features from text. It is because the machine learning algorithms cannot work with raw data and they need...
Tuesday, June 30, 2020
Monday, June 29, 2020
Dividing Data into Chunks: Chunking
The main job of chunking is to identify the parts of speech and short phrases like noun phrases. We have already studied the process of tokenization, the creation of tokens. Chunking basically is the labeling of those tokens. In other words, chunking will show us the structure of the sentence.There...
Sunday, June 28, 2020
Tokenization, Stemming, and Lemmatization
TokenizationIt may be defined as the process of breaking the given text i.e. the character sequence into smaller units called tokens. The tokens may be the words, numbers or punctuation marks. It is also called word segmentation. Following is a simple example of tokenization:Input: Mango, banana, pineapple...
Saturday, June 27, 2020
Components of NLP
Natural Language Processing (NLP) refers to AI method of communicating with intelligent systems using a natural language such as English.Processing of Natural Language is required when you want an intelligent system like robot to perform as per your instructions, when you want to hear decision from...
Friday, June 26, 2020
Building a KNN classifier
A K-Nearest Neighbors (KNN) classifier is a classification model that uses the nearest neighbors algorithm to classify a given data point. We have implemented the KNN algorithm in the last post, now we are going to build a KNN classifier using that algorithm.Concept of KNN ClassifierThe basic concept...
Thursday, June 25, 2020
Finding Nearest Neighbors
If we want to build recommender systems such as a movie recommender system then we need to understand the concept of finding the nearest neighbors. It is because the recommender system utilizes the concept of nearest neighbors.The concept of finding nearest neighbors may be defined as the process of...