Chatbot Python (Exam 2) Flashcards
A computer program that simulates human conversation with an end user. It can understand human language, process it, and respond to the user.
Chatbot
- Import corpus
- Preprocess data
- Text case handling
- Tokenization
- Stemming
- Bag of words (BOW)
- One hot encoding
How does a chatbot work?
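The steps listed on this card can be sketched end to end as below. This is a minimal illustration using only the Python standard library, not a definitive implementation: the two-sentence corpus, the intent labels, and the helper names (`stem`, `preprocess`, `bag_of_words`, `one_hot`) are all assumptions made for the example, and the stemmer is a toy suffix stripper rather than a real stemming algorithm.

```python
import re
from collections import Counter

# Hypothetical miniature corpus: (sentence, intent label) pairs.
corpus = [
    ("Hello there", "greeting"),
    ("Goodbye for now", "farewell"),
]

def stem(word):
    # Toy stemmer (an assumption): strip a few common suffixes,
    # keeping at least three characters of the word.
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def preprocess(sentence):
    # Text case handling, then tokenization, then stemming.
    return [stem(token) for token in re.findall(r"\w+", sentence.lower())]

token_lists = [preprocess(sentence) for sentence, _ in corpus]
vocabulary = sorted({token for tokens in token_lists for token in tokens})
labels = sorted({label for _, label in corpus})

def bag_of_words(tokens):
    # Count how often each vocabulary word occurs; word order is ignored.
    counts = Counter(tokens)
    return [counts[word] for word in vocabulary]

def one_hot(label):
    # A single 1 marks the position of the label among all labels.
    return [1 if candidate == label else 0 for candidate in labels]

training = [
    (bag_of_words(tokens), one_hot(label))
    for tokens, (_, label) in zip(token_lists, corpus)
]
```

Each entry of `training` pairs a numeric input vector with a numeric label vector, which is the form a machine learning model would typically be trained on.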
The training data needed for the chatbot to learn. Without it, a chatbot cannot effectively understand and respond to the user.
Corpus
Text case handling: convert all input data to either upper or lower case. This prevents the same word from being misinterpreted as a different word depending on how it is capitalized.
Data preprocessing
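As a small illustration of the case handling described on this card, the sketch below lowercases the input before any comparison. The function name `normalize_case` is hypothetical.

```python
def normalize_case(text: str) -> str:
    """Return the text in lower case so 'Hello' and 'hello' compare equal."""
    return text.lower()

# Without normalization these two inputs look like different strings.
print("Hello" == "hello")                                  # False
print(normalize_case("Hello") == normalize_case("hello"))  # True
```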
The structured process of splitting a sentence into a collection of individual words.
Tokenization
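One simple way to tokenize, assuming that words are runs of letters or digits and that punctuation can be discarded, is a regular expression. The function name `tokenize` is hypothetical, and dedicated NLP libraries handle more cases (contractions, hyphens) than this sketch does.

```python
import re

def tokenize(sentence: str) -> list[str]:
    # \w+ matches runs of letters, digits, and underscores,
    # so spaces and punctuation act as separators and are dropped.
    return re.findall(r"\w+", sentence.lower())

print(tokenize("Hi, how are you?"))  # ['hi', 'how', 'are', 'you']
```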
The process of reducing words to their root form so that words sharing the same root are treated as similar.
Stemming
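The idea can be shown with a deliberately naive suffix stripper. This is a toy written for illustration, not an established stemming algorithm such as Porter's; the suffix list and the name `naive_stem` are assumptions, and it will mishandle many real words.

```python
SUFFIXES = ("ing", "ed", "es", "s")

def naive_stem(word: str) -> str:
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

# All four forms reduce to the same root.
print([naive_stem(w) for w in ["walk", "walks", "walked", "walking"]])
# ['walk', 'walk', 'walk', 'walk']
```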
A representation of text data where each word in the text is treated as an individual token, regardless of its position in the text. The context and order of words are not preserved; only the frequency of each word is recorded.
Bag of words
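A minimal sketch of a bag-of-words vector, assuming a fixed vocabulary chosen in advance. The vocabulary, the token lists, and the function name `bag_of_words` are made up for the example.

```python
from collections import Counter

def bag_of_words(tokens, vocabulary):
    # Count each token, then read the counts off in vocabulary order.
    counts = Counter(tokens)
    return [counts[word] for word in vocabulary]

vocabulary = ["hello", "how", "are", "you", "bye"]

# Same words in a different order give the same vector,
# because only frequency is kept.
print(bag_of_words(["how", "are", "you", "you"], vocabulary))  # [0, 1, 1, 2, 0]
print(bag_of_words(["you", "you", "are", "how"], vocabulary))  # [0, 1, 1, 2, 0]
```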
A process that converts categorical information into a format that may be fed into machine learning algorithms to improve prediction accuracy.
One hot encoding
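In the usual form of this encoding, each category becomes a vector with a single 1 at that category's position and 0 everywhere else. The sketch below assumes a small, fixed list of categories; the intent names and the function name `one_hot` are hypothetical.

```python
def one_hot(label, categories):
    if label not in categories:
        raise ValueError(f"unknown category: {label!r}")
    # Exactly one position is 1: the one matching the label.
    return [1 if category == label else 0 for category in categories]

categories = ["greeting", "goodbye", "thanks"]
print(one_hot("goodbye", categories))  # [0, 1, 0]
```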