AI applications and methodologies Flashcards
Chatbots
Chatbot is one of the applications of AI which simulate conversation
with humans either through voice commands or through text chats or
both.
We can define Chatbot as an AI application that can imitate a real
conversation with a user in their natural language.
They enable communication via text or audio on websites, messaging
applications, mobile apps, or telephones.
Types of chatbots
- Rule based
Machine based
Rule based chatbots
This is the simpler form of Chatbot which follows a set of pre-defined rules in
responding to user’s questions.
* For example, a Chatbot installed at a school reception area,
- Rule based Chatbot can only answer questions that it is trained to. If you ask a question
that it is not trained, it will not be able to answer
AI based chatbots
Such Chatbots are advanced
forms of chatter-bots capable
of holding complex
conversations in realtime.
They process the questions
(using neural network layers)
before responding to them. AI
based Chatbots also learn
from previous experience and
reinforced learning and it
keeps on evolving.
- . AI-based or NLP-based bot identifies the language, context
and intent and then it reacts accordingly. A rule-based bot only
understands a pre-defined set of options.
NLP
- The technology which enables the machines (software) to understand
and process the natural language (of humans), is called natural language
processing (NLP). - it is defined as branch of Artificial
Intelligence that deals with the interaction between computers and
humans using the natural language - NLP is a
sub – area of Artificial Intelligence deals with the capability of software to
process and analyse human language, both verbal and written language
Natural language processing has found applications
- Text Recognition (in an image or a video)
- Camera / machine captures the image of number plate, the image is transferred to
neural network layer of NLP application, NLP extracts the vehicle’s number from
the image. However, correct extraction of data also depends on the quality of
image - Voice recognition
- Speech processing
- Summarization
- Information extraction
how is NLP used?
NLP is natural language processing and can be used in scenarios
where static or predefined answers, options and questions may not
work. In fact, if you want to understand the intent and context of the
user, then it is advisable to use NLP.
Summarization by NLP
NLP not only can read and understand the paragraphs or full article
but can summarize the article into a shorter narrative without
changing the meaning. It can create the abstract of entire article.
There are two ways in which summarization takes place – one in
which key phrases are extracted from the document and combined to
form a summary (extraction-based summarization) and the other in
which the source document is shortened (abstraction-based
summarization).
Information Extraction
Information extraction is the technology of finding a specific information in a document
or searching the document itself. It automatically extracts structured information such
as entities, relationships between entities, and attributes describing entities from
unstructured sources.
Speech processing
- The ability of a computer to hear human speech and analyse and
understand the content is called speech processing.
What happens when we speak to our device?
The microphones of the device hears our audio and plots the graphs of our
sound frequencies. As light-wave has a standard frequency for each colour,
so does sound. Every sound (phonetics) has a unique frequency graph. This
is how NLP recognizes each sound and composes an individual’s words and
sentences
CV
It is a subfield of AI and involves extraction of information
from digital images like the videos and photographs,
analysing and understanding the content.
Applications of CV
- Object detection
- Optical Character Recognition
- Fingerprint Recognition
How does computer see an
image?
- The computer sees images as a matrix of 2-dimensional array (or threedimensional array in case of a colour image).
- The above image is a grayscale image, which means each value in the 2D
matrix represents the brightness of the pixels. The number in the matrix
ranges between 0 to 255, wherein 0 represents black and 255 represents
white and the values between them is a shade of grey. - If we have to represent
the above locomotive image in colour, the 3D matrix will be 9x9x3.
Each pixel in this colour image has three numbers (ranging from 0 to
255) associated with it. These numbers represent the intensity of red,
green and blue colour in that particular pixel
There are primarily four tasks that Computer vision accomplishes
- Semantic Segmentation (Image Classification)
- Classification + Localization
- Object Detection
- Instance Segmentation