Chapter 4: The Search Query Flashcards
Boolean logic
A method for combining search terms (keywords) using logical operators to limit or expand the numbers of results. This system was originally developed by the mathematician
George Boole in the 1800s. Three logical operators are used in information retrieval: AND, OR, and NOT.
Boolean operator
Indicates a relationship between search terms. Using Boolean operators allows the searcher to broaden or narrow the search results. The three most commonly used
Boolean operators are: AND, OR, and NOT.
Concept identification
Indicates a relationship between search terms. Using Boolean operators allows the searcher to broaden or narrow the search results. The three most commonlv used
Boolean operators are: AND, OR and NOT.
Controlled vocabulary
An authorised list of terms that can be used when indexing a document or an entity or when conducting an information search
Human indexing
When human indexers (as opposed to computer algorithms) create representations of documents by allocating keywords to describe the content.
Metadata
Metadata are data that describe and give information about other data. Metadata often contain descriptive keywords for a document.
Describes the properties of the document.
Metadata contain descriptive keywords/ concepts about a
document.
Metadata can also be tags.
Natural language representation
Content described in words that we use in everyday language
Natural language query
A query that uses the same words that the author used in the document to represent the document in a search query.
Query
A query is a combination of search terms (keywords) with Boolean operators in a single statement. The searcher must construct a query to carry out a search.
Query construction
Constructing a query involves combining search terms (keywords) with Boolean operators in a single statement.
Search engine mathematics
These are mathematical symbols used instead of Boolean operators. For example, + is the equivalent of the Boolean operator AND and - is the equivalent of the Boolean
operator NOT
Creating and representing information
- Mental process
- Real world
- Computer ready
Mental process
An author has an idea research problem or interest
Real world
The author creates a document (information object). This can be a blog, book, an article, a website, etc.
Computer ready
For people to find this document, the content must be represented in a database as one or both of the following: full text or representation of the document through metadata.
Seeking and retrieving information
- Mental process
- Real world
- Computer ready
Mental process
An information user needs information to a problem or maybe just out of interest
Real world
The user must formulate a
question (or maybe a
question is already formulated
as an assignment)!
Computer ready
The question now Has to be
reformulated as a query for
the interface to understand it.
Retrieving information
The idea of information retrieval is to MATCH Your need (as a user) for information TO An author's supply of information.
When we search for information, we are trying to
MATCH
The words in a question
TO
The words an author may have written at some point in time
Metadata can be used to represent a document in two ways
- Natural Language
2. CONTROLLED VOCABULARY
Natural language
EVERYDAY LANGUAGE.
WORDS THAT ALREADY EXIST IN THE
DOCUMENT.
Controlled vocabulary
ALTERNATIVE WORDS / SYNONYMS
BROADER AND LESS SPECIFIC THAN
NATURAL LANGUAGE.
DETERMINING SEARCH KEYWORDS
Identify topics and concepts
Identify singular and plural forms
Identify proper nouns
Identify time periods
Identify required formats