AIIM icp Search Flashcards

1
Q

Crawler

A

a machine that goes through the text and metadata to find things in the system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Search engine index

A

creates and organizes a database of keywords for all the different docs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Query Engine

A

plain language or boolean actually searches the documents for the words being searched

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Human-powered directories

A

the directories are created by humans (mahalo) may be subjective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

hybrid search engines

A

uses query and human powered directories (like google or yahoo)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

homogeneous search

A

one search tool, multiple repository indexes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

federated search

A

multiple search tools, multiple repository indexes (merlot)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

universal search

A

one search tool, one repository index (disregards any other search tools or pre-existing repository indexes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

application search

A

built in searches in specific applications (like email search etc) keeps application security in place (but only limited to the application data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Parametric search

A

rules based, fielded search. look at attributes (parameters) already built-in to the documents. Most precise but only limited to the declared fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

keyword search

A

a type of parametric search, they are set by the users. May be more precise because it won’t rely on only the words in the document and it applies human reasoning to the document a little inflexible since dependant on humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

semantic / pattern search Natural language search

A

looks for the meaning behind the words, not just the specific word.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Statistical search

A

using baseian probabilities looking on search. Also not based on language so it can fit no matter the language. uses relevancy ranking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Concept and fuzzy search

A

looks on synonyms and simpliar spellling etc. but can get large lists.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Concept clustering

A

using algorithims to profile the concepts and compared to the other documents creates a large organization of overlapping concepts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Social search

A

looking at all the social search and creating more relevant documents. looking a docs your friends have liked

17
Q

Search Engine Optimization (SEO)

A

Works on content rather than the index to make the documents more “findable” ambient findability

18
Q

Effective keyword use

A

not just in text, but title meta, header of html and xml docs

19
Q

effective link-building

A

adding more links to make more effective, also plain-lanugage urls, image tags

20
Q

use of social tools

A

organizations wikis, tweet, and social to highlight the best documents for people

21
Q

thesaurus

A

manages and tracks the definitions of words and phrases and their relationships to one another in a heiarchical fashion to correlate between groups and applications

22
Q

semantic networks

A

like thesaurus but a higher view, using a metadata based infrastructure to connect search terms to other related terms

23
Q

comparison technics

A

going through and weeding out duplicates from results

24
Q

semantic feaure extraction and comparison

A

we use words to discribe things so if document uses the a portion of the same words to describe the document then it is likely referring to the same or similar thing

25
Q

hash code

A

a mathamatical description of a document, if identical then, maybe same document if not they are not same document