Lecture 06: Folksonomies and Uncertainty Flashcards

1
Q

What are the seven problems associated with tagging?

A
  1. Homonymy - same word having different meanings
  2. Synonymy - relation between two different words that hold the same meaning
  3. Granularity - level of detail considered
  4. Aspect - the way in which something may be viewed or regarded
  5. Not tight control of tagging - free text vs. controlled vocabulary
  6. Seemingly chaotic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three reasons tagging is contributing to success?

A
  1. Low barriers to entry - not required sophisticated knowledge
  2. Feedback and Asymmetric Communications - negotiate meanings of tags, locate other related resources from tags of a certain resource
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are desire lines?

A

The concept used to identify the relation between emergent semantics from tags. Desire lines is a path developed by erosion caused by human footfall, which provide the shortest or most easily navigated route.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the social aspects of tagging behaviour?

A
  • If a web resource is tagged by many users, it has more chance to be seen by other users.
  • If a web resource is seen by many users, it has more chance to be tagged by other users.
  • This creates a positive feedback loop and leads to exponential growth
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is self-normalisation in regards to folksonomies?

A

The idea that controlled vocabularies become more consistent without any external control over time.
Self-normalisation can be promoted by shared information retrieval spaces.
The folksonomies tend to show a power law curve and long tail effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Power Law?

A
  1. Very small number of events that have a very high probability of appearing
  2. Very large number of events that have a very low probability of appearing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the two types of folksonomies?

A
  1. Broad folksonomy

2. Narrow folksonomy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a Broad folksonomy?

A

A broad folksonomy is whereby many people tag the same object, it follows the power law and there is an agreement on using a few popular tags. Broad folksonomies can be used to select preferred terms or extract a controlled vocabulary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a Narrow folksonomy?

A

A narrow folksonomy is whereby a few people tag the same object, as a result it loses the richness of the masses. Aids in tagging objects which are not easily findable. Specific target audiences will tag objects with specific personal tags to improve future retrieval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a tag cloud?

A

A tag cloud is a visualisation of the tags in a folksonomy with tags being weighted based on their frequency of occurrence. The more important tags are bigger and louder in the illustration. It provides an instant illustration, providing precise and specific orientation of the content. The advantage of a tag cloud is it highlights the most important popular subjects dynamically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the four types of tag clouds?

A
  1. Sorted Alphabetically - font size, weight, colour
  2. Sorted by Frequency - most important tags may be highlighted
  3. Not sorted at all - most important are highlighted
  4. Sorted according to similarity - similar terms appear close to each other
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a tag index?

A

A tag index is an alternative to a tag cloud in that it provides an index of tags, in a listed manner which is the best solution for precise content presentation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the advantages of folksonomies?

A
  1. Serendipity - information discovery (browsing vs. finding)
  2. Reflect the population’s conceptual model
  3. Accomodate Diversity
  4. Self-moderating
  5. Low cost alternative
  6. Enable emergence of social groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the disadvantages of folksonomies?

A
  1. No synonym control
  2. Lack of precision
  3. Lack of hierarchy
  4. Low findability quotient
  5. Problems with scaling
  6. Susceptible gaming
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the two goals of automatic tag analysis?

A
  1. Map between Synonyms

2. Link related terms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the three problems of automatic tag analysis?

A
  1. Typos
  2. Spelling variations
  3. Morphological variations
17
Q

How can typos be dealt with and what are the two types of spell checking algorithms?

A

Typos can be tackled by spell checking algorithms, in particular:

  1. Phonetic Mastering Algorithms - words that sound alike
  2. String Similarity Algorithms - words that look alike
18
Q

How can morphological variations be tackled?

A
  1. Stemming - reducing inflected words to their stem, base or root form : e.g. fishing, fished, fish, fishes, fisher all relate to fish.
  2. Lemmatisation - determining the lemma of a given word, the canonical, dictionary or citation form of a word : e.g. fishing fished fish relate to fish
    : e.g. fisher, fishers relate to fisher
19
Q

What is synonym detection via dictionary?

A

Databases such as Wordnet are used which are comprised of a large lexical database of content words, and these words are grouped into synsets. The contents of the dictionary links all the lexical relations between words:

  • Antonym (opposite in meaning)
  • Hyponym (narrow term, Hypernym broader term)
  • Meronym ( part of , Holonym the whole )
20
Q

What is data mining?

A

Data mining is a variety of techniques to sort through data, identifying patterns and establishing probabilistic relationships.

21
Q

What is the Dung Argumentation framework?

A

The dung argumentation framework (AF) is a pair >
Where A is a set of arguments, and –> is a binary relation on A.
It can be represented as a direct graph, where nodes are arguments and edges are attacks.
It helps solves errors in presenting data and nomenclature.

22
Q

What are the semantic properties in the Dung Argumentation framework which arguments must follow?

A
  1. Conflict-freeness : an attacking argument and an attacked argument cannot stay together
  2. Admissibility - extension should be able to defend itself
  3. Strong Admissibility - base your defence on unattacked arguments.