1.3.4.3 Search Engine Indexing & PageRank algorithm Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the programs that scour the WWW called

A

Spiders/crawlers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do spiders do?

A

Spiders index any pages, content, metadata they find and map links between pages by following all:

Internal links

External links

This updates the index. Updating the index must be done continuously as pages get added/removed/updated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the PageRank algorithm do and aim to achieve?

A

It is used to help compile and rank webpages and list of results returned by a search engine.

It checks the number and quality of links to a page, determining roughly how important it is. Those of greater importance are more likely to be linked to from other websites.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

PR(A) = (1 – d) + d (PR(T1)/C(T1) + … + PR(Tn) / C(Tn))

What does PR(A) mean?

A

Page rank of page A

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

PR(A) = (1 – d) + d (PR(T1)/C(T1) + … + PR(Tn) / C(Tn))

What does C(Tn) mean?

A

The total count of outbound links from web page n, including to A

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

PR(A) = (1 – d) + d (PR(T1)/C(T1) + … + PR(Tn) / C(Tn))

What does PR(T1)/C(T1) do?

A

The share of the vote page A gets from pages T1 to Tn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

PR(A) = (1 – d) + d (PR(T1)/C(T1) + … + PR(Tn) / C(Tn))

d stands for a damping factor. What is meant by a damping factor?

A

The damping factor prevents PR(Tn)/C(Tn) from having too much influence. It notionally set to 0.85.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly