Search Engine Indexing Flashcards

1
Q

What is a search engine?

A

Software that is used to locate resources, usually on the internet, based on a user search query

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do search engines rely on/use?

A

On their indexes of webpages to locate resources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the index of webpages that search engines use?

A

A database which has data gathered by a search engine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is search engine indexing

A

The process of gathering, building and storing data in the search engine index.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why do we use a search engine?

A
  • makes finding resources easier
  • doesn’t check every document
  • it is very fast and quite accurate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why does the search engine index need to be constantly updated?

A

To avoid broken links, add new webpages and websites and have obsolete websites removed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What programs do search engines use to create the index?

A

Spiders or crawlers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What do spiders/crawlers do?

A

Automatically travel the world wide web and index found words, map out links between pages in order to build up the index database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What do spiders/crawlers usually add to database and look at when travelling.

A
  • URLs
  • Titles of web pages
  • Meta tags
  • Content
  • Links to other sites
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a meta tag?

A

A tag which adds extra keywords or descriptions to a webpage, while keeping them hidden within the HTML code. These are defined within the tag on the html document.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do search engines look through when a query is provided?

A

Their index to match the input. These use algorithms to determine which ones are the best results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Who uses the PageRank algorithm?

A

Google

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the page rank algorithm?

A

code which is used to compile and rank results of a search query. The websites are ordered based on importance, which is determined by the amount of back links and their quality(page rank of connected website).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What assumption does the pagerank algorithm work upon?

A

That a website’s or webpage’s importance is determined by the amount of inbound links from other websites/webpages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a back link?

A

An inbound link from another site

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the equation for the page rank algorithm?

A

PR(A) = (1-d) + d(PR(t1)/c(t1))………………………..

17
Q

What is PR(A)?

A

The resultant page rank of site a

18
Q

What is d?

A

The damping factor.

19
Q

What is the damping factor?

A

a probability used to reduce a ranking of a site based on the assumption that a typical user will give up clicking on links

20
Q

What is PR(t1)?

A

the page rank of linked site to a (an inbound site)

21
Q

what is C(t1) ?

A

the totaL number of outbound pages from page t1

22
Q

Are pageranks constantly updated?

A

Yes

23
Q

What is the damping factor between?

A

0 and 1

24
Q

What does the damping factor reduce?

A

used to reduce a ranking of a site generated by the amount of backlinks . So not the oldest or most important sites take the ranking

25
Q

Does the page rank algorithm ignore spelling mistakes?

A

Yes

26
Q

What does the page rank do, to summarise?

A

assings a rank to a page based on a query, it does so by ranking the amount of inward links

27
Q

Does the page rank algorithm have to deal with millions of results?

A

Yes