Search Engine Indexing Flashcards
What is a search engine?
Software that is used to locate resources, usually on the internet, based on a user search query
What do search engines rely on/use?
On their indexes of webpages to locate resources
What is the index of webpages that search engines use?
A database which has data gathered by a search engine
What is search engine indexing?
The process of gathering, building and storing data in the search engine index.
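One way to picture the index is as a mapping from each word to the pages that contain it. The sketch below is only an illustration under that assumption; the function name `build_index` and the example pages are made up, and a real search engine index stores far more than this.

```python
from collections import defaultdict

def build_index(pages):
    """Map each lowercase word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index

# Hypothetical pages, made up for illustration.
pages = {
    "a.example/home": "search engines use an index",
    "b.example/faq": "an index is a database",
}

index = build_index(pages)
print(sorted(index["index"]))     # every page that contains the word "index"
print(sorted(index["database"]))  # only the FAQ page contains "database"
```

Looking a word up in `index` does not touch the page text again, which is why an index makes searching fast.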
Why do we use a search engine?
- makes finding resources easier
- the engine searches its index instead of checking every document
- it is very fast and quite accurate
Why does the search engine index need to be constantly updated?
To avoid broken links, add new webpages and websites and have obsolete websites removed
What programs do search engines use to create the index?
Spiders or crawlers
What do spiders/crawlers do?
They automatically travel the World Wide Web, indexing the words they find and mapping the links between pages in order to build up the index database.
What do spiders/crawlers usually look at and add to the database when travelling?
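The crawling behaviour described above can be sketched without any network access by treating a small dictionary as the web. Everything here is an assumption for illustration: the `WEB` data, the `crawl` function and the breadth-first visiting order are not taken from any real crawler.

```python
from collections import deque

# Hypothetical miniature web: URL -> (page text, outgoing links).
WEB = {
    "a.example": ("welcome to the home page", ["b.example", "c.example"]),
    "b.example": ("frequently asked questions", ["a.example"]),
    "c.example": ("contact page", ["b.example", "d.example"]),
}

def crawl(start):
    """Breadth-first crawl from `start`; returns (word index, link map)."""
    word_index = {}  # word -> set of URLs containing it
    link_map = {}    # URL -> list of URLs it links to
    queue = deque([start])
    seen = {start}
    while queue:
        url = queue.popleft()
        if url not in WEB:  # a broken link: there is no page to fetch
            continue
        text, links = WEB[url]
        link_map[url] = links
        for word in text.split():
            word_index.setdefault(word, set()).add(url)
        for link in links:
            if link not in seen:  # never queue the same page twice
                seen.add(link)
                queue.append(link)
    return word_index, link_map

word_index, link_map = crawl("a.example")
print(sorted(link_map))            # pages that were actually visited
print(sorted(word_index["page"]))  # pages containing the word "page"
```

The link to `d.example` is followed but skipped because no such page exists, which mirrors how a crawler comes across broken links.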
- URLs
- Titles of web pages
- Meta tags
- Content
- Links to other sites
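As a rough sketch of how some of the items listed above could be pulled out of a page, the example below uses Python's standard `html.parser` module to collect the title, the visible content and the links. The class name `PageFields`, the sample HTML and the URLs are hypothetical.

```python
from html.parser import HTMLParser

class PageFields(HTMLParser):
    """Collects the title, outgoing links and visible text of one page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self.text = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif data.strip():  # ignore whitespace between tags
            self.text.append(data.strip())

# Hypothetical page source, made up for illustration.
html = """<html><head><title>Example Page</title></head>
<body><p>Hello world</p><a href="https://b.example/">Next</a></body></html>"""

parser = PageFields()
parser.feed(html)
record = {
    "url": "https://a.example/",  # assumed address of the page above
    "title": parser.title,
    "content": " ".join(parser.text),
    "links": parser.links,
}
print(record)
```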
What is a meta tag?
A tag which adds extra keywords or a description to a webpage while keeping them hidden within the HTML code, so they are not shown on the page itself. Meta tags are defined within the <head> tag of the HTML document.
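To make this concrete, the sketch below shows a made-up page with two meta tags and reads them with Python's standard `html.parser` module. The class name `MetaReader` and the page contents are assumptions for illustration only.

```python
from html.parser import HTMLParser

class MetaReader(HTMLParser):
    """Collects <meta name="..." content="..."> pairs into a dict."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag == "meta":
            attributes = dict(attrs)
            name = attributes.get("name")
            content = attributes.get("content")
            if name and content is not None:
                self.meta[name.lower()] = content

# Hypothetical page: the meta tags carry text that is not displayed.
page = """<html><head>
<title>Flashcards</title>
<meta name="description" content="Revision notes on search engine indexing">
<meta name="keywords" content="search engine, index, crawler">
</head><body><p>Visible text</p></body></html>"""

reader = MetaReader()
reader.feed(page)
print(reader.meta["keywords"])
print(reader.meta["description"])
```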
What do search engines look through when a query is provided?
Their index, to find entries that match the query. They then use algorithms to determine which matches are the best results.
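A minimal sketch of this lookup, assuming the index maps words to URLs and that the "best" result is simply the page matching the most query words. Real engines use far more elaborate ranking; the `INDEX` data and the `search` function are hypothetical.

```python
from collections import Counter

# Hypothetical index: word -> set of URLs containing that word.
INDEX = {
    "search": {"a.example", "b.example"},
    "engine": {"a.example"},
    "index": {"a.example", "c.example"},
}

def search(query):
    """Return URLs ordered by how many query words they match."""
    scores = Counter()
    for word in query.lower().split():
        for url in INDEX.get(word, ()):
            scores[url] += 1
    # Highest score first; ties are broken alphabetically by URL.
    return sorted(scores, key=lambda url: (-scores[url], url))

results = search("search engine index")
print(results)
```

Here `a.example` comes first because it matches all three query words, while the other two pages match one word each.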
Who uses the PageRank algorithm?
Google
What is the PageRank algorithm?
An algorithm used to rank the results of a search query. Pages are ordered by importance, which is determined by the number of backlinks and their quality (the PageRank of the linking pages).
What assumption does the PageRank algorithm work upon?
That a website's or webpage's importance is determined by the number and quality of inbound links from other websites/webpages.
What is a backlink?
An inbound link to a webpage from another website or webpage
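The PageRank idea from the cards above can be sketched as a simple repeated calculation: each page shares its current rank among the pages it links to, so pages with many or highly ranked backlinks end up with a higher score. This is a simplified illustration, not Google's production system. The damping factor of 0.85, the iteration count and the tiny link graph are assumptions that do not come from these flashcards.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iteratively estimate a rank for each page from its inbound links."""
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start with equal ranks
    for _ in range(iterations):
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outgoing in links.items():
            if outgoing:
                # A page passes an equal share of its rank to each link target.
                share = damping * rank[page] / len(outgoing)
                for target in outgoing:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank evenly.
                for target in pages:
                    new_rank[target] += damping * rank[page] / n
        rank = new_rank
    return rank

# Hypothetical link graph: page -> pages it links to.
links = {
    "a": ["c"],
    "b": ["c"],
    "c": ["a"],
    "d": ["c"],
}

ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this graph page `c` has three backlinks and ranks highest. Page `a` has only one backlink, but it comes from the highly ranked `c`, so `a` still ranks above `b` and `d`, which have none.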