ITEC50 Flashcards
a program that searches documents for specified keywords and returns a list of the documents where the keywords were found
search engine
sending out a spider to fetch as many documents as possible
search engine
an indexer, who reads documents and creates an index based on the words contained in each document
search engine
proprietary algorithm to create its indices such that, ideally, only meaningful results are returned for each query
search engine
How do web search engines work?
- finding specific information on the vast expanse of the World Wide Web.
- it would be virtually impossible to locate anything on the Web without knowing a specific URL.
Algorithm of search engine:
- to determine the relevance of the information in the index
- frequency and location of keywords on a Web page
- the way that pages link to other pages on the Web
A search engine consists of three components:
- spider
- index
- search engine mechanism
program that traverses the Web from link to link, identifying and reading pages
spider
database containing a copy of each Web page gathered by the spider
index
software that enables users to query the index and that usually returns results in relevancy ranked order
search engine mechanism
backbone or procedure of technology
algorithm
Three types of search engines:
- powered by robots (called crawlers; ants or spiders)
- powered by human submissions
- hybrid of the two
automated software agents
crawlers
use automated software agents (called crawlers) that visit a Web site
crawler-based search engines
read the information on the actual site
crawler-based search engines
read the site’s meta tags
crawler-based search engines
follow the links that the site connects to perform indexing on all linked Web sites as well
crawler-based search engines
returns all that information back to a central depository
crawler-based search engines
check for any information that has changed
crawler-based search engines
rely on humans to submit information that is subsequently indexed and cataloged
human-powered search engines
only information that is submitted is put into the index
human-powered search engines
Query on Search Engine
- searching through the index that the search engine has created
- giant databases of information that is collected and stored and subsequently searched.
- index hasn’t been updated since a Web page became invalid
it enables you to search through various Internet databases
alenka
provides ad-free result
alenka
Some of the Internet Search Engines you can use to search the Web
- yahoo
- baidu
- bing
January 1996 as a research project by Larry Page and Sergey Brin