Section 5 - Networks and Web Technologies Flashcards
Search Engine Indexing
What is a search engine?
Systems that locates resources on the Internet
Search Engine Indexing
What is an index?
A record of the resources located on the Internet/World Wide Web
Search Engine Indexing
What is a web crawler/spider?
Software that creates an index
Search Engine Indexing
What process does the web crawler go through?
- Visits a web page and follows all the links on that page
- Then follows all the links on those pages
- Repeat
Search Engine Indexing
What are meta tags?
A list of keywords or concise phrases specified by the website owner that are built into each webpage which effect the chances of a webpage from being found during a search
Search Engine Indexing
What factors effect a page’s chance of being placed high in the results list?
- Keywords in the <title> tag</title>
- The age of your website and date of last update (or frequency of updates)
- The number and relevancy of keywords appearing in <h1> tags and
- The relevancy of the domain name to the conten
- PageRank
Search Engine Indexing
PageRank equation:
PR(A) = ( 1-d ) + d( PR(Ti)/C(Ti) + … + PR(Ti)/C(Tn) )
- What is the dampening factor?
- What is PR(X)?
- What is C(X)
- What is Ti to Tn?
- The probability a user does not follow any links, normally set to 0.85
- The PageRank of X
- Number of outbound links on page X
- Incoming pages
Search Engine Indexing
What steps are taken using the PageRank algorithm to find the PageRank of multiple unknowns?
- Start by guessing each PageRank as 1
- Substitute the values into the PageRank equation
- Update the guesses based on the answer
- Repeat until the guesses and the answers match
EXAMPLE
Iteration 1:
PR(A) = 0.15 + 0.85( 1/3 + 1/1 ) = 1.283
PR(B) =0.15 + 0.85( 1/1) = 1
PR(C) =0.15 + 0.85( 1/3 ) = 0.433
PR(D) = 0.15 + 0.85( 1/3 ) = 0.433
Iteration 2:
PR(A) = 0.15 + 0.85( 1/3 + 0.433/1 ) = 0.801PR(B) =0.15 + 0.85( 1.283/1) = 1.241
PR(C) =0.15 + 0.85( 1/3 ) = 0.433
PR(D) = 0.15 + 0.85( 1/3 ) = 0.433
Structure of the Internet
What is a Uniform Resource Locator (URL)?
The full address of an Internet resource
Structure of the Internet
What is an internet registrar?
A company that holds records of all existing website names and the details of those domains that are currently available to purchase
Structure of the Internet
What is an internet registry?
- A global organisation governed by the Internet Corporation for Assigned Names and Numbers (ICANN)
- Has worldwide databases that hold records of all the domain names currently issued to individuals and companies, as well as their details
- Allocate IP addresses and keep track of which address(es) a domain name is associated with as part of the Domain Name System (DNS)
Structure of the Internet
What is the Domain Name System (DNS)?
A catalogue of all domain names and IP addresses in a series of global directories
Structure of the Internet
What does a domain name do?
Identifies the area/domain that an internet resource resides in
Structure of the Internet
What is a Fully Qualified Domain Name (FQDN)?
A domain name is one that includes the host server name
(e.g. www, mail, ftp)
Structure of the Internet
What is an Internet Protocol (IP)?
A unique address that is assigned to a network device
Internet Communication
What is a MAC address?
A unique identifier assigned to a Network Interface Controller (NIC)
Structure of the Internet
What is a Wide Area Network (WAN)?
A network spread over a large geographical area
Structure of the Internet
What is a Local Area Network (LAN)?
A network that consists of a number of computing devices on a single site or in a single building
Structure of the Internet
What is physical bus topology?
A LAN topology where all computers are connected to a single cable