1.3.4 Web Technologies pt2 Flashcards
Index
A record of resources located on the world wide web
Web crawler
Internet bots that continuously crawl the web to discover and record publicly available web pages
Creates an index that is searched for relevant content when you search
Meta tags
A description of the content of a web page inside the HTML tags to make it more likely to be found
PageRank uses
Ranks web pages based on how many web pages link to them and the amount of pages that link to those
PR(A) formula
PR(A) = (1-d) + d(PR(Ti)/C(Ti) + … + PR(Tn)/C(Tn))
Ti -> Tn are all the pages that lead to a page so divide each page rank by the amount of outward links and sum
d
Damping factor - probability of a user not opening that link
C(Ti)
The number of outward links from page TI
PageRank Process
Starting with each PageRank as 1, iterate the algorithm for each.
Keep iterating, feeding new values only at the end of each iteration (when every page has had PageRank run on it)
Client-server
Clients on terminals access the central server to carry out tasks
Diagram has a central server connected to terminals by lines
Client-server advantages
+ Secure
+ Can be used for small and large organisations
+ Complex tasks are performed quicker
Client-server disadvantages
- Requires specialist IT staff to maintain (expensive)
- If the server breaks down no devices can be used
Peer-to-peer architecture
Has no central server, processing is done by the client’s devices which are connected to each other
Diagram has terminals connected like in a full mesh topology
Peer-to-peer advantages
+ Computers can communicate without going through a server
+ Useful for small organisations
Peer-to-peer disadvantages
- No central server to protect security
- Data cannot be retrieved from a computer that is switched off
Client side processing
Scripts that handle initial processing are run on the terminal rather than the main server