Algorithms that Changed the World Flashcards
What is an algorithm?
A finite set of precise instructions for performing a computation or for solving a problem.
What is the PageRank algorithm designed to do?
To rank web pages by importance based on their hyperlink structure, helping users find the most relevant pages in a massive, unstructured web.
What are the two main ideas behind PageRank?
The Hyperlink Trick - Pages linked to by many others are likely more important.
The Random Surfer Model - Models a user randomly clicking links or restarting their browsing at random pages.
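The random surfer idea can be illustrated with a small simulation. This is only a sketch under stated assumptions: the function name simulate_random_surfer, the three-page toy_web, and the restart probability of 0.15 are all hypothetical choices for illustration, not part of the original cards.

```python
import random

def simulate_random_surfer(links, steps=100_000, restart_prob=0.15, seed=0):
    """Estimate page importance as the fraction of steps spent on each page."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    pages = sorted(links)
    visits = {page: 0 for page in pages}
    current = rng.choice(pages)
    for _ in range(steps):
        visits[current] += 1
        outbound = links[current]
        # Restart at a random page with probability restart_prob,
        # or whenever the current page has no links to follow.
        if not outbound or rng.random() < restart_prob:
            current = rng.choice(pages)
        else:
            current = rng.choice(outbound)
    return {page: count / steps for page, count in visits.items()}

# Hypothetical three-page web: A and B both link to C, and C links back to A.
toy_web = {"A": ["C"], "B": ["C"], "C": ["A"]}
frequencies = simulate_random_surfer(toy_web)
```

In this toy web no page links to B, so the surfer only reaches B by restarting, and B should end up with the smallest visit frequency.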
How does PageRank define page importance?
A page’s importance is higher if it is linked to by other important pages. Each page divides its rank equally among its outbound links, so a link from a page with few outbound links passes on more rank than a link from a page with many.
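A single round of this rank distribution might look like the sketch below. The function name distribute_rank and the toy_web graph are hypothetical, and the sketch assumes every linked page also appears as a key in the dictionary; sink pages and random restarts are deliberately left out.

```python
def distribute_rank(links, rank):
    """One update step: each page splits its rank equally among its outbound links."""
    new_rank = {page: 0.0 for page in links}
    for page, outbound in links.items():
        if not outbound:
            continue  # sink pages are out of scope for this sketch
        share = rank[page] / len(outbound)
        for target in outbound:
            new_rank[target] += share
    return new_rank

# Hypothetical web: A links to B and C, B links to C, C links to A.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
rank = {page: 1 / 3 for page in toy_web}
rank = distribute_rank(toy_web, rank)
```

After this one step, C holds half of the total rank because it receives half of A's rank plus all of B's, while B receives only half of A's.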
What is the sink page problem in PageRank?
Sink pages have no outbound links, so rank that flows into them is never passed on and they can absorb all of it. The solution is to redistribute their rank equally across all pages.
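The sink fix can be sketched as an update step that first collects the rank held by sink pages and spreads it evenly over every page. The name distribute_rank_with_sinks and the toy_web graph (with S as the sink) are hypothetical, and this is one simple way to express the idea rather than a definitive implementation.

```python
def distribute_rank_with_sinks(links, rank):
    """One update step in which rank held by sink pages is shared equally by all pages."""
    n = len(links)
    # Total rank currently sitting on pages with no outbound links.
    sink_rank = sum(rank[page] for page, outbound in links.items() if not outbound)
    # Every page starts the round with an equal share of the sink rank.
    new_rank = {page: sink_rank / n for page in links}
    for page, outbound in links.items():
        for target in outbound:
            new_rank[target] += rank[page] / len(outbound)
    return new_rank

# Hypothetical web: A links to B, B links to S, and S is a sink.
toy_web = {"A": ["B"], "B": ["S"], "S": []}
rank = {page: 1 / 3 for page in toy_web}
for _ in range(50):
    rank = distribute_rank_with_sinks(toy_web, rank)
```

Because the sink's rank is handed back out each round, the total rank stays at 1 instead of piling up on S.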
What is the cycle problem in PageRank?
A group of pages that link only to each other can trap rank, which keeps circulating inside the group and never flows back out. The random surfer model mitigates this: because the surfer occasionally restarts at a random page, rank can always escape the loop.
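To see how random restarts free rank from a cycle, the sketch below compares the same toy graph with and without them. The function name pagerank, the damping parameter, and its value of 0.85 (a commonly used setting, where 1 - damping is the restart probability) are assumptions for illustration and are not stated in the cards.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Iterative PageRank sketch with random restarts and sink redistribution."""
    n = len(links)
    rank = {page: 1 / n for page in links}
    for _ in range(iterations):
        sink_rank = sum(rank[page] for page, outbound in links.items() if not outbound)
        # Base share: random restarts plus an equal split of any sink rank.
        new_rank = {page: (1 - damping) / n + damping * sink_rank / n for page in links}
        for page, outbound in links.items():
            for target in outbound:
                new_rank[target] += damping * rank[page] / len(outbound)
        rank = new_rank
    return rank

# Hypothetical web: A links into a two-page cycle formed by B and C.
toy_web = {"A": ["B"], "B": ["C"], "C": ["B"]}
trapped = pagerank(toy_web, damping=1.0)   # no restarts: the cycle keeps all rank
damped = pagerank(toy_web, damping=0.85)   # restarts let rank escape the cycle
```

With damping set to 1.0, page A ends with zero rank because everything drains into the B and C cycle. With restarts enabled, A retains a small share.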