Week 8: influence manipulation Flashcards
How does simple contagion spread in a small-world network, lattice, random, and scale-free network?
Small-world & random: each weak tie allows information to spread to an untouched region of the network. Spreads very fast
Lattice/clustered: spreads consistently but not as fast as small-world or random
Scale-free: existence of hubs means the hubs will quickly spread the information
How does complex contagion spread in a small-world network, lattice, random and scale-free network?
Clustered: spreads consistently
Small-world: rewiring a few edges slows down the spread because the weak ties slow the spreading. Rewiring many edges can stop the spread
Random: spreads slower than clustered because people need reinforcement from multiple neighbours
Scale-free: when the behaviour is risk the hubs will wait and see until they see other communities backing up the behaviour
How does a disease spread in two networks of the same size with different average path lengths?
A) has shorter average path lengths - the peak will come earlier, lots of people infected at the same time
B) has longer average path lengths - takes longer to spread and results in a flatter infection curve, peak is lower
What are 3 regulations to deal with COVID-19 and what are they trying to achieve?
- Remove weak/long ties between distant communities - interact with only those who are geographically proximate
- Keep ties with redundancy but remove narrow bridges - friends only meet when they gave many friends in common
- Build social bubbles through repeated contact - no gathering of more than 3 people
What is the influence maximisation problem (IMP)?
There exists a specific set of nodes which if activated would cause the spread of information to the whole network and if immunised would prevent the diffusion of a large scale epidemic
- IMP: to find a set S of a given size k that maximises f(S)
- f(S) is a function
What is structural IMP?
The influence function f(S) is fully determined by the network topology - when you are only interested in the connectivity/robustness of a network
E.g. f(S) could be the size of the largest component after the removal of S
What is functional IMP?
The influence function f(S) involves both network topology and contagion mechanism
What centrality heuristics can be used to approximate the exact IMP solution and what are each of them useful for?
Degree heuristic: the number of people that can be reached within one path length
Closeness centrality: reflects how efficient a node exchanges information with all other nodes - good for reaching all the people in the shortest time
Betweenness centrality: reflects a nodes power in controlling flow - can’t reach many people within one path length but can potentially lead to a larger outbreak
Eigenvector, k-core: based on node position, importance of neighbours
What are centrality heuristics?
Rank all nodes according to their degrees or other centrality measures and pick up the k top-ranking nodes
What are the advantages and disadvantages of using the degree heuristics for IMP?
Advantages:
-focus on immediate effect
-easy to identify the highest degree nodes for online and offline networks
-least demanding on network information
Disadvantages:
-not ideal to maximise the effect of the whole network
-sometimes inefficient since nodes of the highest centrality may be highly clustered
What are the advantages and disadvantages of closeness, betweenness and eigenvector heuristics for IMP?
Advantages:
-reflect structure of the whole network
-good for project aiming for the whole population and believing that signal will not decay significantly even for longest path
Disadvantages:
-require complete information of the network
-computationally expensive
-not easy to interpret and extent the results from sample network
What are 3 algorithms inspired by centrality heuristics?
High degree adaptive (HDA), top-k spreaders, local rank
What is the HDA method for IMP?
First choose the node of the largest degree, then recalculate the degree of nodes after every step of node removal
What is the top-k spreaders method for IMP?
Network divided into communities, all communities ranked in decreasing order according to size, first spreader is selected from the largest community, then next spreader selected from second largest community having no edges incident to previous communities
What is the local rank method for IMP?
Consider the number of neighbours within 4 path lengths