Network Analysis I Flashcards

Question 1

Q

Describe the Erdös Rényi Model

Answer

A

We have a set of nodes

and a connection probability p.

For each node pair, we use p to decide if there should be a connection or not.

for p = 0.5 (a coin flip)

we may get something simmilar to the figure

Question 2

Q

Can we calculate the number of nodes in a large Erdös Rényi Model?

Answer

A

Yes! using the formula explained in the figure we are actually able to do so, by knowing the number of nodes and the connection probability.

Question 3

Q

What is a node degree?

Answer

A

Degree of a node i: number of edges adjacent to i (for directed graph can also distinguisth in-degree and out-degree). Degree distribution: distribution of degrees of all nodes in a network.

Question 4

Q

What is the power law?

Answer

A

In many real networks there is a linear relationship between the logarithms of the degree d, and its relative frequency f (d).

This applies to word appearances (for example. is, is (no pun intended) a more used word than elephant).

See figure for distribution

Question 5

Q

Why is the power law interesting here?

Answer

A

If we want to add a new node to the network, we can expect it to be added to one of the nodes with many attachments unless we introduce prefential attachments

Question 6

Q

What is the idea of prefential attachments?

Answer

A

The change of assigning a new node to another existing node is propertional to the degree of the existing node. The rules for this can however be a bit ambigous

Question 7

Q

How do we resolve the ambiguities in the informal preferential attachment model?

Answer

A

The Linearized Chord Diagram model resolves the ambiguities in the informal preferential attachment model.

Basis: adding a new node by preferential attachment in the m = 1 (m is the amount of edges) case is easy and non-ambiguous.

For a full explanation of the LCD algorithm:

Generate a random matching of the nodes.

https://gyazo.com/27918d2fb1209e071d98baab41988e89

2.

https://gyazo.com/d80dec9a29169e9dcb1113c63903793e

3.

https://gyazo.com/6952b1856431ee9c66a4cbce9e070de6

4.

https://gyazo.com/82706a42899e0e7eadc11afe21a28d42

5.

https://gyazo.com/d5d5a6cda0943421eaec3bf5bd378171

6.

https://gyazo.com/9e5950d807e79be41b9b968ae5a0b632

last

https://gyazo.com/41da2278e2966b02471d0d5b4993885d

Question 8

Q

What effect did the LCD have on the protein-protein network?

Answer

A

A more straight logaritmic line. Meaning it is better in terms of degree distribution

Question 9

Q

How do we measure the amount of clustering

Answer

A

For a node v in G we use two things:

τ3(v) : (tau) number of length-2 paths with v in the middle
* this does not need to be a triangle*
τ△ : number of triangles containing v

This will give us a clustering coefficient

example - see figure

Question 10

Q

What how do we use exponential random graphs?

Answer

A

We can use it since there are a lot of attributes to a network. We can use these attributes and assign parameters (scores) which can be used to calculate the probability of this to happen, though this requires the normalization constant (paritition function) to be found which is very difficult.

An example is the use of open and closed triangles as attributes

As seen in the figure:

Network Analysis I Flashcards

(10 cards)