Comm 156 Midterm Flashcards

1
Q

What is a network?

A

Mathematical model of interactions. (collection of points joined together in pairs by lines)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two ingredients that every network has?

A

Nodes and links

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the proper verbiage when describing a network?

A

Two nodes are connected if… “they share, they go to”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the degree of a node?

A

The number of links connected to the node

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a component of a network?

A

A subnetwork in which every node is connected to every node through chain of links. There can be more than 1 in a network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the small world effect?

A

The minimum number of links that one must pass to get to another node

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the Allen Curve state?

A

The probability of two people communicating diminishes as the distance increasing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What did the “This Hurts Me As Much As It Hurts You” reading find?

A

People closer together in a concentration network will tend to reinforce their behaviors, where more connections within groups can expose people to new behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What did the Enron emails reveal?

A

They revealed that hubs within a project network highlighted illegal activity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a bipartite network?

A

Nodes that can be divided into two groups so that all the connections go from one group to the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What can a bipartite network be projected into?

A

Two different networks that each only have one type of node. “____ is connected if they share a ____” (Such as: ingredients, and flavor compounds”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a weighted network?

A

Different nodes have different weights (size) according to the strength of their connection, aka how many links they have attached to it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What did the Framingham study find?

A

People’s behavior is correlated to the behavior of their neighbor, or close neighbor. Aka if my neighbor smokes, I am more likely to smoke and a neighbor two links down is more likely to smoke than a random person in the population. The further you are, statistically is goes down, however.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a homophily?

A

When nodes in a network tend to connect to similar nodes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a directed network?

A

When a node goes from one to the other; usually shown with an arrow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are network communities?

A

They are subgroups of a network with a lot of connections within the group but not between the groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Potential Causal Explanation for Social Contagion: Influence

A

An individual’s behavior affects the behavior of her network neighbors (as in the obesity study) . Network changes behavior. Being connected can lead to similar behavior.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Potential Causal Explanation for Social Contagion: Selection/ Network Dynamics

A

The network itself changes. Similar people become closer (homphily) and dissimilar people break ties. Similar behavior can lead to being connected. (as in the liberal/conservative/independent blog)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Potential Causal Explanation for Social Contagion: Exogenous Covariates

A

An external factor that is somehow correlated with the network causes the ties to change. (such as, geography)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the mathematical model for going viral? When does an epidemic occur?

A

Susceptible > Infected > Recovered. (SIR) An epidemic occurs when the infection rate exceeds the recovery rate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the mathematical equation for the Epidemic threshold? What do the variables stand for? What is that number otherwise known as?

A

cid > 1 (C= contact rate, i= infectivity rate, and d= duration time) - The Viral Tipping Point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the infection rate?

A

The total contacts between susceptible people and infected people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is infectivity?

A

The probability that a contact between a susceptible person and an infected person will lead to an infection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the contact rate?

A

On average, how many contacts does the susceptible person have a day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is the SIR graph model that will happen over time?

A

The S people will go from 100 > almost 0. The I people will go in a bell curve. The R people will go from 0 > almost 100.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What was Ebola a failure?

A

It was always less than 1, so it dies. Low contact rate between people in rural Africa and those who contacted it, usually stayed home in isolation. Low infectivity rate because bodily fluid has to go into us. Low duration time because people tend to die pretty quickly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Why was H1N1 not a failure?

A

It was contacted in the cities, where air travel and modern transportation increased the contact rate. Easier to transmit than ebola because people just need to be near you. There is a longer duration time (weeks).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is the small world problem? 2 possibilities.

A

The probability of 2 people with mutual acquaintance meeting up. X –> Z through Y. Either everyone can be linked through a small amount of acquaintances or people have circles and cliques that prohibit any 2 random people from meeting. Additionally we have social classes and cliques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What 3 things need to be done to start an epidemic?

A

Increase the contact rate, increase the infectivity rate, and increase the duration time. That way, cid > 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Which marketing features work better: Viral or Passive? Why?

A

Viral works better because you are personally sending a message to a friend, rather than it being automatic every time you use that application. It works better because Active has a larger I and D, but the C is low because people ignore it. Viral has a low I and D but the C is high and overtime it has a longer duration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is the issue with the viral tipping point?

A

It assumes that:

  • Everyone interacts at random
  • Everyone is equally as susceptible
  • All people have equal amounts of contacts

However:

  • We often interact with the same people
  • Our status is correlated with the status of others
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the problem with using social science to measure contagion?

A

We don’t always have a large enough N (sample size). It is difficult to manipulate networks in a controlled way. They are expensive. Are not all face-to-face anymore.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

How do we solve the social science problem?

A

We use a model network that “looks like” a real network. This allows us to be able to measure and manipulate the features of the network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is the average path length? How do we find it?

A

For any two nodes, this is the shortest path between them (the distance). You measure the length between all the pairs of nodes and then take the average.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is the average degree? How do we find it?

A

This is the average number of links connected to a node. We add up the number of links that every node has and then take the average.

36
Q

What is the clustering coefficient? What is the formula?

A

For each node, this is the number of connections between neighbors of that node divided by the number of possible connections between those neighbors.

Is is the # of connections among friends OVER 
# of possible connections among friends 

But, there is a formula to find the bottom when the network is too big and you cannot count. It is:

N(N-1) OVER
2

37
Q

In a real life social network, what happens with the clustering coefficient?

A

It tends to be high, since my friends tend to be friends with one another. “Birds of a feather flock together”

38
Q

P in a regular and a random network. What happens in the middle? What happens when we increase randomness in a network?

A

The P (which is the rewiring possibility) in a regular network is 0. The P in a random network is 1. In the middle, after we rewire:

0 <= 0 <= 1

As we increase randomness, P gets closer to 1.

39
Q

Clustering and path lengths in a structured ring lattice network versus a random network. How are these combined in a real network?

A

Ring lattice:
- Clustering is high and path lengths are high.

Random network:
- Clustering is low and path lengths are low.

Real network:
- Clustering is high and path lengths are low.

40
Q

How does rewiring affect clustering and path lengths?

A

Rewiring makes path lengths shorter, but is does not affect clustering much.

41
Q

How does the mathematical formula change from a random network to a general network through contagion stuff? What is the 1/c?

A

cid > 1 , you divide c so id > 1/c and you set d to 1, so:

i > 1/c

The i/c is the t (tau) , which is the tipping point for a non-random general network. It is the bar something has to cross to spread. It will depend on a network as it is being rewired.

42
Q

What happens to the tipping point when networks are being rewired? What about in a real-life network? Whose model was this?

A

As we increase the randomness, the tipping point goes down and that makes contagion easier. This is the Watts-Strogatz Networks.

43
Q

What 2 concepts were found in Malcom Gladwell’s Six Degrees of Lois Weisberg?

A

The Law of the Few & The Strength of Weak Ties

44
Q

What is the law of the few?

A

It found that exceptional people who find out about a trend, through their social connections and energy, are able to spread things.

45
Q

What is the T(tau) network and how does it work mathematically?

A

It is how infectious something has to be to spread in the network.

Mathematically, it is i > 1/c and if the c is 4, then

i > 1/4.

46
Q

What happens in the T(tau) formula wen we make things less random, if it is i > 1/4?

A

As we make things less random, i goes up. So, as it moves from a random to a ring lattice, it becomes harder to spread. The tipping point will be greater in a ring lattice than it is in a random network because i is going up and it is harder to spread.

So, if the c is 6, then T(tau) is 1/6 and a ring lattice is i > 1/6 and in a random network, i < 1/6.

47
Q

What is network centrality trying to measure?

A

It attempts to measure who is important in enough to pay to tweet, to give a product to, to choose to follow, to endorse your product, etc.

48
Q

What are the 2 measure of centrality that we measured? Which one should we use?

A

Degree Centrality & Eigenvector Centrality. The one we use depends on what problem we are trying to study.

49
Q

What is degree centrality? What does it award?

A

It is its degree divided by n-1, where n is the total number of nodes in the network. It awards 1 point for every neighbor.

50
Q

When is degree centrality a good measure of influence? When is it not?

A

It is a good idea if it takes 1 step to reach a lot of people. It is a bad idea when it takes more than 1 step to reach a lot of people - doesn’t capture enough.

51
Q

What is eigenvector centrality? Why is it different than degree centrality?

A

It gives each node a score proportional to the sum of the scores of all of its neighbors. (Neighbors are not equal- some have a stronger influence)

52
Q

What is the mathematical equation for eigenvector centrality?

A

My score = K x (the sum of my neighbors)

53
Q

What happens when we find K in a network while finding the eigenvector centrality?

A

K will hold for all of the nodes.

54
Q

Mass Marketing Equation

A

Np = n

p- probability per contact
N- reach
n- how many people do it

55
Q

Degree Distributions for Connections

A

Most people have few connections, and a few people have a huge number of connections. This is otherwise known as the power law distribution.

56
Q

Twitter Cascades and the Power Law Distribution

A

Most tweets are reposted 0 times, so it is not actually passed on. The chance of a small cascade has a high probability, but the the chance of a big cascade has a very low probability.

57
Q

Can we predict influence (Statistical Model)?

A

You can look at the date from the past to predict the future. Look at their # of followers, # following, tweet frequency, time on twitter, and their past cascades to predict their future influence.

58
Q

What 2 factors predict Twitter influence?

A

Their # of followers and if they have had influence in the past.

59
Q

Bayes’ Rule and the Base Rate Fallacy (as studied in class)

A

Probability that something goes viral given that an influencer tweeted it. Probability that you are an influence given that it spreads is high, unlike the probability that is spreads given that you are an influence.

60
Q

Does contact matter when attempting to spread something?

A

NO.

61
Q

Would we rather do 1 person with 10,000 followers or 100 people with 100 followers?

A

We would do it with 100 people with 100 followers. It is more cost effective, as well as seeing it multiple times from a lot of people. Cascade sizes are not different whether they are an influencer or a regular person. It will depend more on the network than on the individual.

62
Q

What is the fundamental attribution error?

A

People tend to overemphasize the person, but often forget the context.

Ex: There was nothing special about Daphni Leef. There was already a huge discomfort with the housing prices and it just took 1 person to speak out and rebel against it for others to join. The network was already waiting for an individual.

63
Q

Ro = cid > 1 What happens when R is greater than 1?

A

The viral marketing effect was not a failure. It has to be greater than 1 for it to take off. It it is less than 1, it will die off.

64
Q

What is big seed marketing?

A

It is when you have a small seed that you send out to initial connections. As it continues down, it will eventually burn itself out, but it is still infecting a lot of people. This is done by starting out with traditional marketing and then tacking on the referral to get the seed.

65
Q

What are complex contagions?

A

If a person becomes infected after some threshold number (or fraction) of friends are infected. This takes into account our past interactions. They need clustering in order to invade and survive. Rewiring can cause it to be harder to infect, as some may only have 1 neighbor and they need 2 or more for it to spread.

66
Q

What are simple contagions?

A

Things that spread by simple contact, or one time, like a disease. Here, unlike in the complex contagions, we cannot choose to get infected.

67
Q

Simple Contagions & Random

A

Simple contagions are easier to spread in a random network because they only need 1 contact, so rewiring helps it spread quicker.

68
Q

Complex Contagions and Ring Lattice

A

Complex contagions are easier to spread in a ring lattice. Rewiring makes it harder to spread, since they need at least 2 and there is less clustering.

69
Q

Critical Mass

A

It is the threshold needed for a complex contagion to spread. Above, it will grow and below it will die. (Like Ro and the big seed marketing) BUT it is different than Ro because it requires certain amount of connections.

70
Q

Facebook Selective Release Strategy

A

They released it to certain people at a time. Harvard, then Ivy Leagues, then College then General. This allowed them to always have a low critical mass and high clustering, which made it easier to spread.

71
Q

Why did Google + fail?

A

It failed because you could only sign up if someone invited you to. Therefore, there was no critical mass. That would only work if it was a simple contagion. (Viral)

72
Q

What is the community detection algorithm?

A

Methods for detecting groups in a network based on the patterns of connections.

73
Q

Modularity

A

Given a division of the network nodes into groups (partition of the nodes), the modularity of the division is the fraction of edges within the group minus the fraction expected by change. A good division will have a high modularity.

74
Q

Structural Equivalence

A

It looks almost like classes. We are separated by the class and people like us, are on our level.

75
Q

What is human capital? What is the residual?

A

What gives you value as an individual. Accumulated training, eduction, experience, skills, etc.

The residual is what’s left over with those that are underperforming

76
Q

What are the clusters and bridges of a small worlds network? What are their clustering coefficients?

A

Clusters are a tight knit group of people who have a lot of connection between the groups, often by homophily. Bridges are long rage connections from moving or a random connection outside of your domain. Clusters have a high clustering coefficient and bridges have a low clustering coefficient.

77
Q

Social Capitals: Closure

A

It is a network consisting of a single cluster and has a high clustering coefficient. (This was Cliff) Their value is:

  • trust (group punishment)
  • efficiency (shared jargon)
  • known roles and what theyre good at
  • affirmation (think the way they do)
78
Q

Social Capitals: Broker

A

It is a network consisting of bridges connecting multiple clusters and has a low clustering coefficient. (This is Bob) Their value is:

  • diversity (different groups and a lot of perspective)
  • information (know a lot due to connections)
  • vision (can see a change coming)
79
Q

Brokers vs. Closures. How to succeed?

A

Typically, brokers do better and the underperformers tend to be closures. Closures tend to be echochambers. Brokers can only maximize themselves if they take advantage of the network, since they can be weak and non-jargon ties. You need broker connections acting like closure ties. It cannot be forced, or it will not work. Your friends connections have no effect on you, but you can look for structural gaps and fill those.

80
Q

What to do to create broker ties acting like closure ties?

A
  1. Maintain connections we already have by reaching out to them and when you face a tough challenge, they will be there for you.
  2. Shared activities are a proven way, since we all have passions, there are stakes involved, they are genuine, and removes away hierarchy.
81
Q

Centrality and directed networks

A

Pagerank does not do well in undirected networks. Eigenvector centrality is not useful in directed networks.

82
Q

What is Pagerank?

A

It is a lot like degree centrality, but it is rescaled. Is is only useful when there are directed networks. It splits up equally among all of the nodes its connected to.

83
Q

What is Betweenness centrality?

A

Each node gets a score proportional to the # of shortest paths that go through the node.

84
Q

3 biases that hinder us

A
  1. Overconfidence
  2. Anchoring
  3. Endowment Effect (we latch on to things we already have)
85
Q

Crowd Forecasts

A

They always beat its average members because of diversity. Some people are over and others are below the correct answer, but they cancel each other out and crowd error is lower. If all in the same direction, they have the same average error. If

86
Q

Crowd Error

A

The error of the crowd is always less than or equal to the average error of its members.

87
Q

Diversity Prediction Theorem

A

Crowd error = average error-diversity

Diversity matters just as much as accuracy. It is always true and the sample size does not matter. Add someone if they are different, not if you have someone a lot like them in the crowd already.