Week 1 Flashcards
Main Characteristics of Social Medias are?
- Participation
- Openness
- Conversation
- Community
- Connectedness
POCCC
What is a social media metrics used for?
Quantifying how much someone is influential, how fast someone is spreading info to others in a social network.
What are the three types of Social Media Use
Broadcast / Sending Information
Request for Feedback / Input
Conversation / Interaction
Why did the Philadelphia Police Tweet work?
Use social media mostly for broadcast
They do not tweet overwhelm people
too often
Reach out on multiple channels with clear req feedback in more traditional ways
What is Astroturfing?
the art of faking popularity
How can Astroturfing be detected
These accounts can be detected using structural social network analysis, using retweet and follow behaviour
How to measure success?
Counting - # of fans, followers, friends, views, link etc
Social Sharing - # of retweet, mentioned, share
Engagement Rate
Interaction - # of customers engaged, conversation length
Referral Rates - How much traffic is received to biz site from SM
Influence of Users - Centrality
Engagement Rate Formula
of engagement activities (likes, shares) / # of friends
What are some example of Data hooks
Gestures (to others)
Interaction with environment
Chats (textual & verbal)
What does Hook do?
Hooks collect data & send to logs
Explain the 3 levels of funnel
Top of the funnel - Visitor Researching : Broad Keyword
Mid Funnel - Visitor is Engaged : Specific Keyword
Bottom Funnel - Visitor is Ready to buy : Exact Keyword
Why do we create and sustain networks?
Self interest Social & resource exchange Mutual interest & collective action Contagion Balance Homophily Proximity Co-evolution
What is the main driver of Networks Evolution?
Homophily is one of the fundamental
patterns of how social networks are structured
Why is extreme Homogenization bad?
Extreme homogenization can prevent innovation and new idea generation (people in a cluster all think alike)
Framingham Social Network
Happy people tend to be located in the center of their local social networks and in large clusters of other happy people!
Happiness, in other words, is not merely a function of individual experience or individual choice but is also a property of groups of people.
What is Random Social Network?
Random network models introduce an edge between any pair of vertices with a probability p
What is Real Social Network?
Real networks tend to have a relatively few nodes of high connectivity
Scale free refers to the distribution principle of how many links there are per node
The distribution follows Power Law
few nodes of high connectivity is also called
Hub node
What is the Power Law
Relationship between TWO quantities - One quantity varies as a power of the other
# of cities with a certain population size (y-axis) Varies as a power of the size of the population (x-axis)
What is Cloud Computing?
Cloud computing is the
on-demand delivery of compute power, database storage, applications, and other IT resources
through a cloud services platform via the internet with
pay-as-you-go pricing.
3 models of cloud computing
- IaaS - Infrastructure
- Paas - Platfrom
- Saas - Software
What is IaaS
you manage the server, which can be physical or virtual, as well as the operating system (Windows or Linux). In general, the data center provider has no access to your server.
Basic building block of IaaS
Basic building blocks for cloud IT include:
Networking features
Compute, and
Data storage space
What is PaaS
someone else manages the underlying hardware and operating systems. This enables you to run applications without managing underlying infrastructure (for example – patching, updates, maintenance, hardware and operating systems). PaaS also provides a framework for developers that they can build upon to create customized applications.
What is SaaS
you manage your files, while the service provider takes care of all of the data centers, servers, networks, storage, maintenance, patching, etc. All you worry about is the software and how you want to use it. You are provided with a complete product that is run and managed by the service provider. Facebook and Dropbox are examples of SaaS. You manage your Facebook contacts and Dropbox files, and the service providers manage the systems.
3 cloud deployment model
“All-In” Cloud
hybrid
private
What is the “All-In” Cloud ?
“All-In” Cloud is a cloud-based application that is fully deployed in the cloud, and all parts of the application run in the cloud. Applications in the cloud have either been created in the cloud or have been migrated from an existing infrastructure. Cloud-based applications can be built on low-level infrastructure pieces (for example, networking, compute or storage) or can use higher-level services that provide abstraction from the management, architecting, and scaling requirements of core infrastructure.
What is the hybrid deployment
A hybrid deployment is a way to connect infrastructure and applications between cloud-based resources and existing resources that are not located in the cloud. The most common method of hybrid deployment is between the cloud and existing on-premises infrastructure (sometimes called on-prem). On-premises infrastructure is located within the physical confines of an enterprise, often in the company’s data center. A hybrid deployment model is used to extend an organization’s infrastructure into the cloud while connecting cloud resources to an internal system.
What is private cloud/on premise?
When you run a cloud infrastructure from your own data center, that’s called on-premises or private cloud. While this kind of deployment lacks many of the benefits of cloud computing, it does provide dedicated resources and is a popular choice for organizations who need to meet certain compliance standards. In most cases, this deployment model is the same as legacy IT infrastructure while using application management and virtualization to increase resource utilization.
Advantage of All in cloud
No upfront investment Low ongoing costs Focus on innovation Flexible capacity Speed and agility Global reach on demand
Disadvantage of On-Premise/Private Cloud
Large initial purchase Labor, patches, and upgrade cycles Systems administration Fixed capacity Long procurement cycle and setup Limited geographic regions
What can you do in the cloud?
- Application Hosting for an on-demand infrastructure to host internal or SaaS applications.
- Backup and Storage to store data and build dependable backup solutions.
- Content Delivery to distribute content worldwide, with high data transfer speeds.
- Host static and dynamic websites.
- Enterprise IT to host internal- or external-facing IT applications in AWS’s secure environment.
- Use a variety of scalable database solutions, from hosted enterprise database software to non-relational database solutions.
Explain On-Premise Infrastructure
Security:
Firewalls
ACL
Administrator
Networking:
Router
Network Pipeline
Switch
Servers:
On Premises Server
Storage and Db: DAS SAN NAS RDBMS
Explain Amazon Web Service Infrastructure
Security:
Security Groups
Network Access Control List
Identity Access Management
Networking:
Elastic Load Balancing
Amazon VPC
Servers:
Amazon Machine Image
Amazon EC2 Instance
Storage and Db: Elastic Block Store Elastic File System Amazon S3 Amazon RDS
Concepts : High Availability
High availability refers to a resource that is accessible when you attempt to access it. For example, if every time you go to the ATM to make a withdrawal it works as expected the ATM is highly available; however, if you go to use it and there is a sign on the front that says “Out of Order”, it is not highly available.
Concepts : Fault Tolerance
Fault tolerance is the ability to withstand a certain amount of failure and still remain functional. It also refers to the ability of a system to be self-healing and return to full capacity despite a failure. It is the ability of a system to fail in some way but still remain functional.
Concepts : Scalable
Scalability is the ability to easily grow in size, capacity, and/or scope when required particularly in response to demand. If something cannot quickly grow in an easy manner it is not scalable.
Concepts : Elastic
Elasticity is the ability to not only grow (or scale) when required, but also to reduce or contract in size as needed. A system that is elastic can scale to grow as needed usually based on demand and contract as demand decreases.
Explain Capital Expense
Funds used by a company to acquire, upgrade, and maintain physical assets such as property, industrial buildings, or equipment.
Explain Variable Expense
A variable expense is an expense that is easily altered or avoided by the person bearing the cost.
By using the cloud, businesses won’t have to invest money into data centers and servers, and can pay for only what they use in a pay-as-you-go fashion. This lets businesses save money on technology and enables them to adapt to new applications with as much space as needed in minutes, rather than weeks or days. Maintenance is reduced so the business can spend more time focusing on the core goals of the business.
Benefit of Cloud Computing
Trade capital expense for variable expense.
Benefit from massive economies of scale.
Eliminate guessing on your capacity needs.
Increase speed and agility.
Stop spending money to run and maintain data centers.
Go global in minutes.
What are Web Services?
A web service is any piece of software that makes itself available over the internet and uses a standardized format (XML or JSON) for the request and the response of an API interaction.
How can you access AWS services?
You can access them in any of three ways: using the AWS Management Console, the AWS Command Line Interface (AWS CLI), and via Software Development Kits (SDKs).
What is a network?
A network is a graph, or a collection of points connected by lines
What are other ways to call a Node
Points are referred to as nodes, actors (social setting) , or
vertices (plural of vertex)
What do you call a directed edge
A directed edge is sometimes called an arc
Edges CAN have direction (T/F)
True
For any node, the set of nodes it is connected to via an edge is called
Neighborhood
number of edges connected to one node is called
the degree of the node (the size of its neighborhood)
Degree in an undirected graph equation
Theorem 1. The summation of degrees in an undirected graph is twice the number of edge
2|𝐸| 𝑖
Degree in a directed graph equation
Lemma 1. In any directed graph, the summation of in-degrees is equal to the summation of out- degrees:
When dealing with very large graphs, how nodes’ degrees are distributed is an important concept to analyze and is called?
degree probability 𝒑𝒅
𝑝𝑑 =𝑃(𝑑𝑖 =𝑑) = 𝑛𝑑 / 𝑛
We are seeking representations that can store
graphs in a way such that
Does not lose information
Can be manipulated by computers
Can have mathematical methods applied easily
What time of Adjacency matrices does Social network have?
Social networks have very sparse Adjacency matrices
How is adjacency list sorted?
The list is usually sorted based on the node order
or other
What is an Edge List?
In this representation, each element is an edge and is usually represented as (𝑢, 𝑣), denoting that node u is connected to node v via an edge
(v1, v2)
(v2, v3)
(v2, v4)
….
What is a null graph
A null graph is one where the node set is empty. (No nodes)
What is an empty graph/edge-less graph
An empty graph or edge-less graph is one where the edge set is empty, 𝐸 = ∅
A null graph is not an empty graph
False, A null-graph is an empty graph.
The adjacency matrix for undirected graphs is not symmetric (T/F)
False, The adjacency matrix for undirected graphs is symmetric
The adjacency matrix for directed graphs is not symmetric (T/F)
True, The adjacency matrix for directed graphs is not symmetric
What is a Simple Graph?
Simple graphs are graphs where only a single edge can be between any pair of nodes
What is a Multigraph?
Multigraphs are graphs where you can have multiple edges between two nodes and loops (self- links)
What is a Webgraph?
A webgraph is a way of representing how internet sites are connected on the web
Webgraph is also known as a
directed multigraph (Two sites can have multiple links pointing to each other and can have loops (links pointing to themselves))
When weights are binary (0/1, -1/1, +/-) we have _____ graph
signed graph
Two nodes are adjacent if they are connected via an edge. (T/F)
True
A graph is connected, if there exists a path between any pair of nodes in it (T/F)
True
A graph is strongly connected if there exists a path between any pair of nodes, without following the edge directions
False, A graph is WEAKLY connected if there exists a path between any pair of nodes, without following the edge directions
(If v5 connects to v6 even if its through other nodes it is strong)
Trees are special cases of…..
undirected graphs
A tree is a graph structure that has no____ in it
In a tree, there is exactly one path between any
pair of nodes hence A tree is a graph structure that has no cycle in it
A set of disconnected trees is called
A forest
What is a complete graph
A complete graph is a graph where for a set of nodes 𝑉, all possible edges exist in the graph
In a complete graph, any pair of nodes are connected via an edge
What is a Bipartite Graph
Bipartite Graph is a graph where the node set can be partitioned into two sets such that, for all edges, one end-point is in one set.
DFS search enforces a stack or queue structure?
Stack Structure
The algorithm can be used both for trees and graphs
Each AWS region is a geographical area and is made up of _____________
Each Region is made up of two or more Availability Zones.
AWS has ____ regions worldwide
18
What is Centrality
Centrality defines how important a node is within a network
The degree centrality measure ranks nodes with
more connections higher in terms of centrality
Normalized Degree Centrality types
• Normalized by the maximum possible degree
d / (n-1)
• Normalized by the maximum degree
d / (max d)
• Normalized by the degree sum
d/ 2E
What is Eigenvector centrality
Generalizes degree centrality by incorporating the importance of the neighbors (undirected)
What is PageRank centrality
We can divide the value of passed centrality by the number of outgoing links, i.e., out-degree of node
Each connected neighbor gets a fraction of the source node’s centrality
What is Closeness Centrality
Measures social distance from you to others
Closeness in effect measures how fast you can spread info/epidemics to the rest of the network.
In a non-weighted network, distance is the total number of links existing between two nodes (in the shortest path).
In a weighted network, distance is a function of the link/edge weights along the shortest path between two
nodes.
For every node:
1 / (v1 to v2 + v1 to v3 + v1 to v4…) =
1 / (v2 to v1 + v2 to v3 + v2 to v4…) =
What is Betweenness centrality
Measure of brokerage.
For one community to pass info/epidemics to another, they must go through you. This makes you very important.
Formula:
c / (n-1)(n-2) for every node that pass through you