Thinking Probabilistically; Chp. 1 Flashcards

1
Q

What is experimental design?

A

A branch of statistics that deals with data collection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explain Exploratory Data Analysis: EDA

A

This is about numerical summaries, such as the mean , mode, standard deviation, and interquartile ranges. EDA is also about visually inspecting the data, using tools you may be already familiar with , such as histograms and scatterplots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explain Inferential Statistics

A

this is about making statements beyond the current data. We may want to understand some particular phenomenon, or maybe we want to make predictions for the future data points, or we want to choose among several competing explanations for the same observations. Inferential statistics is a set of methods and tools that will help us to answer these types of questions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

If we are the ones in charge of generating or gathering the data , it is always good to …..?

A

first think carefully about the questions we want to answer and which methods we will use and only than proceed to get the data. Why?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does the word ontological mean?

A

adjective: ontological
1. relating to the branch of metaphysics dealing with the nature of being.
“ontological arguments”
2. showing the relations between the concepts and categories in a subject area or domain.
“an ontological database”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does stochastic mean?

A

adjective : TECHNICAL
adjective: stochastic
randomly determined; having a random probability distribution or pattern that may be analyzed statistically but may not be predicted precisely.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data does not speak but through …?

A

Models. We need to interpret data in the context of models. Including metal and formal ones.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Models are …?

A

simplified descriptions of a given system or process that, for some reason , we are interested in.

(Those descriptions are deliberately designed to capture only the most relevant aspects of the system and not to explain every minor detail)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the first step in the Bayesian modeling process?

A
  1. Given some data and some assumptions on how this data could have been generated, we design a model by combining building blocks known as probability distributions. Most of the time these models are crude approximations, but most of the time it is all we need.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the 2nd step of the bayseian modeling process?

A
  1. We use bayes’ theorem to add data to our models and derive the logical consequences of combining the data and our assumptions. We say we are conditioning the model to our data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the 3rd step of Baysesian modeling?

A

We criticize the model by checking whether the model makes sense according to different criteria, including the data , expertise on the subject, and sometimes by comparing several models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe what an event is in Bayesian statistics.

A

an event is just any of the possible values(or subset of values) a variable can take.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Probabilities are numbers in the _________ interval?

A

[0,1]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe the product rule of probability:

A

p(a,b) = p(A | B)p(B) . the probability of A and B is equal to the probability of A given B, times the probability of B. This is only true if A and B are independent of each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Probability models are built by ……

A

properly combining probability distributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A probability distribution is a mathematical object that ….?

A

describes how likely different events are

17
Q

A common and useful conceptualization in statistics is to think data is generated from some …?

A

true probability distribution with unknown parameters. Why ?

18
Q

There are two types of random variables. What are they?

A

Continuous Variables and discrete variables

19
Q

When dealing with continuous distributions is that the values plotted on the y axis are not probabilities but ….?

A

probability densities

20
Q

What are the major components of Baye’s Theorem?

A
  1. Prior
  2. Likelihood
  3. Posterior
  4. Marginal Likelihood
21
Q

Describe the Prior:

A

the prior distribution should reflect what see/know about the value of theta or Hypothesis before seeing the data.

if we know nothing, we could use flat priors that do no convey too much information. Generally we can do better than flat priors.

Why?

22
Q

Describe the Likelihood:

A

The Likelihood is how we introduce data in our analysis. It is an expression of the plausibility of the data given the parameters/hypothesis.

23
Q

Describe the posterior distribution:

A

The posterior distribution is the result of bayesian analysis and reflects all that we know about a problem( given our data and model).

The posterior is a probability distribution of the parameter/hypothesis in our model and not a single value.

This distribution is a balance of the prior and likelihood.

Why?