Statistics Flashcards

1
Q

What are statistical measures?

A

Measure of frequency - Histogram and Frequency Distribution
Measure of central tendency - Mean, Median and Mode
Measure of spread - Standard deviation, Variance and Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Mean formula

A

μ = ∑N / Nsum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Median formula

A

μ = N / Nsum or μ = N+1 / Nsum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Mode formula

A

Max count of data point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Standard deviation formula

A

√ ∑(μ -xi)2 / N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Normal distrbution curve what is the % of values distributed?

A

68.2% = -1sd to +1sd
13.6% = -1sd to -2sd and +1sd to 2sd
1.7% = -2sd to -3sd and +2sd to 3 sd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Formula for Linear Regression

A

y=b0(intercept) + b1x1(coefficientvariable) + E(error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Output of logistic regression is binary, true or false?

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is log transformation?

A

Process of transforming a Non linear curve to a linear curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sigmoid Curve

A

Its a non linear curve that is part of Logistic regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which Python lib is used for data wrangling?

A

Pandas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which Python lib is used for machine learning?

A

Ski-kit learn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which Python lib is used for statistical functions?

A

NumPy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between List and Tuple?

A

List is defined by [] and Tuple by (). List can be modified and Tuple cannot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is array in Python?

A

Its a list which is understable by NumPy lib

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

arr[1:2]. What index value is inclusive and what is not?

A

1 is inclusive and 2 is noy

17
Q

How to create arithmatic progression in NumPy array?

A

Using arrange function

18
Q

How to convert a 1dim array to 2 dim array in NumPy?

A

Using reShape

19
Q

Joint Probability (Prob of A and B occureing simultaneously)
Conditional Probability (Prob of A cosidering B has occurred)
Marginal Probability (Prob of A irresepective of B)

A

P(A)*P(B)
P(A intersection B) / P(B)
P(A and B)/P(B)

20
Q

What is probability distribution

A

A probability distribution provides a list of all values
that the random variable can take along with the
probability of each value occuring.

21
Q

Discreet Random Variable vs Continuous Variable

A

A discrete random variable can only take a very specific
value out of a predefined set of values. For example a
throw of a dice can only have 1 of 6 possible values, a
coin toss can only have 1 of 2 possible values.

A continuous random variable, can take any value
within a certain range, for example a mileage of a car,
weight of a person etc.

22
Q

Bernoulli distribution
Vs
Binomial distrbution
Vs
Uniform Distribution
Vs
Poisson distribution
Vs
Normal Distribution
Vs
Exponential Distribution

A

Single trial with only 2 possible outcomes
Vs
Multiple trial with only 2 possible outcomes
Vs
Equal probability of an event out of any trial
Vs
The number of events occurring in a certain time
interval
Vs
Most values of the above events fall in the middle
Vs
To predict the time till when the next event will occur