Module 3 Flashcards

1
Q

Data analytics from start to finish: the tools of BA range from collecting the data (the start) to analyzing the data (the finish).
This includes the following:

A
  1. the data analytics process
  2. the importance of statistics
2
Q

Data analytics process: what do we do with data?

A
  -collect
  -clean
  -organize
  -analyze
  -communicate
3
Q

Measures of the data: what do we study?

A
  1. measures of the center of the data: mean, median, and mode
  2. measures of variation: range and standard deviation
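
The range is the one measure on this card that has no card of its own, so here is a minimal sketch of it, assuming the usual definition (largest value minus smallest value). The `scores` list and the helper name `value_range` are made up for illustration.

```python
def value_range(xs):
    """Range = largest value minus smallest value."""
    return max(xs) - min(xs)

scores = [55, 70, 62, 90, 81]   # hypothetical sample data
print(value_range(scores))      # 90 - 55 = 35
```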
4
Q

mean

A

the most common measure of center
mean = sum of values divided by the number of values
most affected by outliers
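
A small sketch of the formula above and of the outlier effect. The numbers are invented, and `arithmetic_mean` is a hypothetical helper name, not something from the course.

```python
def arithmetic_mean(xs):
    """Sum of the values divided by the number of values."""
    return sum(xs) / len(xs)

values = [10, 12, 11, 13, 14]        # made-up data
with_outlier = values + [100]        # same data plus one extreme value

print(arithmetic_mean(values))        # 60 / 5 = 12.0
print(arithmetic_mean(with_outlier))  # 160 / 6, roughly 26.67
```

One extreme value more than doubles the mean here, which is what "most affected by outliers" refers to.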

5
Q

The median

A

-middle number when the values are sorted
-less sensitive to outliers
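
To see the lower sensitivity to outliers, the following sketch compares the median and the mean on invented data, using Python's standard `statistics` module.

```python
from statistics import mean, median

values = [10, 12, 11, 13, 14]    # made-up data
with_outlier = values + [100]    # same data plus one extreme value

print(median(values))            # 12
print(median(with_outlier))      # 12.5 (average of the two middle values)
print(mean(values), mean(with_outlier))  # the mean moves far more
```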

6
Q

mode

A

occurs most often
not affected by outliers
may be no mode or several modes
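
A sketch of the "no mode or several modes" point. The helper `modes` is hypothetical, and it assumes the convention that a data set in which every value occurs exactly once has no mode.

```python
from collections import Counter

def modes(xs):
    """Return every value that occurs most often (empty list if no value repeats)."""
    if not xs:
        return []
    counts = Counter(xs)
    top = max(counts.values())
    if top == 1:
        return []                      # assumed convention: no repeats means no mode
    return [v for v, c in counts.items() if c == top]

print(modes([1, 2, 2, 3]))       # [2]      one mode
print(modes([1, 2, 2, 3, 3]))    # [2, 3]   several modes
print(modes([1, 2, 3]))          # []       no mode
```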

7
Q

sample standard deviation

A

most commonly used measure of variation
shows variation about the mean
square root of variance
in Excel: STDEV
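
A sketch of the "square root of variance" relationship, assuming the usual sample formula with an n - 1 divisor. The data and the name `sample_stdev` are made up; `statistics.stdev` from the standard library is used only as a cross-check.

```python
from math import sqrt
from statistics import stdev

def sample_stdev(xs):
    """Square root of the sample variance (divisor n - 1)."""
    n = len(xs)
    m = sum(xs) / n
    variance = sum((x - m) ** 2 for x in xs) / (n - 1)
    return sqrt(variance)

data = [2, 4, 4, 4, 5, 5, 7, 9]   # made-up data; mean is 5, sample variance is 32/7
print(sample_stdev(data))
print(stdev(data))                # should agree with the manual version
```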

8
Q

Histogram
-representation of the ___________ of ______ data
-great for checking how the data is ____

A
  -distribution, numerical
  -spread
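
As a rough illustration of what a histogram counts, this sketch groups numerical values into equal-width bins and prints a text bar per bin. The `ages` data, the bin width, and the helper `bin_counts` are all invented for the example.

```python
def bin_counts(values, bin_width, start):
    """Count how many values fall into each equal-width bin, keyed by bin lower edge."""
    counts = {}
    for v in values:
        lower = start + ((v - start) // bin_width) * bin_width
        counts[lower] = counts.get(lower, 0) + 1
    return dict(sorted(counts.items()))

ages = [21, 23, 25, 31, 34, 35, 36, 42, 47, 58]   # made-up data
for lower, n in bin_counts(ages, bin_width=10, start=20).items():
    print(f"{lower}-{lower + 9}: {'#' * n}")
```

The bar lengths show where the values cluster and how far they spread.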
9
Q

Correlation
-measures whether two variables _____
-a ___ correlation means they are unrelated
-a ____ correlation means they move in the same direction
-the _____ is a good way to visualize correlation
-correlation is not ____

A
  -move together
  -0
  -positive
  -XY scatterplot
  -causation
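
A sketch of how the sign of a correlation reflects whether two variables move together, assuming the Pearson correlation coefficient is the measure meant here. The data and the name `pearson_r` are made up.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

x = [1, 2, 3, 4, 5]
print(pearson_r(x, [2, 4, 6, 8, 10]))   # close to +1: move in the same direction
print(pearson_r(x, [10, 8, 6, 4, 2]))   # close to -1: move in opposite directions
print(pearson_r(x, [2, 1, 0, 1, 2]))    # close to 0
```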
10
Q

Two types of categorical variables

A

Input variables
-group data into segments
-coded as 0/1

Outcome variables
-group data into two outcomes
-essential to the concept of probability

11
Q

hypothesis testing
method to decide whether the data in hand (____) sufficiently support a particular hypothesis about population parameters
a hypothesis test makes _____ statements about ____ parameters

A
  -samples
  -probabilistic, population
12
Q

p-value
if p < .05, we ____ H0
if p > .05, we ____ H0

A
  -reject
  -retain
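
The decision rule on this card can be sketched as a tiny function. The name `decide` is hypothetical, and treating a p-value exactly equal to .05 as "retain" is an assumption, since the card only covers the cases below and above .05.

```python
def decide(p_value, alpha=0.05):
    """Apply the card's rule: reject H0 when p is below alpha, otherwise retain it."""
    # Assumption: p exactly equal to alpha is treated as "retain".
    return "reject H0" if p_value < alpha else "retain H0"

print(decide(0.01))   # reject H0
print(decide(0.20))   # retain H0
```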
13
Q

Type 1
Type 2
Type 3

A
  1. paired: both samples come from the same group, which is compared with itself
  2. comparing sample 1 with sample 2: two populations with equal variances
  3. comparing sample 1 with sample 2: two populations with unequal variances
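
Assuming the three types on this card are the paired, equal-variance (pooled), and unequal-variance (Welch) forms of the two-sample t-test, the sketch below computes the t statistic for each. The function names and data are invented, and the sketch stops at the t statistic; it does not compute a p-value.

```python
from math import sqrt
from statistics import mean, stdev, variance

def t_paired(a, b):
    """Type 1: one group measured twice; test the mean of the pairwise differences."""
    d = [x - y for x, y in zip(a, b)]
    return mean(d) / (stdev(d) / sqrt(len(d)))

def t_equal_var(a, b):
    """Type 2: two independent samples, variances assumed equal (pooled)."""
    n1, n2 = len(a), len(b)
    pooled = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / (n1 + n2 - 2)
    return (mean(a) - mean(b)) / sqrt(pooled * (1 / n1 + 1 / n2))

def t_unequal_var(a, b):
    """Type 3: two independent samples, variances not assumed equal (Welch)."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))

before = [10, 12, 14, 16]   # made-up measurements on the same group
after = [11, 14, 15, 18]
print(t_paired(after, before))
print(t_equal_var(after, before))
print(t_unequal_var(after, before))
```

With equal sample sizes, the pooled and Welch statistics coincide; the two tests then differ only in their degrees of freedom.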
14
Q

Tails

A

2-tail: whether two populations are different from one another
1-tail: whether one population mean is greater than or less than the other
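
To illustrate how the number of tails changes the p-value, this sketch uses a standard normal (z) approximation rather than a t distribution; that substitution is an assumption made to keep the example within Python's standard library. The function names are hypothetical.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()   # mean 0, standard deviation 1

def p_two_tailed(z):
    """Probability of a statistic at least this extreme in either direction."""
    return 2 * (1 - STANDARD_NORMAL.cdf(abs(z)))

def p_one_tailed_greater(z):
    """Probability of a statistic at least this large (tests 'greater than')."""
    return 1 - STANDARD_NORMAL.cdf(z)

z = 1.96                          # made-up test statistic
print(p_two_tailed(z))            # about 0.05
print(p_one_tailed_greater(z))    # about 0.025, half the two-tailed value
```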
