Data Management Flashcards

1
Q

Latin word meaning “the state”

A

status

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Came from a Latin word “status”

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

branch of science that deals with the collection, presentation, organization, analysis and interpretation of data
study of variation

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Empowers us to make intelligent choices

A

Information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

allows us to answer problems by giving a clear picture of a particular collection of elements

A

Statistical Inquiry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Collection of all elements under consideration

A

Population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Subset of a population from which raw data are being obtained

A

Sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The specification of interests depends on this.

A

Scope of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Characteristics or attributes of the elements in a collection

A

Variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

realized value of a variable

A

Observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

collection of observations

A

Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Data set consist of some basic measurements of individual items

A

Data Structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

3 basic ways of classifying data set

A
  1. number of variables
  2. kind of information
  3. time sequence/cross sectional
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

summary measure describing the specific characteristic of the population

A

Parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

summary measure describing the specific characteristics of the sample

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

organizing and summarizing data
deals with the techniques used in the collection, presentation, organization and analysis of the data on hand
used to say something or describe a set of information collected
represented with graphs

A

Descriptive Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

come up with generalizations or inferences about the population using the information in the selected sample
used to say something about a larger group using information collected from a small part of that group

A

Inferential Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

It is based on good procedures for producing data and thoughtful examination of data.

A

Effective interpretation of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

fancy way of saying we are estimating population values based on your sample data

A

Estimation Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

drawing conclusion about a population parameter
uses data to decide between two or more different possibilities
produces a definite decision

A

Hypothesis Testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

designed research to provide information needed to solve a research problem

A

Statistical Inquiry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

most important step in statistical study

A

Determining the Sample Size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

tells the researcher how sure the responses of the sample represent the population

A

Confidence Level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

how much a percentage points deviate from the real population value

A

Margin of Error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
applicable only when estimating a population proportion and when the confidence level is 95%
Slovin's Formula
26
Process of selecting a representative group from the population under study
Sampling
27
list of all items in your population complete list of everyone/everything you want to study specific
Sampling Frame
28
total group of individuals from which the sample might be drawn
Target Population
29
group of people who take part in an investigation, called "participants"
Sample
30
extent to which we can apply the findings of our research to the target population
Generalizability
31
Non Probabilistic Sampling Techniques
Convenience Sampling - subjective basis of selection Purposive Sampling - used in exploratory research Referral/Snowball Sampling - produces biased results Quota Sampling - hypothesis was generated
32
Probabilistic Sampling Techniques
Simple Random Sampling - random basis of selection Systematic Random Sampling - used in conclusive research Stratified Random Sampling - can make statistical inferences Cluster Sampling - hypothesis was tested
33
based on personal choice does not follow a randomization mechanism allow researcher to choose the elements in the sample subjectively
Non-Probabilistic Sampling
34
uses random selection wherein each element in the sampled population has equal chances of being selected
Probabilistic Sampling
35
Allows us to calculate an ideal sample size and appropriate especially in situations with larger population
Cochran's Sample Size Formula
36
4 things to consider when using Cochran's Formula
1. Population 2. Level of Precision (Margin of Error) 3. Confidence Level/ Risk Level 4. Standard Deviation/ Degree of Variability
37
the more sample you examine, the better the results will be
Law of Large
38
values are determined by chance
Variables
39
placed into distinct categories, categorical
Qualitative Variables
40
order or ranked, numerical
Quantitative Variables
41
used to give overview of data via rows and columns enables the reader to look up specific information used to display individual values and compare to other used to present more precise values of data
Tables
42
used to represent data by using vertical/horizontal bars used when the data we want to present are qualitative
Bar Graph
43
used to represent data that occurs over a specific period used to see patterns on the increase/decrease of values over time
Line Graph
44
used to represent data that are in percentage or proportion used to describe composition of data or how one part contributes to the whole
Pie Graph
45
most commonly used measure of central tendency what most people think of as average most appropriate when data are in interval or ratio scale only one value of the mean for the given set of values easily influenced by extreme values
Mean
46
extremely high or low value compared to other values
Outlier
47
value in the data set which occurs most frequently used for nominal data have one or more modes least reliable measure of center quick approximation of average
Mode
48
middle value of a given set of measurements not influenced by extreme values
Median
49
arrangement of values in an increasing or decreasing order
Array
50
Which average should be used in numerical data?
Mean/Median
51
Which average should be used in categorical data?
Mode/Median
52
not only specifying the measure of central tendency but also the measure of dispersion
Measure of Variation
53
measure of variation that is most appropriate to any numerical data
Standard Deviation
54
most appropriately used when you have two data sets with different unit of measurement you want to compare
Coefficient of Variation
55
Measures of variation are essentially ---- for categorical data
Non-existent
56
if categorical data, it is most appropriate to describe the variation by ---
Identifying extreme scores
57
used when the researcher wants to generalize about a population given a sample based on hypothesis
Statistical Tests
58
conjecture about the population parameter that may or may not be true Null/Alternative Hypothesis
Statistical Hypothesis
59
numeric characteristics computed from the sample from which the decision to reject or fail to reject the null hypothesis is based
Test Statistic
60
size of a risk of erroneously rejection the null that the researcher is willing to make
Level of Significance
61
assumptions of the statistical test are met sample size is large data are numerical
Parametric
62
at least one assumption of the statistical test is not met sample size is too small data are categorical
Non-Parametric
63
multiple comparison of the mean
ANOVA and Kruskall-Wallis Tests
64
pairwise comparison of the mean
Tukey's HSD and Dunn's Test
65
There is enough evidence to reject the claim
Reject Null Hypothesis
66
There is not enough evidence to reject the claim
Fail to Reject Null Hypothesis
67
There is enough evidence to support the claim
Reject Alternative Hypothesis
68
There is not enough evidence to support the claim
Fail to Reject Alternative Hypothesis