Intro Flashcards

1
Q

Bigdata- statistics
3V?

A

3V -volume, velocity, variety
Volume - the amount of available data
Velocity- the speed of collecting and processing data
Variety -different data types.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can big data do?

A

Hold great promises for understanding
Commonality: in the presence of large variations. (Noises)
Heterogeneity: personalized medicine or services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Applications in business and economics

A

Accounting会计
Finance金融
Economics经济
Marketing(营销)
Operation(运营)
Information systems信息系统

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is average?
What is average for?

A

Average is mean median, mode
Average is for to measure the central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mean, mode, median?

A

Arithmetic mean,
Mode is most frequent value in the data set
Median is middle value that separates the higher half from the lower half

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data is raw facts and figures

A

Categorical(qualitative)- nominal, ordinal scale
Quantitative- interval, ratio scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Simpson’s paradox

A

There is trend in several different sets of data, but when these data sets are combined, this trend disappears or reverses.
Confounding variables: because school is a hidden variable that cannot be ignored.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

累积分布的类似(cumulative distribution )

A

Cumulative frequency(累积频数)
Cumulative relative frequency(累积相对频数)
Cumulative percent frequency(累积百分频数)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

累积分布是?

A

Cumulative frequency: buleg buriiin deed hyzgaartai tentsuu buyu tuunees baga zuilsiin toog haruuln
Cumulative relative frequency: buleg buriin deed hyzgaartai tentsuu buyu tuunees baga zuilsiin haritsangui davtamjiig haruuln
Cumulative percent frequency:buleg buriin deed hyzgaartai tentsuu buyu tuunees baga zuilsiin percent g haruuln

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

累积分布等于:

A

Cumulative frequency:hurimtlagdsan davtamjiin tarhaltiin suulchiin utga ni ajiglaltuudiin niit tootoi urgelj tentsuu
Cumulative relative frequency : hurimtlagdsan haritsangui davtamjiin tarhaltiin suulchiin utga ni urgelj 1.00tei tentsuu bn
Cumulative percent frequency: hurimtlagdsan percent davtamjiin tarhaltiin suulchiin utga ni urgelj 100tai tentsuu bn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

画stem and leaf display(茎叶图)

A

P.28
Durslel deerh mur buriig ish gej nerlene
Ish bur deerh tsifruud ni navch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Navchnii negj?

A

Navchnii hed ch bj bln
Navchnii negjiig haruulaagu tohioldold 1tei tentsuu gej uzn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Cross tabulation(交叉分组表)

A

Cross tabulation is a useful analysis tool commonly used to compare the results for one or more variables with the results of another variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

交叉分组表的remark

A

Remark: when the cross tabulation involves aggregated data, we should investigate whether a hidden variable could affect the results. (Like Simpson’s paradox)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

在Data上有几个类似 Data

A

Categorical Data
Quantitative Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Categorical data ‘s tabular displays, graphical displays

A

Tabular displays -frequency dist, Rel freq dist, % freq dist, cross tabulation
Graphical displays -bar chart, pie chart, side by side bar chart, stacked bar chart

17
Q

Quantitative Data?
Tabular displays, graphical displays

A

Tabular displays -freq dist, rel freq dist, % freq dist, cum freq dist, cum rel freq dist, cum % freq dist, cross tabulation
Graphical displays -dot plot, histogram, stem and leaf display, scatter diagram