1.2 Organizing, Visualing, and Describing Data Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Numerical data (a.k.a. quantitative data)

A

Values that represent measured/counted quantities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

two types of numerical data

A

continuous data and discrete data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

continuous data

A

Data may take on any numerical value in a specified range of values

ex: Any value between 0 and 1 (Infinite number of possibilities)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

discrete data

A

Data may only take on a countable number of values

ex: 0, 0.5, and 1 (Only 3 possible values)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Categorical data (a.k.a. qualitative data)

A

Values that describe the characteristic of a group of observations

For example, companies can be classified into bankrupt vs. not bankrupt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

two types of categorical data

A

nominal data and ordinal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is a variable

A

Characteristic/quantity that can be measured and is subject to change (e.g., stock price)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is an Observation

A

A value of the variable that is collected (e.g., stock price yesterday was $30)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Cross-sectional data

A

observations that capture characteristics of different units at a specific point in time

An example of this is a list that shows the current dividend yields of different FTSE 100 companies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Time-series data

A

observations of the same unit at different points in time

An example of this is a list that shows the dividend yield of an FTSE 100 company over the past 10 years

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Panel data

A

a mix of time-series and cross-sectional data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Structured data

A

highly organized in a pre-defined manner with repeating patterns

They are relatively easy to store, search, and analyze

Common examples of structured data include market data and fundamental data stored in Excel databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

unstructured data

A

do not follow any conventionally organized forms

They typically require manual processing prior to being analyzed by financial models

Common examples of unstructured data include text (from financial news), audio, video, and photo

helped byalternative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

alternative data

A

the data generated through unconventional sources (e.g., individual social media posts, satellite imagery, etc.), drives the availability and importance of unstructured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Raw data

A

data available in the original form as collected

They normally cannot be used directly to extract information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

the first step to be able to use raw data

A

usually to organize them into a one-dimensional array or two-dimensional array

17
Q

one-dimensional array

A

suitable for a single variable

For example, a one-dimensional array can be built to show the annual return of the S&P 500 index for the past 10 years.

–> This is appropriate because the annual return is the only variable that needs to be evaluated

18
Q

A two-dimensional rectangular array (or data table)

A

used to analyze multiple variables

For example, in addition to the annual return, a data table can also show the dividend yield and earnings yield of the S&P 500 index for the 10-year period

19
Q

An analyst uses a software program to analyze unstructured data—specifically, management’s earnings call transcript for one of the companies in her research coverage. The program scans the words in each sentence of the transcript and then classifies the sentences as having negative, neutral, or positive sentiment. The resulting set of sentiment data would most likely be characterized as:

A. ordinal data.

B. discrete data.

C. nominal data.

A

A. ordinal data.