DATA TYPES Flashcards

1
Q

Definition of DATA

A

can be defined as a collection of numberpanel data, characters, words, and text—as well as images, audio, and video—in a raw or organized format to represent facts or information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Three differen
perspectives of classifications

A

numerical versus categorical data; cross-sectional vs.
time-series versus panel data; and structured versus unstructured data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Numerical data

A

are values that represent measured or counted quantities as a number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Numerical data are also called

A

quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Numerical (quantitative) data can be split into
two types:

A

continuous data and discrete data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Continuous data

A

are data that can be measured and can take on any numerical
value in a specified range of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Examples of Continuous data

A

future value of a lump-sum, price returns of a stock

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Discrete data

A

are numerical values that result from a counting process (finite number of values)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Categorical Data

A

also called qualitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Examples of categorical data

A

bankrupt vs. not bankrupt and dividends increased
vs. no dividend action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Nominal Data

A

are categorical values that are not amenable to being organized in a logical order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Example of Nominal Data

A

the classification of publicly
listed stocks into 11 sectors (GICS - Global Industry Classification Standard)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

MSCI

A

Morgal Stanley Capital Internacional - 11 sectors, 24 industry groups, 69 industries,
and 158 sub-industries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Often, financial models, such as regression models, require input data to be numerical

A

so, nominal data in the input dataset must be coded numerically before applying an
algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Ordinal data

A

are categorical values that can be logically ordered or ranked

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Cash dividends per share are

A

continuous data

17
Q

Number of coupon payments is

A

discrete data.

18
Q

Credit ratings are

A

ordinal data (arithmetic operations cannot be done on credit ratings)

19
Q

Hedge fund classification types

A

are nominal data

20
Q

Another data classification standard is based on how data are collected

A

cross-sectional, time series, and panel.

21
Q

A variable is

A

a characteristic or quantity that
can be measured, counted, or categorized and is subject to change. A variable can also
be called a field, an attribute, or a feature.

22
Q

Examples of data variable

A

For example, stock price, market capitalization, dividend and dividend yield, earnings per share (EPS), and price-to-earnings
ratio (P/E) are basic data variables for the financial analysis of a public company

23
Q

Cross-sectional data

A

are a list of the observations of a specific variable from multiple observational units at a given point in time. For example, January inflation rates (i.e., the variable) for each of the euro-area countries (i.e., the observational units) in the European Union for a given year constitute cross-sectional data.

24
Q

Time-series data

A

are a sequence of observations for a single observational unit
of a specific variable collected over time and at discrete and typically equally spaced
intervals of time, such as daily, weekly, monthly, annually, or quarterly. For example,
the daily closing prices (i.e., the variable) of a particular stock recorded for a given
month constitute time-series data.

25
Panel data
are a mix of time-series and cross-sectional data that are frequently used in financial analysis and modeling. Panel data consist of observations through time on one or more variables for multiple observational units. The observations in panel data are usually organized in a matrix format called a data table.
26
Structured data
are highly organized in a pre-defined manner, usually with repeating patterns. The typical forms of structured data are one-dimensional arrays, such as a time series of a single variable, or two-dimensional data tables, where each column represents a variable or an observation unit and each row contains a set of values for the same columns. Structured data are relatively easy to enter, store, query, and analyze without much manual processing. Examples: Market data, Fundamental data (eg: EPS) and Analytical data (eg: cash flow projections or forecasted earnings growth)
27
Unstructured data,
are data that do not follow any conventionally organized forms. Eg: text, audio, video
28
Unstructured data can be classified into three groups:
from: individuals, business processes or sensors