WEEK 1 Flashcards

1
Q

Have you ever wondered why some online ads seem to make really accurate suggestions or how some websites remember your preferences?

A

Cookies can help inform advertisers about your personal interests and habits based on your online surfing, without personally identifying you.

2
Q

Ways to collect and generate data

A

Forms
Questionnaires
Surveys
Interviews
Cookies

3
Q

Data collection considerations

A

Which data is the right data for solving your business problem
How the data will be collected
Which data sources to use
How much data to collect
The time frame for data collection

4
Q

Population

A

Refers to all possible data values in a certain data set.

5
Q

A sample

A

Is a part of a population that is representative of the population.
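
The relationship between a population and a sample can be sketched in a few lines of Python. This is only an illustration: the population of ages is invented, and simple random sampling is just one common way to aim for a representative sample, not a guarantee of one.

```python
import random
import statistics

# Hypothetical population: ages of 1,000 customers (made-up values).
rng = random.Random(42)  # fixed seed so the sketch is repeatable
population = [rng.randint(18, 80) for _ in range(1000)]

# Simple random sample: every member has the same chance of being picked.
sample = rng.sample(population, k=100)

# A representative sample should have statistics close to the population's.
population_mean = statistics.mean(population)
sample_mean = statistics.mean(sample)
print(f"population mean: {population_mean:.1f}")
print(f"sample mean:     {sample_mean:.1f}")
```

If the two means differ widely, the sample may not represent the population well.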

6
Q

Data sources

A

First-party data is data collected by an individual or group using their own resources.

Second-party data is data collected by a group directly from its audience and then sold.

Third-party data is data collected from outside sources who did not collect it directly. This data might have come from a number of different sources before you investigated it. It might not be as reliable, but that doesn’t mean it can’t be useful.

7
Q

Data formats

A

Qualitative data is usually listed as a name, category, or description. For example, in a spreadsheet of movies, the movie titles and cast members are qualitative data.

Quantitative data can be measured or counted and then expressed as a number.

Discrete data is data that’s counted and has a limited number of values.

Continuous data is measured rather than counted (for example, with a timer), and its value can be shown as a decimal with several places.

Nominal data is a type of qualitative data that’s categorized without a set order. In other words, this data doesn’t have a sequence (for example, Yes or No answers).

Ordinal data, on the other hand, is a type of qualitative data with a set order or scale.

Internal data is data that lives within a company’s own systems.

External data is data that lives and is generated outside of an organization.

Structured data is data that’s organized in a certain format, such as rows and columns.

Unstructured data is data that is not organized in any easily identifiable manner. Audio and video files are examples of unstructured data because there’s no clear way to identify or organize their content.
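
As a rough illustration, several of these formats can appear side by side in one record. The movie record, its field names, and the four-step review scale below are all invented for this sketch.

```python
# Hypothetical movie record; every value here is made up for illustration.
movie = {
    "title": "Example Movie",    # qualitative, nominal: a name with no order
    "won_award": "No",           # qualitative, nominal: Yes or No
    "review": "good",            # qualitative, ordinal: sits on a scale
    "tickets_sold": 1250,        # quantitative, discrete: counted
    "runtime_minutes": 112.75,   # quantitative, continuous: measured
}

# Ordinal data has a set order, so it can be sorted along its scale.
REVIEW_SCALE = ["poor", "fair", "good", "excellent"]  # assumed scale
reviews = ["good", "poor", "excellent", "fair"]
ordered = sorted(reviews, key=REVIEW_SCALE.index)
print(ordered)

# Structured data: the same fields for every record, like rows and columns.
table = [movie, {**movie, "title": "Another Movie", "tickets_sold": 900}]
print([row["title"] for row in table])
```

Nominal values such as the titles have no comparable ordering, which is why only the review values are sorted along a scale here.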

8
Q

Unstructured data

A

Audio files, video files, emails, photos, and social media are all examples of unstructured data.

9
Q

Data Model

A

Is a model used for organizing data elements and how they relate to one another.
Structured data that fits a data model is easy for analysts to enter, query, and analyze whenever they need to.
It also makes data visualization easier, because structured data can be applied directly to charts, graphs, heat maps, dashboards, and most other visual representations of data.

10
Q

Data elements

A

Are pieces of information, such as people’s names, account numbers, and addresses.

11
Q

An unfair dataset

A

Does not accurately represent the population, causing skewed outcomes, low accuracy levels, and unreliable analysis.

12
Q

Data modeling

A

Is the process of creating diagrams that visually represent how data is organized and structured.

13
Q

Levels of data modeling

A

Conceptual: business concepts

Logical: data entities

Physical: physical tables

14
Q

Conceptual data modeling

A

Gives a high-level view of the data structure, such as how data interacts across an organization.
For example, a conceptual data model may be used to define the business requirements for a new database. A conceptual data model doesn’t contain technical details.

15
Q

Logical data modeling

A

Focuses on the technical details of a database such as relationships, attributes, and entities.
For example, a logical data model defines how individual records are uniquely identified in a database. But it doesn’t spell out actual names of database tables. That’s the job of a physical data model.

16
Q

Physical data modeling

A

Depicts how a database operates. A physical data model defines all entities and attributes used; for example, it includes table names, column names, and data types for the database.
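
A physical data model can be sketched with Python’s built-in sqlite3 module. The tables, columns, and types below are hypothetical and chosen only to show the kind of detail a physical model pins down: table names, column names, data types, and how records are uniquely identified.

```python
import sqlite3

# In-memory database, so the sketch leaves no file behind.
conn = sqlite3.connect(":memory:")

# Hypothetical physical model: actual table names, column names, data types.
conn.executescript("""
CREATE TABLE customer (
    customer_id INTEGER PRIMARY KEY,  -- uniquely identifies each record
    name        TEXT NOT NULL
);
CREATE TABLE account (
    account_number INTEGER PRIMARY KEY,
    customer_id    INTEGER NOT NULL REFERENCES customer(customer_id),
    balance        REAL NOT NULL
);
""")

# PRAGMA table_info returns one row per column: (cid, name, type, ...).
columns = conn.execute("PRAGMA table_info(account)").fetchall()
column_types = {row[1]: row[2] for row in columns}
print(column_types)
conn.close()
```

The relationship between customer and account would already appear in a logical model; the concrete names and types are what the physical level adds.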

17
Q

Data-modeling techniques

A

Entity Relationship Diagrams (ERDs) are a visual way to understand the relationships between entities in the data model.

Unified Modeling Language (UML) diagrams are very detailed diagrams that describe the structure of a system by showing the system’s entities, attributes, operations, and their relationships.

18
Q

Data modeling usefulness

A

Data modeling can help you explore the high-level details of your data and how it is related across the organization’s information systems.

Data modeling sometimes requires data analysis to understand how the data is put together; that way, you know how to map the data.

Data models make it easier for everyone in your organization to understand and collaborate with you on your data.

19
Q

A data type

A

A data type tells you what kind of data you’re working with.

20
Q

A data type in a spreadsheet can be one of three things:

A

A number, text (a string), or a Boolean.
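
The three spreadsheet data types map loosely onto Python’s built-in types. The helper below is a hypothetical sketch of that mapping, not how any particular spreadsheet program decides a cell’s type.

```python
def spreadsheet_type(value):
    """Classify a value as one of the three spreadsheet data types."""
    # Check bool first: in Python, bool is a subclass of int.
    if isinstance(value, bool):
        return "Boolean"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "text"
    return "other"  # anything that does not fit the three categories


for cell in [42, 3.14, "hello", True]:
    print(repr(cell), "->", spreadsheet_type(cell))
```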

21
Q

Difference between records and fields and rows and columns.

A

Records and fields can be used for any kind of data table, while rows and columns are usually reserved for spreadsheets.

22
Q

Rows and columns are equivalent to

A

Records and fields

23
Q

Long data

A

Is data in which each row is one time point per subject, so each subject will have data in multiple rows.

24
Q

Wide data

A

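Is data in which each subject has a single row, with multiple columns holding the values of its attributes. It lets you easily identify and quickly compare different columns.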
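
Long and wide layouts hold the same values arranged differently. The sketch below reshapes a small long table into a wide one using plain Python; the subjects, years, and values are invented for illustration.

```python
# Long data: one row per subject per time point (hypothetical values).
long_rows = [
    {"subject": "A", "year": 2020, "value": 10},
    {"subject": "A", "year": 2021, "value": 12},
    {"subject": "B", "year": 2020, "value": 7},
    {"subject": "B", "year": 2021, "value": 9},
]

# Wide data: one row per subject, with one column per time point.
wide = {}
for row in long_rows:
    wide.setdefault(row["subject"], {})[row["year"]] = row["value"]

for subject, values_by_year in wide.items():
    print(subject, values_by_year)
```

In the wide form each subject appears once, so comparing the 2020 and 2021 columns is a matter of reading across a single row.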