Unit 3 Flashcards

1
Q

Primary Key

A

field that uniquely identifies a record. ex: student id or employee id

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Foreign Key

A

Field that is in a relational table, that relates to a primary key of another table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Schema

A

Blueprint on how the database is constructed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Big Data

A

Data come from everywhere—including smartphone metadata, internet usage records, social media activity, computer usage records, and countless other data sources—to be sifted for patterns and trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Four Vs of Big Data

A

Volume (amount), Variety (Various forms), Veracity (quality and trustworthiness), Velocity (speed).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data mining/Data discovery

A

examination of huge sets of data to find patterns, connections, outliers and hidden relationships. It is a BI tool used for decision making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Structed data

A

resides in fixed formats, well labeled, easily queried and searched.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

unstructured data

A

is unorganized data(social media/big data) cannot be easily read or or processed by a computer. not stored in rows and columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

semi structured data

A

in between structured and unstructured data, can be read but takes work. example is email.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Big data tools

A

ETL and Hadoop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Extract

A

Once you have determined where your data is coming from and where you want it to reside, you can start extracting. usually comes from CRM or ERP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Transform

A

Once you have extracted data, they need to be transformed to fit into the database table. This may involve removing decimals and dollar signs from financial transactions so it will fit into the structured data table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Load

A

Once data are transformed, they are ready to finally be transferred into the data warehouse and data mart. The more often this is done, the more up-to-date analytic reports can be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Hadoop

A

an infrastructure for storing and processing large sets of data across multiple servers. Instead of centralized files in one place like a data warehouse or data mart, Hadoop uses a distributed file system that allows files to be stored on multiple servers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Which restriction applies to the data in the primary field of a database?

A

The primary key has to be unique

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The data are full of missing, misplaced, or duplicate data, which the data analyst needs to remove.
Which process can this data analyst use to remove such data?

A

Normalization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A data analyst wants to use software to look for useful patterns and hidden relationships in this large set of social media data. Which process can be used to look for these patterns and relationships?

A

Data Mining

18
Q

A business analyst wants to use the social media data to create and present business intelligence. She will create visualizations that will be used by the executive team. Which tool is appropriate for creating and presenting this business intelligence?

A

Tableau

19
Q

scrubbing the data

A

the process of transforming data into an accurate, clean, and error-free form

20
Q

Master Data Management (MDM)

A

is a methodology or process used to define, organize, and manage all the data of an organization that provides a reference for decision-making. Master data management tools can be used to support master data management by removing duplicates, standardizing data, and incorporating rules to prevent incorrect data from entering the system, thus creating an accurate source of master data

21
Q

Data Governance

A

is managing the availability, integrity, and security of the data to ensure that the data remain high quality and valid for data analytics. Policies and procedures are established that define the data governance program, such as who has access, who has update capabilities, when and how backups are made and stored, and who administers the policies to ensure that they are followed

22
Q

Data Management Process

A

can be defined as acquiring data, making sure the data is valid, and then storing and processing the data into useable information for a business

23
Q

data management return on investment

(ROI)

A

value of having the data available to process to make key decisions. ROI is negatively impacted by improper data management

24
Q

Benefits of data management

A

predictive analytics, BI, and data analysis

25
Q

data analytics

A

patterns, correlations, and hidden data relationships.

26
Q

Online analytical processing (OLAP)

A

end users can submit queries against the database to gain insight into data relations such as trend analysis and also to create data models that guide future decisions. A common use of OLAP is the creation of what-if scenarios for budgeting and forecasting.

27
Q

Business intelligence (BI)

A

is a set of software and services that turn data into information that helps leaders in an organization make wise decisions.

28
Q

Topic analytics

A

enables you to sift through large sets of data and identify the most common and most important topics in an easy, fast, and scalable way. For example, if a customer said, “the barista was friendly,” that would be categorized under the topic “Employee Friendliness.”

29
Q

Text analytics

A

sometimes called text mining, is the process of extracting information from written sources such as websites, e-books, and emails and inserting the data into a database to evaluate and interpret relevance or to understand customers’ feedback on products and services.

30
Q

Business analytics

A

attempts to make connections between data so organizations can try to predict future trends that may give them a competitive advantage. Business analytics can also uncover computer system inadequacies within an organization

31
Q

A data analyst wants to search through unstructured data from social media posts to look for useful customer behavior patterns and sentiments. Which type of analytics is appropriate for this task?

A

Text analytics

32
Q

descriptive analytics

A

is the baseline that other types of analytics are built on.

33
Q

predictive analytics

A

attempts to reveal future patterns in a marketplace, essentially trying to predict the future by looking for data correlations between one thing and any other things that pertain to it.

34
Q

=SUM(number1,[number2],…)

A

Adds all the values selected in the argument.

35
Q

=AVERAGE(number1,[number2],…)

A

Calculates the average of all the values selected in the argument.

36
Q

=MAX (number1,[number2],…)

A

Finds the maximum value of all the values selected in the argument.

37
Q

=MIN (number1,[number2],…)

A

Finds the minimum value of all the values selected in the argument.

38
Q

=COUNT(value1,[value2],…)

A

Counts the number of values selected in the argument that contain numbers

39
Q

arguments

number1,[number2],…

A

The argument number1 is required and is the first number, cell reference, or range for which you want to calculate what the functions return. The argument [number2] and the rest are optional (use of square brackets indicates these additional arguments are optional), including any additional numbers, cell references, or ranges that you want to calculate what the functions return.

40
Q

value1,[value2],…

A

The argument value1 is required and is an item, cell reference, or range. The argument [value2] and the rest are optional (use of square brackets indicates these additional arguments are optional), including any additional numbers, cell references, or ranges.

41
Q

Boolean logic

A

is based on only two values, 0 and 1. In spreadsheets, the values 0 and 1 are represented by the values FALSE and TRUE, respectively. Boolean logic and Boolean functions deal with expressions that result in either the value TRUE or the value FALSE. No other outcome is allowed in Boolean formulas or functions.

42
Q

IF function =IF(logical_test,[value_if_true],[value_if_false])

A

logical_test~ A test or logical comparison of value that is either TRUE or FALSE.
[value_if_true] ~Used to specify the result of the IF function if the Boolean expression result is TRUE.
[value_if_false] ~Used to specify the result of the IF function if the Boolean expression is FALSE.