Week 2: Business Analysis Tools Flashcards

- Learn different business analytics techniques. - Understand the importance of spreadsheet tools for business analytics. - Understand the importance of programming tools in business analytics.

1
Q

Big Data Analytics (examples)

A

Data mining, clustering, regression, classification, association analysis, decision tree, neural networks,
statistical analysis, optimization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Text analytics (examples)

A

Information retrieval, text summarization, sentiment analysis, topic modeling, thematic analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Web/mobile analytics (examples)

A

Web information retrieval, search systems, web crawling, website ranking, search log analysis, smartphone platforms, mobile advertising and marketing, gamification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Social media analytics (examples)

A

Content-based analytics, Structure-based analytics (or social network analytics), community detection, social influence analysis, Link prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Multimedia analytics (examples)

A

Audio analytics, speech analytics, video analytics, automatic video indexing and retrieval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

3 main approaches to business analytics tools

A
  1. Spreadsheet tools
  2. Programming tools
  3. Proprietary business analytics solutions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Spreadsheet tool (definition)

A

Interactive software application for structuring, transforming, analyzing, and storing data in rows and columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Tabular data (definition)

A

Data that has rows and columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Comma-separated value [CSV] file

A

Tabular spreadsheet data that uses commas to separate lines and new lines to separate records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Comma-separated value [CSV] file

A

Tabular spreadsheet data that uses tabs to separate lines and new lines to separate records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Examples of data cleaning in spreadsheets

A
  • Search and replace
  • Sorting and filtering
  • Built-in functions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Programming (definition)

A

process of solving a problem using computer algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Programming language (definition)

A

formal set of instructions that can be used to produce various kinds of output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Open-source programming tools (definition)

A

programming tools that are made
freely available, often developed by and for the community

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Programming code (definition)

A

collection of statements written in a particular programming language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Pros of Excel

A
  • Highly accessible functions easily implemented at the point of contact with data
  • Functions built into spreadsheet environment (easier to implement)
  • Experience with Excel is more comment
17
Q

Cons of Excel

A
  • Functionality limited for advanced statistics
  • Limited number of rows and columns on a worksheet
  • Limited number of characters in one cell
18
Q

Pros of R [programming language]

A
  • Good for data-oriented projects
  • Handles very large datasets (big data)
  • Large number of ready-made packages
  • Data visualization tools built-in
  • Developed by data scientists
  • Large community support
  • Supported by RStudio (integrated development environment) that lacks good competitors and and has no Python equivalent
19
Q

Cons of R [programming language]

A
  • Steep learning curve
  • Less efficient for general computations
  • Some inefficiently written packages
20
Q

Pros of Python

A
  • Growing community of compsci software engineers and programmers
  • More opportunities to take advantage of AI
  • Flexible
  • data analysis can be integrated with website and mobile apps or production database
  • Can do other programming tasks besides data analysis
21
Q

Cons of Python

A
  • Less efficient for statistical computations
  • Less visually-appealing data visualization tool
  • Fewer packages
22
Q

Types of variables

A

Logical, integer, numeric, and character

23
Q

Logical variable (definition)

A

contains only two possible values: TRUE or FALSE
(indicator variable or dummy variable)

24
Q

Integer variable (definition)

A

contains numbers without decimal points

25
Q

Numeric variables (definition)

A

contains numbers with decimal points

26
Q

Character variables (definition)

A

contains words that do not have order or numerical meaning
(string variable or text variable)