Eksamen Flashcards

1
Q

What contributes BA to?

A

Company strategy. Core compenticies, Predict, internal processes and understand customers. Relation between BA and decision makers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the goal of BA?

A

To implement it as a strategic ressource. Think it on every level, and not something you do on the side.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the three steps of analytics?

A

Deskriptiv - What is there?
Predictive - what will happen?
Preskriptiv - What we need to do to optimize?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which 3 skills do we need as a data scientist?

A

Hacking - programming
Math - statistician and math
Substantive expertise - viden om emneområdet

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between data analyst and business analyst?

A

Business analyst is a bit more pragmatic, and have more domain knowledge.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is machine learning?

A

They learn without saying exactly how to. Learn by experience.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between supervised learning an unsupervised learning?

A

Supervised learning has a predefined relation between input and for example regression. Good for purpose pull.

Unsupervised learning do not where it ends. Can be clustering. Good for Datapush

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe the data-purpose paradox?

A

Either you have the data or the purpose first.

Datapush - you have the data first, no purpose. Unsupervised learning, clustering.

Purpose pull - you have the purpose, but no data. Supervised learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe the linear regression

A

Overall use:

Next step after correlation. Used for predictions, and relationship between variables.

Dependent variable - the ones that get affected by the others. For example grade.

Independe variable - The ones that affect the others. For example study hours.

Relationship between the variables can be either positive or negative.

Continous outcome, so it can be everything, price, age and so on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the logistic regression.

A

Same as linear regression, but has a binary outcome, so it can be used to categorise. Do you pass the exam, should we invest or not?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe a decision tree

A

Decisionmaking, and binary outcome. If you do this, then this can happen. Predictions. See how big the probability is that the one will happen and the other ting. Make general rules for next time doing something.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe regression tree

A

Same as decision tree but continuous outcome. Predictions and what will happen if you do this, and this. General rules.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe neural network.

A

Alghoritms that is trained like a human brain. Can be used to predict what a image is, and predict if it is a triangle or a square.

It needs to be trained, so it will become better and better.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe support vector machine

A

Good for classifying. You find the maximal margin classifier, but that does not allow for misclassification. If there is misclassification and, you find a soft margin classifier, if there is misclassification. You find that by using cross validation, that compares the diffrent soft margin classifiers.

But sometimes the dataset is not possible to make a linear line, and you move it up to another dimension so you can make a linear line. You use a so-called kernel function, which there is many different of. Then you can make a linear line.

Effective when the dataset is not linear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the different measuring scale elements?

A

Nominal - just categories without able to compare. blue green

Ordinal - you can compare, but not exactly in which degree. Good, very good.

Interval - You can compare, but there is no 0. For example weather, you can’t say that 20 is double as hot as 10.

Ratio - weight. You can compare directly. The best for statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explain mean, median and standard deviation

A

Mean - average
Median - Mid number
Standard deviation - how far is a number for the mean.

17
Q

Explain covariance and correlation

A

Both of them explain the relationship between two variables. Covariance is best when the numbers are on the same scale, and can be hard to compare.

Correlation can only go from -1 to 1. Easier to compare to different versions.

18
Q

Explain sample from a distribution

A

You want to say something broader of a population based on a sample.

Can help you to verify your statistics.

19
Q

Explain statistical significance

A

Whether you can say your research is true. Often a p value on 5 %. Says that it is under 5 %, that the result you will have happened by a coincidence.

20
Q

Which ML methods for classification?

A

K-nearest neighbour, support vector machine and logistic regression.

21
Q

Which ML methods for continuous outcome?

A

Linear regression, regression tree

22
Q

Which issues can you have with digital data and analytics?

A

Pricing model - if you have many customers, high prices. uber

Customer classification - if they shop baby clothe, you can say she is pregnant

Surveillance - monitor people like china.

23
Q

Positive forces with digital data

A

Citizens can become powerful

Alghoritms can sometimes make better decisions that human.

24
Q

Which ethical issues can there be with digital data?

A

Legal - discirimated gender or race.
Homogenization of workspace - hire to many of the same people based om AI.
Impaxt on business partners - the trust can be less.

25
Q

Which biased can there be?

A

Bias agaings first information. Altid bias mod den første information, den er mere stærk end de næste. Kan skabe bias.

Want the status quo to stay. Firmaer er bange for forandringer. Kan være svært for folk.

You want to justify your choices. Man vil gerne justify sine tidligere choices.

You use your own instinct. Du bruger dit eget instinkt, og går efter noget du tror der skal være

You use your own techniques. Man bruger sine egne problme solve metoder, “man kan ikke bruge en hammer til alt”

26
Q

Which 3 elements for Gartner 2019 to managing ethical issues

A

Empathy
Responsibility
Competences

27
Q

The role of a business analytics manager, business analyst and ITKO

A

Business analyst - Business and technical knowledge. Analyse data to make suggestions to company strategy to suggest to manager.
Manager - managing skills and domain knowledge. Understand the data.
ITKO - connectors and communicate with the diffrent areas.

28
Q

Examples of datadriven companies.

A

Netflix - recommandation based on machine learning.

Amazon - use data to predict market demand.

29
Q

Hvordan aflæser du en regression i R?

A

Call - hente variable. Den første er den afhængige, de andre uafhængige.

Under estimate kan du se relationen til den afhængige.

Residuals er den asftand til hvert datapunkt.

Significance . under 0,05 * under 0.001 ***

30
Q

Binary outcomes presentation

A

SVM - Decision tree - Logistic regression - K-neares neighbour.