Learning curve Flashcards
rule of 10 000 hours of practice
idea that after 10 000 hours of practice you become an expert in the field
no empirical data supports this claim
it has actually been refuted
descriptive models
they only fit the data - the parameters tell us nothing about the underlying cognition
cognitive models
parameters actually mean something -> they reflect an underlying cognitive mechanism, derived from psychological theory
exponential model
P = 1 - exp(-u*t)
where
P - performance scaled between 0 and 1 (proportion correct)
t - trial number
u - learning rate
what are characteristics of exponential model?
performance improves very quickly at the beginning, then plateaus
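The fast-start-then-plateau shape is easy to check numerically; a minimal sketch of the exponential model (the learning rate u = 0.3 is an illustrative value, not from the source):

```python
import numpy as np

def exp_learning(t, u):
    """Exponential model of practice: P = 1 - exp(-u * t)."""
    return 1.0 - np.exp(-u * t)

trials = np.arange(21)
P = exp_learning(trials, u=0.3)
gains = np.diff(P)  # improvement from one trial to the next
# rapid early learning, then a plateau: each gain is smaller than the last
print(bool(np.all(gains[1:] < gains[:-1])))  # prints True
```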
fitting model to the data
estimate model parameters given the data
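One common way to estimate the parameters is least-squares fitting, e.g. with `scipy.optimize.curve_fit`; a sketch with simulated data (the true u = 0.3 and noise level are assumed for illustration):

```python
import numpy as np
from scipy.optimize import curve_fit

def exp_model(t, u):
    """Exponential learning model: P = 1 - exp(-u * t)."""
    return 1 - np.exp(-u * t)

# simulate noisy performance data from a known learning rate
rng = np.random.default_rng(0)
t = np.arange(1, 26)
observed = exp_model(t, 0.3) + rng.normal(0, 0.03, size=t.size)

# recover the learning rate from the noisy observations
(u_hat,), _ = curve_fit(exp_model, t, observed, p0=[0.1])
print(round(float(u_hat), 2))  # estimate should land near the true u
```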
concave models of the law of practice
both power and exponential functions are concave -> decelerating curves (gains shrink with practice)!
hyperbolic function
P = t/(t+d)
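The hyperbolic function is concave as well; a minimal sketch (d = 2 is an arbitrary illustrative value) showing that its trial-to-trial gains also shrink:

```python
def hyperbolic(t, d):
    """Hyperbolic learning curve: P = t / (t + d); d controls how fast P rises."""
    return t / (t + d)

# like the power and exponential curves, gains decrease trial by trial
gains = [hyperbolic(t + 1, 2) - hyperbolic(t, 2) for t in range(5)]
print([round(g, 3) for g in gains])
```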
What is Gaussian noise?
type of statistical noise whose probability density function (PDF) follows the normal distribution
can be simulated with scipy.stats.norm.rvs or numpy.random.normal
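A minimal sketch using numpy (mean 0 and sd 0.05 are illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(seed=0)
# 10 000 draws of Gaussian noise with mean 0 and standard deviation 0.05
noise = rng.normal(loc=0.0, scale=0.05, size=10_000)
# sample statistics should be close to the requested parameters
print(round(float(noise.mean()), 3), round(float(noise.std()), 3))
```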
What did Estes assume about the exponential law of practice?
the change in performance over time depends on the performance yet to be achieved - the elements still to be learned
dP/dt = u(P max - P)
dP/dt = changing performance over time
P max - maximum performance
P - current performance
u - learning rate
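Estes' differential equation can be checked numerically: integrating dP/dt = u(Pmax - P) with a simple Euler step reproduces the closed-form exponential curve P(t) = Pmax(1 - exp(-ut)) for P(0) = 0 (parameter values are illustrative):

```python
import numpy as np

u, P_max, dt = 0.3, 1.0, 0.001
ts = np.arange(0, 10, dt)
P = np.zeros_like(ts)
# Euler integration of dP/dt = u * (P_max - P), starting from P(0) = 0
for i in range(1, len(ts)):
    P[i] = P[i - 1] + dt * u * (P_max - P[i - 1])

# closed-form solution of the same ODE
closed = P_max * (1 - np.exp(-u * ts))
print(float(np.max(np.abs(P - closed))) < 1e-3)  # prints True
```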
What is an alternative to concave models?
an s-shaped learning function!
when should you use concave exponential function?
P = 1 - exp(-ut)
while learning single words (items)
when should you use compound exponential function?
P = (1 - exp(-ut))**c
when one has to learn sets of c words (fragments)
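Raising the exponential to the power c is what produces the s-shape: early gains first accelerate, then decelerate. A sketch (u = 0.2 and c = 5 are illustrative values):

```python
import numpy as np

def compound_exp(t, u, c):
    """Compound exponential: P = (1 - exp(-u * t)) ** c, s-shaped for c > 1."""
    return (1.0 - np.exp(-u * t)) ** c

t = np.arange(30)
P = compound_exp(t, u=0.2, c=5)
gains = np.diff(P)
# s-shape: the biggest per-trial gain is NOT at the start (unlike concave models)
print(int(gains.argmax()))
```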
maximum likelihood function
used to estimate the parameters of a probability distribution by maximizing the likelihood function, so that under the assumed statistical model the observed data are most probable
in short: given the model, find parameters for which data are most probable
probability
Prob(data | model, parameters)
data - people who pick option x
given
model - number of people asked
parameters - probability of picking option x
likelihood
Likelihood(parameters | model, data)
parameters - probability of picking option x
given
model - number of people asked
data - people who pick option x
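The probability/likelihood distinction can be sketched with a binomial model (the numbers - 7 of 10 people picking option x - are made up for illustration):

```python
import math

def binom_pmf(k, n, p):
    """Probability of k successes in n trials, success probability p."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

# probability: parameters fixed (p = 0.5), data varies
# Prob(data | model: n = 10 asked, parameter p)
prob = binom_pmf(7, 10, 0.5)

# likelihood: data fixed (7 of 10 picked x), parameter p varies
grid = [i / 100 for i in range(1, 100)]
best_p = max(grid, key=lambda p: binom_pmf(7, 10, p))
print(best_p)  # the maximum likelihood estimate, 7/10
```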
what is log likelihood? why is it preferred for maximum likelihood calculations?
natural logarithm of the likelihood function
-> it turns products into sums, making complex likelihood functions easier to deal with
-> it avoids numerical underflow: multiplying many small probabilities quickly rounds to zero, while summing their logs stays finite
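The underflow problem is easy to demonstrate: a product of 100 small probabilities collapses to exactly 0.0 in floating point, while the log-likelihood stays finite (the probability value 1e-5 is an arbitrary illustration):

```python
import math

probs = [1e-5] * 100  # 100 tiny independent probabilities

# naive product: underflows below the smallest representable float
product = 1.0
for p in probs:
    product *= p
print(product)  # prints 0.0

# log-likelihood: sum of logs remains a perfectly usable number
log_lik = sum(math.log(p) for p in probs)
print(log_lik)
```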
How to use optimization to find maximum likelihood estimate?
the idea is that you want to find the optimum (max or min) - similar to finding the deepest point in a lake
you can use
derivatives = give you local information about the slope or the direction of the function at a given point
positive derivative = function is increasing in this direction
negative derivative = function is decreasing in this direction
concavity (2nd derivative) = tells you whether you are in a convex region (bowl) or a concave region (hill)
- helps to get an idea how near you are to a minimum/maximum
What is a local optimum?
an illusory ‘‘deepest point of the lake’’ - lower than its surroundings, but not the lowest point in the lake
you can use special algorithms like simulated annealing or genetic algorithms to avoid getting stuck in one
What can we do instead of maximizing the likelihood?
You can minimize! -> then you minimize NEGATIVE log-likelihood
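A sketch tying the pieces together: fit the learning rate u of the exponential model by minimizing the negative log-likelihood with `scipy.optimize.minimize_scalar` (the simulated data, true u = 0.25, and Gaussian residuals with known sigma are assumptions for illustration):

```python
import numpy as np
from scipy.optimize import minimize_scalar

# simulate learning data from a known curve plus Gaussian noise
rng = np.random.default_rng(1)
true_u, sigma = 0.25, 0.05
t = np.arange(1, 31)
data = 1 - np.exp(-true_u * t) + rng.normal(0, sigma, size=t.size)

def nll(u):
    """Negative log-likelihood, assuming Gaussian residuals with sd sigma."""
    resid = data - (1 - np.exp(-u * t))
    return 0.5 * np.sum(resid**2) / sigma**2 \
        + t.size * np.log(sigma * np.sqrt(2 * np.pi))

# minimizing the NEGATIVE log-likelihood = maximizing the likelihood
res = minimize_scalar(nll, bounds=(0.01, 2.0), method="bounded")
print(round(float(res.x), 2))  # estimated learning rate, near true_u
```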
multiple regression
y = 3 - 0.2 x + 0.5 w
- what is B0?
3 = intercept!
baseline value of DV y when both x and w are zero
multiple regression y = 3 - 0.2 x + 0.5 w
- what is B1?
-0.2 = slope/effect of predictor x on y
how much y changes when x changes by one unit (here y decreases by 0.2)
multiple regression y = 3 - 0.2 x + 0.5 w
- what is B2?
0.5 = slope/effect of predictor w on y
quantifies how much y changes when w changes by one unit
what is sigma?
standard deviation of the residuals
the likelihood assumes that residuals (errors) follow a normal distribution with mean 0 and standard deviation sigma
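A sketch recovering the regression coefficients B0 = 3, B1 = -0.2, B2 = 0.5 and the residual sd sigma from simulated data (sample size and sigma = 1 are illustrative assumptions):

```python
import numpy as np

# simulate data from y = 3 - 0.2*x + 0.5*w + normal noise
rng = np.random.default_rng(0)
n, sigma = 500, 1.0
x = rng.normal(size=n)
w = rng.normal(size=n)
y = 3 - 0.2 * x + 0.5 * w + rng.normal(0, sigma, size=n)

# design matrix with an intercept column for B0
X = np.column_stack([np.ones(n), x, w])
beta, _, _, _ = np.linalg.lstsq(X, y, rcond=None)

# residual standard deviation (3 parameters were estimated)
resid = y - X @ beta
sigma_hat = resid.std(ddof=3)
print(np.round(beta, 2), round(float(sigma_hat), 2))
```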