large data set Flashcards

1
Q

what are the 5 UK locations

A

Leuchars - Scottish coast

Leeming - North Yorkshire

Heathrow - Greater London

Hurn - South West England

Camborne - Cornwall

2
Q

what are the 3 international locations

A

Beijing, Perth, Jacksonville

3
Q

what time periods does the large data set cover

A

May to October 1987

May to October 2015

4
Q

what are the large data set variables

A

daily mean air temp

daily total rainfall

daily total sunshine

daily maximum relative humidity

daily mean windspeed and direction

daily maximum gust and direction

cloud cover

daily mean visibility

daily mean pressure

5
Q

daily mean air temp

A

Celsius (°C), averaged over the 24 hours from 9am to 9am

6
Q

daily total rainfall

A

mm, for the 24 hours starting at 9am; "tr" (trace) means less than 0.05 mm

7
Q

daily total sunshine

A

hours

8
Q

daily maximum relative humidity

A

%; readings above 95% are associated with mist or fog

9
Q

daily mean windspeed and direction

A

knots, described using the Beaufort scale (calm, light, etc.); direction is given as a cardinal direction (north, south, east, west)

10
Q

daily maximum gust and direction

A

knots; the maximum instantaneous wind speed recorded over 24 hours

11
Q

cloud cover

A

oktas (eighths of the sky covered)

12
Q

daily mean visibility

A

decametres (1 decametre = 10 m), measured horizontally

13
Q

daily mean pressure

A

hectopascals

14
Q

what is unknown for the first half of May 1987 for the UK locations

A

daily total sunshine, mean windspeed and max gust

15
Q

what do the international cities contain data for

A

mean temp, rainfall, pressure, windspeed

16
Q

which locations are near a coast

A

Jacksonville, Perth, Camborne, Hurn, Leuchars

17
Q

which location is in the southern hemisphere

A

Perth

18
Q

what variable is discrete

A

cloud cover

19
Q

what should you replace tr with

A

0 or 0.025
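
A minimal sketch of the "tr" replacement, assuming the rainfall column arrives as a list of strings. The function name `clean_rainfall` and the sample readings are made up for illustration and are not taken from the large data set.

```python
def clean_rainfall(values, trace_value=0.025):
    """Convert raw rainfall entries to floats, replacing 'tr' (trace).

    trace_value is the stand-in for a trace reading: 0.025 is the midpoint
    of the interval 0 to 0.05 mm, and 0 is the other common choice.
    """
    cleaned = []
    for raw in values:
        text = str(raw).strip().lower()
        if text == "tr":
            cleaned.append(trace_value)
        else:
            cleaned.append(float(text))
    return cleaned


# Hypothetical readings in mm (not real LDS values)
sample = ["0.0", "tr", "1.2", "tr", "5.4"]
print(clean_rainfall(sample))
print(clean_rainfall(sample, trace_value=0.0))
```

Whichever value is chosen, it is worth stating it alongside any mean or total that is calculated from the cleaned data.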

20
Q

what happened on 15-16 October 1987

A

the Great Storm

high wind speeds

southern England affected

can skew wind/gust/rainfall data

21
Q

how many days does the large data set cover
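
Assuming each period runs from 1 May to 31 October inclusive (the cards only say "May to October"), a quick count of the days in each year could be sketched like this. The helper name `days_inclusive` is made up for illustration.

```python
from datetime import date


def days_inclusive(start, end):
    """Number of calendar days from start to end, counting both ends."""
    return (end - start).days + 1


# Assumed period boundaries: 1 May to 31 October in each year
days_1987 = days_inclusive(date(1987, 5, 1), date(1987, 10, 31))
days_2015 = days_inclusive(date(2015, 5, 1), date(2015, 10, 31))
print(days_1987, days_2015)
```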

22
Q

what is a census

A

collects data about all members of a population

23
Q

advantage of census

A

fully accurate results

24
Q

disadvantage of census

A

time consuming, expensive

25
what is sampling
collecting data from a subset of the population
26
what is simple random sampling
every possible sample of a given size has an equal probability of being selected; uniquely number every member of the population and pick numbers using a random number generator
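
A minimal sketch of simple random sampling using Python's standard `random` module. The function name `simple_random_sample` and the list of day labels are illustrative assumptions.

```python
import random


def simple_random_sample(population, n, seed=None):
    """Pick n distinct members, each equally likely to be chosen."""
    rng = random.Random(seed)
    # Number every member 0..N-1, then draw n distinct numbers at random
    chosen_numbers = rng.sample(range(len(population)), n)
    return [population[i] for i in chosen_numbers]


# Hypothetical sampling frame: 184 labelled days
days = [f"day {i}" for i in range(1, 185)]
print(simple_random_sample(days, 5, seed=1))
```

Passing a seed only makes the draw repeatable for checking; leave it as `None` for a fresh sample each time.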
27
what is systematic sampling
choose members of a population at regular intervals from an ordered list: take every kth member, where k = (size of population)/(size of sample), starting from a randomly chosen member among the first k
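
A sketch of systematic sampling, assuming the population is already held in an ordered list and that a random starting point within the first interval is wanted. The function name `systematic_sample` is made up for illustration.

```python
import random


def systematic_sample(population, n, seed=None):
    """Take every kth member, where k = population size // sample size."""
    k = len(population) // n          # sampling interval
    rng = random.Random(seed)
    start = rng.randrange(k)          # random start within the first k members
    return [population[start + i * k] for i in range(n)]


# Hypothetical frame of 184 numbered days, sample of 23, so k = 8
frame = list(range(1, 185))
print(systematic_sample(frame, 23, seed=3))
```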
28
what is stratified sampling
population divided into groups (strata) and a random sample taken from each group; the proportion taken from each group reflects that group's share of the population
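
A sketch of the proportional allocation in stratified sampling. The strata names and sizes are invented, and rounding each share to the nearest whole number is an assumption: with awkward proportions the rounded sizes may not add up to exactly n.

```python
import random


def stratified_sample(strata, n, seed=None):
    """Randomly sample from each stratum in proportion to its size."""
    rng = random.Random(seed)
    total = sum(len(members) for members in strata.values())
    sample = {}
    for name, members in strata.items():
        # Stratum sample size = (stratum size / population size) * n
        size = round(n * len(members) / total)
        sample[name] = rng.sample(members, size)
    return sample


# Hypothetical population: 120 in one group, 80 in another
strata = {"year 12": list(range(120)), "year 13": list(range(80))}
picked = stratified_sample(strata, 20, seed=0)
print({name: len(members) for name, members in picked.items()})
# → {'year 12': 12, 'year 13': 8}
```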
29
what is quota sampling
population split into groups and members chosen until the quota for each group is filled (not random)
30
what is opportunity sampling
sample formed using available members at time of study who fit criteria
31
pros of systematic sampling
simple, quick, suitable for large samples/populations
32
disadvantages for systematic sampling
sampling frame needed, bias can be introduced if the sampling frame is not in random order
33
pros for stratified sampling
accurately reflects population structure
34
disadvantages for stratified sampling
population must be clearly classified into groups (strata); within each stratum the disadvantages of simple random sampling still apply
35
pros of simple random sampling
free of bias, cheap
36
disadvantages of simple random sampling
not suitable for large samples, sampling frame needed
37
pros of quota sampling
allows a small sample to represent the population, no sampling frame needed, easy, quick, cheap
38
disadvantages of quota sampling
non-random so can introduce bias, population must be divided into groups, non-responses not recorded
39
pros of convenience (opportunity) sampling
easy, inexpensive
40
is the binomial distribution continuous or discrete
discrete
41
is the normal distribution continuous or discrete
continuous
42
convert P(X = a) in discrete distribution to continuous
P(a - 0.5 < X < a + 0.5)
43
convert P(X < a) in discrete to continuous
P(X < a - 0.5)
44
convert P(X>a) in discrete to continuous
P(X>a+0.5)
45
convert P( X <= a) in discrete to continuous
P(X <= a + 0.5)
46
convert P(X >= a) in discrete to continuous
P(X >= a - 0.5)
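
The continuity corrections above can be checked numerically. This sketch compares an exact binomial probability with its normal approximation, with and without the 0.5 correction. The values n = 50, p = 0.4 and a = 22 are arbitrary choices for illustration, and `binomial_cdf` is a helper written here rather than a library function.

```python
from math import comb, sqrt
from statistics import NormalDist


def binomial_cdf(k, n, p):
    """Exact P(X <= k) for X ~ B(n, p)."""
    return sum(comb(n, r) * p**r * (1 - p) ** (n - r) for r in range(k + 1))


n, p, a = 50, 0.4, 22  # arbitrary illustrative values

# Approximating normal: mean np, variance np(1 - p)
approx = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p)))

exact = binomial_cdf(a, n, p)        # P(X <= a), discrete
corrected = approx.cdf(a + 0.5)      # P(Y <= a + 0.5), with correction
uncorrected = approx.cdf(a)          # P(Y <= a), no correction

print(round(exact, 4), round(corrected, 4), round(uncorrected, 4))
```

For these values the corrected figure sits much closer to the exact binomial probability than the uncorrected one.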
47
if x is coded with y = ax + b, what are the mean and SD of x
mean of x = ((mean of y) - b) / a
SD of x = (SD of y) / a (taking a > 0)
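
A quick numerical check of the decoding rules, using Python's `statistics` module. The raw values and the coding constants a and b are made up for illustration, and a is taken to be positive.

```python
from statistics import mean, pstdev

# Hypothetical raw data and coding y = a*x + b (with a > 0)
x = [12.0, 15.0, 9.0, 20.0, 14.0]
a, b = 0.5, -3.0
y = [a * xi + b for xi in x]

# Decode: undo the shift and the scaling for the mean,
# and only the scaling for the standard deviation
mean_x = (mean(y) - b) / a
sd_x = pstdev(y) / a

print(mean_x, mean(x))
print(sd_x, pstdev(x))
```

The shift b drops out of the SD because adding a constant moves every value by the same amount and leaves the spread unchanged.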