Lecture 1 - Software for Statistics Flashcards

1
Q

Central Processing Unit

A

Controls the computer and executes instructions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Different types of memory

A

Registers - within the CPU, superfast
Random Access Memory - slower
Mass storage - hard disk drives etc, much slower

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How are programming languages classified ?

A

Level of abstraction (low/high)
Speed/Efficiency
Generality
Generations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is difference between low level and high level language?

A

Low level - close to specific hardware

High level - far from the hardware, closer to natural language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What type of language (lower/higher) is faster?

A

Lower languages tend to be faster but this is not always true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

1st Generation

A

Machine language - CPU specific set of instructions.

Relatively few instructions and many of these required to achieve anything useful

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

2nd Generation

A

Assembly language

Human readable version of machine language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

3rd Generation

A

More human friendly, CPU independent language with variables, data, and code structures.
Also object oriented languages such as C++ and Java.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

4th Generation

A

Language designed with a specific application in mind.
Lots of built in capabilities for that application.
Database query languages (SQL), graphical user interface creators, mathematics languages, statistic languages.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

5th Generation

A

Language based on solving problems given the problem specifications.
User does not need to explicitly write the algorithm for solving the problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Interpreted Language

A

Turns each line into machine code as it is entered using an interpreter and run it straight away.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Compiled Language

A

Once code is written and saved into a file, turn it all into machine code in one go using a compiler. Then it can be run.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Difference between Interpreted and Compiled coding languages

A

Interpreted codes provides instant feedback - good for run once short jobs. Tend to be used by 4GL.
Compiled code runs faster - good for jobs that will run many times. Tend to be used by 3GL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What programming language for stats?

A

For simple run once stuff - favourite stats package. (CON: no reproducile trail)
More complex stuff to be run once or few times - 4GL with an interpreter
Production software or where efficiency is important - 3GL and compile or prototype in 4GL then rewrite the slow bits in 3GL.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Software for enormous data sets

A

SAS/ Microsoft R open/ R with Hadoop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Pros and Cons of R

A

Pros:
Contains cutting edge methods
Highly extensible
Free

Cons:
Steep learning curve
Less well supported than a commercial package
Greater tendency to ignore backwards computability.

17
Q

What is an algorithm?

A

An ordered sequence of unambiguous and well defined instructions for performing some task and halting in finite time

18
Q

Important features of an algorithm

A

An ordered sequence
Unambiguous and well defined instructions, each instruction is clear, do-able, and can be done without difficulty
Performs some task algorithm needs to be complete, with nothing left out
Halts in finite time

19
Q

Phase 1 of an R programmer

A

R as a calculator

20
Q

Phase 2 of an R programmer

A

The script
10-15 lines of code strung together to perform a task
Might have a loop, maybe even an if statement.
One or two comments
Not intended for the use of others

21
Q

Phase 3 of an R programmer

A

Modular programming with functions
Undertaking a sufficiently complex analysis that organisation becomes critical
Description of input and outputs are critical to ensure reusability
When performing functional programming there are elements of R language that become useful (e.g. stop( ) )