Session 8 - Advanced Programming Techniques Flashcards

Question

`os.listdir()` includes hidden files, which start with a... Hidden files may.... You may need to filter... - (3)

Answer 1

with a dot (e.g., `.DS_Store`) - Hidden files may not be useful and can clutter the list. - You may need to filter out hidden files from the list returned by `os.listdir()`
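
One way to filter out hidden files is sketched below. The temporary directory and the file names are made up for illustration; only `os.listdir()` comes from the card above.

```python
import os
import tempfile

# Build a throwaway directory so the example does not depend on real files.
# The file names below are hypothetical.
with tempfile.TemporaryDirectory() as tmp_dir:
    for name in [".DS_Store", "notes.txt", "script.py"]:
        open(os.path.join(tmp_dir, name), "w").close()

    # os.listdir() returns every entry, hidden or not.
    all_entries = os.listdir(tmp_dir)

    # Keep only the names that do not start with a dot.
    visible_entries = sorted(n for n in all_entries if not n.startswith("."))

print(visible_entries)
```

The list comprehension keeps the filtering to a single line; a plain for loop with an `if` would work equally well.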

Answer 2

glob function from glob module

Answer 3

It is short for 'global pattern match'

Answer 4

find files and directories matching a specific pattern.

Answer 5

search for strings that match certain patterns.

Answer 6

- Importing the glob function is achieved with `from glob import glob`. - `filelist = glob('/content/pin-material/*.jpg')` finds all .jpg files in the 'pin-material' directory. - `print(filelist)` displays the list of .jpg files found. - `pyFiles= glob('/content/pin-material/*.py')` finds all Python script files. - `print(sorted(pyFiles))` prints the Python script files as a sorted list - in ascending order

Answer 7

- When provided with the full path as an argument, glob returns a list of full paths.
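
A minimal sketch of this behaviour, assuming a temporary directory with made-up file names in place of '/content/pin-material':

```python
import os
import tempfile
from glob import glob

with tempfile.TemporaryDirectory() as tmp_dir:
    # Hypothetical files, created only for this demonstration.
    for name in ["fft_bw.jpg", "fft_colour.jpg", "README.md"]:
        open(os.path.join(tmp_dir, name), "w").close()

    # The pattern includes the directory, so every result includes it too.
    # glob does not promise any ordering, so sort explicitly.
    jpg_paths = sorted(glob(os.path.join(tmp_dir, "*.jpg")))

    # Strip the directory part if only the file names are wanted.
    jpg_names = [os.path.basename(p) for p in jpg_paths]

print(jpg_paths)
print(jpg_names)
```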

Answer 8

each one in turn

Answer 9

Wildcard characters are special symbols used in glob patterns to match filenames or paths.

Answer 10

1. * (an asterisk) 2. ? (a question mark) 3. [1234] a list of characters 4. [1-9] a range of characters

Answer 11

- It matches any set of characters, including no characters at all. - For example, 'file*.txt' matches 'file.txt', 'file123.txt',

Answer 12

- It matches any single character. - For example, 'file?.txt' matches 'file1.txt', 'fileA.txt', but not 'file12.txt'.

Answer 13

- '[1234]' is a wildcard character in glob that matches any single character from the list [1234]. - For example, 'file[1234].txt' matches 'file1.txt', 'file2.txt', but not 'file5.txt'.

Answer 14

- '[1-9]' is a wildcard character in glob that matches any single character in the range from 1 to 9. - For example, 'file[1-9].txt' matches 'file1.txt', 'file2.txt', but not 'file10.txt'.
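
The four wildcards can be tried out without touching the filesystem by using the standard-library `fnmatch` module, which applies the same pattern rules that glob uses for file names. The list of names is invented for the example, and `matching` is a hypothetical helper.

```python
from fnmatch import fnmatchcase

# Hypothetical file names to test the patterns against.
names = ["file.txt", "file1.txt", "file5.txt", "file12.txt", "fileA.txt"]


def matching(pattern):
    """Return the names that match a glob-style pattern (case-sensitive)."""
    return [n for n in names if fnmatchcase(n, pattern)]


star = matching("file*.txt")            # any run of characters, even none
question = matching("file?.txt")        # exactly one character
char_list = matching("file[1234].txt")  # one character from the list
char_range = matching("file[1-9].txt")  # one character from the range

print(star)
print(question)
print(char_list)
print(char_range)
```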

Answer 15

- The glob pattern '/content/pin-material/fft*' matches all files in the '/content/pin-material' directory that start with 'fft'. - From the given list of files, 'fft_colour.jpg' and 'fft_bw.jpg' match the pattern. - Because the pattern includes the directory, glob('/content/pin-material/fft*') would return the full paths: ['/content/pin-material/fft_colour.jpg', '/content/pin-material/fft_bw.jpg'] (the order of the results is not guaranteed).

Answer 16

- The glob pattern '/content/pin-material/*md' matches all files in the '/content/pin-material' directory that end with 'md'. - The * wildcard matches any set of characters, so the pattern finds every file whose name ends with 'md'. - From the given list of files, 'README.md' matches the pattern. - Because the pattern includes the directory, glob('/content/pin-material/*md') would return the full path: ['/content/pin-material/README.md'].

Answer 17

- The glob pattern '/content/pin-material/pop?_*' utilizes two wildcard characters: '?' and '*'. - '?' matches any single character, allowing for flexibility in matching filenames. - '*' matches any set of characters, including no characters at all. - Therefore, the pattern matches files in the '/content/pin-material' directory that start with 'pop', followed by any single character, then an underscore, and then any set of characters. - Based on this pattern, files like 'pop2_tidy_script2.py', 'pop2_tidy_script1.py', 'pop2_debug_script2.py', and 'pop2_debug_script1.py' would match. - Therefore, glob('/content/pin-material/pop?_*') would return the full paths of these files (e.g. '/content/pin-material/pop2_tidy_script2.py'), in no guaranteed order.

Answer 18

- The glob pattern '/content/pin-material/pop*' matches all files in the '/content/pin-material' directory that start with 'pop'. - The '*' wildcard matches any set of characters, so the pattern finds every file whose name starts with 'pop'. - From the given list of files, 'pop2_tidy_script2.py', 'pop2_tidy_script1.py', 'pop2_debug_script2.py', 'pop2_debug_script1.py', and 'pop3_test_script.py' match the pattern. - Therefore, glob('/content/pin-material/pop*') would return the full paths of these five files (e.g. '/content/pin-material/pop3_test_script.py'), in no guaranteed order.

Answer 19

- The glob pattern '/content/pin-material/pop?_tidy_script[1-2]*' matches files in the '/content/pin-material' directory that start with 'pop', followed by any single character, then '_tidy_script', then either '1' or '2', and then any set of characters. - '?' matches any single character, allowing flexibility in matching filenames. - '[1-2]' matches either '1' or '2'. - '*' matches any set of characters, including no characters at all. - From the given list of files, 'pop2_tidy_script2.py' and 'pop2_tidy_script1.py' match the pattern. - Therefore, glob('/content/pin-material/pop?_tidy_script[1-2]*') would return the full paths: ['/content/pin-material/pop2_tidy_script2.py', '/content/pin-material/pop2_tidy_script1.py'] (the order of the results is not guaranteed).

Answer 20

- The glob pattern '/content/pin-material/fft*.jpg' matches files in the '/content/pin-material' directory that start with 'fft', followed by any set of characters, and end with '.jpg'. - '*' matches any set of characters, including no characters at all. - From the given list of files, 'fft_colour.jpg' and 'fft_bw.jpg' match the pattern. - Therefore, glob('/content/pin-material/fft*.jpg') would return the full paths: ['/content/pin-material/fft_colour.jpg', '/content/pin-material/fft_bw.jpg'] (the order of the results is not guaranteed).

Answer 21

turn myfile.txt into myfile and txt).

Answer 22

1) basename 2) dirname 3) splitext

Answer 23

extract different parts of a file path.

Answer 24

- The basename function, from the os.path module, extracts the filename from a full path. - It returns the last component of the path, excluding the directory. - For example, basename('/content/pin-contents/s4/s4_rt_data_part01.hdf5') would return 's4_rt_data_part01.hdf5'.

Answer 25

- The dirname function, from the os.path module, extracts the directory name from a full path. - It returns the directory component of the path, excluding the filename. - For example, dirname('/content/pin-contents/s4/s4_rt_data_part01.hdf5') would return '/content/pin-contents/s4'.

Answer 26

- The splitext function, from the os.path module, splits a filename into its base name and extension. - It returns a tuple containing the base name and the extension separately. - For example, splitext('/content/pin-contents/s4/s4_rt_data_part01.hdf5') would return ('/content/pin-contents/s4/s4_rt_data_part01', '.hdf5')

Answer 27

The first element is everything except the extension of the file and the second element is the extension (including the leading .).

Answer 28

`dname = dirname(my_path)`; `fname = basename(my_path)`; `print(splitext(my_path))`

Answer 29

- The splitext function splits the full path into its base name and extension. - When applied to the full path, it returns a tuple containing the base name and the extension separately. - For example, splitext('/content/pin-contents/s4/s4_rt_data_part01.hdf5') would return ('/content/pin-contents/s4/s4_rt_data_part01', '.hdf5').

Answer 30

- When applied to just the filename, the splitext function splits the filename into its base name and extension. - It returns a tuple containing the base name and the extension separately. - For example, if fname is 's4_rt_data_part01.hdf5', splitext(fname) would return ('s4_rt_data_part01', '.hdf5').
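
The three functions can be compared side by side in a short sketch. The path is the example path used in the cards above; the variable names `stem` and `ext` are chosen here for illustration.

```python
from os.path import basename, dirname, splitext

my_path = "/content/pin-contents/s4/s4_rt_data_part01.hdf5"

dname = dirname(my_path)    # directory part only
fname = basename(my_path)   # file name only

# splitext on the full path keeps the directory in the first element...
full_split = splitext(my_path)

# ...while splitext on just the file name does not.
stem, ext = splitext(fname)

print(dname)
print(fname)
print(full_split)
print(stem, ext)
```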

Answer 31

- The code first imports the glob function from the glob module and the basename, splitext and dirname functions from the os.path module. - Use glob to find files: the glob function searches for all files ending with '.hdf5' in the '/content/pin-material/s4/' directory, and the resulting list of file paths is stored in the fileList variable. - A for loop iterates over each file path in fileList. - In each iteration, fNameOnly stores the filename extracted from the full path thisFileName (e.g., if thisFileName is '/content/pin-material/s4/s4_rt_data_part04.hdf5', then fNameOnly will store 's4_rt_data_part04.hdf5'). - `parts = splitext(fNameOnly)` splits the filename stored in fNameOnly into its base name and extension; the base name is stored in parts[0] (e.g., 's4_rt_data_part04'). - `print(parts[0])` prints only the base name. - The loop continues until every element of fileList has been processed.
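
A runnable sketch of the loop described above. It reuses the variable names from the card (`fileList`, `thisFileName`, `fNameOnly`, `parts`) but assumes a temporary directory with empty placeholder files instead of the real '/content/pin-material/s4/' data; `base_names` is added only so the result can be inspected.

```python
import os
import tempfile
from glob import glob
from os.path import basename, splitext

with tempfile.TemporaryDirectory() as tmp_dir:
    # Hypothetical placeholder files standing in for the real data files.
    for part in (1, 2, 3):
        open(os.path.join(tmp_dir, f"s4_rt_data_part{part:02d}.hdf5"), "w").close()
    open(os.path.join(tmp_dir, "notes.txt"), "w").close()

    # Find every .hdf5 file; sort because glob gives no ordering guarantee.
    fileList = sorted(glob(os.path.join(tmp_dir, "*.hdf5")))

base_names = []
for thisFileName in fileList:
    fNameOnly = basename(thisFileName)  # drop the directory
    parts = splitext(fNameOnly)         # (base name, extension)
    base_names.append(parts[0])
    print(parts[0])
```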

Answer 32

These are list comprehensions and list enumeration.

Answer 33

- `list1=[0,1,2,3,4,5]`: Defines a list named `list1` containing integers 0 through 5, entered manually. - `list2=list(range(6))`: Creates a list named `list2` using the `range()` function to generate integers from 0 to 5. - Prints both list1 and list2

Answer 34

- `input_list = range(10)`: Creates a range object containing integers from 0 to 9 (not including 10), assigned to `input_list`. - `output_list = []`: Initializes an empty list named `output_list`. - `for value in input_list:`: Iterates over each value in `input_list`. - Inside the loop: - `value` takes on each value from `input_list` in sequence. - `output_list.append(value * 2)`: Multiplies each value by 2 and appends the result to `output_list`. - `print(list(input_list))`: Prints the contents of `input_list`, displaying integers from 0 to 9. - `print(output_list)`: Prints the contents of `output_list`, displaying each element multiplied by 2.

Answer 35

Python gives us an alternative: the list comprehension.

Answer 36

A list comprehension is simply a statement inside of square brackets which tells Python how to construct the list.

Answer 37

The example above therefore reads as (x * 2) for each value (x) in range(10). i.e., for each value in the list produced by range(10), put it in the variable x, then put the value x*2 into the list. Note that the variable x is just a placeholder and could be called anything.

Answer 38

[0, 2, 4, 6, 8, 10, 12, 14, 16, 18]
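
To see that the loop and the list comprehension build the same list, the two versions can be placed next to each other. The names `output_loop` and `output_comp` are chosen here for illustration.

```python
input_list = range(10)

# Version 1: explicit loop with append.
output_loop = []
for value in input_list:
    output_loop.append(value * 2)

# Version 2: the same thing as a list comprehension.
output_comp = [x * 2 for x in range(10)]

print(output_loop)
print(output_comp)
```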

Answer 39

loud to yourself.

Answer 40

any sort of list and any sort of data, e.g.,

Answer 41

- `original_data = ['Alex', 'Bob', 'Catherine', 'Dina']`: Defines a list named `original_data` containing four strings. - `new_list = ['Hello ' + x for x in original_data]`: Utilizes list comprehension to create a new list named `new_list`. - For each element `x` in `original_data`, the expression `'Hello ' + x` concatenates 'Hello ' with the value of `x`, which represents each name in original_data. - The resulting strings are added to `new_list`. - `print(original_data)`: Prints the contents of `original_data`, displaying the original list of names. - `print(new_list)`: Prints the contents of `new_list`, displaying each name prefixed with 'Hello '.

Answer 42

list comprehension e.g.,

Answer 43

- `original_data = ['This', 'is', 'a', 'test']`: Defines a list named `original_data` containing four strings. - `new_list = [len(x) for x in original_data]`: Utilizes list comprehension to create a new list named `new_list`. - For each element `x` in `original_data`, the expression `len(x)` calculates the length of the string `x`. - The resulting lengths are added to `new_list`. - `print(new_list)`: Prints the contents of `new_list`, displaying the length of each string in `original_data`. - For example, 'This' has 4 characters, 'is' has 2 characters, 'a' has 1 character, and 'test' has 4 characters.

Answer 44

The pass statement in Python serves as a placeholder and does nothing when executed. It is often used to create empty loops, functions, or classes.

Answer 45

- `data1 = [10, 20, 30]` creates a list containing the numbers 10, 20 and 30. - The for loop iterates over each item in `data1`. - Inside the loop, a comment explains that the loop is pointless but serves as a placeholder for future code. - The pass statement is used to indicate that no action needs to be taken inside the loop. - Essentially, pass allows the loop to exist without any executable code inside it, avoiding syntax errors in situations where a loop is required but no action is necessary.

Answer 46

The pass statement itself does nothing when executed and serves as a placeholder. - As a result, there are no print statements or other operations that would produce output, so the code prints nothing.

Answer 47

- It imports the randint function from the random module to generate random integer numbers. - Inside a while loop that runs indefinitely (while True), random numbers between 0 and 5 (inclusive) are generated and printed. - If the number just generated and printed is equal to 0, the break statement is executed. - The break statement immediately terminates the while loop, exiting the loop and ending the program execution. - This allows the program to stop generating random numbers once a 0 is encountered.
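
A minimal sketch of such a loop. The `seed(0)` call and the `drawn` list are assumptions added here so the run is repeatable and the result can be inspected; they are not described in the card.

```python
from random import randint, seed

seed(0)  # assumption: fixed seed so repeated runs draw the same numbers

drawn = []  # keeps a record of every number generated
while True:
    number = randint(0, 5)  # both end points are included
    drawn.append(number)
    print(number)
    if number == 0:
        break  # leave the loop as soon as a 0 appears

print("Stopped after", len(drawn), "numbers")
```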

Answer 48

no code to execute inside it

Answer 49

(other than .csv)

Answer 50

are stored inside it.

Answer 51

a text editor (like Spyder or Notepad or you can look at them using cat on the command line) and read them although they might not make a lot of sense.

Answer 52

Files like this include .csv files, files ending in .txt and most types of programming language 'source code' like python files (.py) and web pages (.html).

Answer 53

text editor.

Answer 54

'numbers' of some sort stored in a way that computers like to read. We call them 'binary' files.

Answer 55

Files like this include the image and video formats that you might be familiar with (.jpg, .gif, .mp4).

Answer 56

nii, .mat)

Answer 57

plain text' (.csv, .txt).

Answer 58

right modules.

Answer 59

plt.FUNCTIONNAME, e.g. plt.plot; the same way as, when using numpy, we use np.array.

Answer 60

i.e., numbers separated - either by spaces, tabs or commas with one row per line of text.

Answer 61

appear directly in a numpy array.

Answer 62

'bowl' over the subject's head. Like in this picture.

Answer 63

magnetic field activity in a particular location.

Answer 64

what is happening all across the subject's head at any moment.

Answer 65

pin-materials s4 sub-directory using this code:

Answer 66

always start by having a look at it to understand the format. Most often we just want to see a few lines of the file to get an idea of what is in it.

Answer 67

`head`, which shows the first 10 lines of a file by default

Answer 68

list the contents of the "s4" subdirectory, we use the `ls` command:

Answer 69

The `ls` command lists the contents of a directory. - `-lh` is a combination of options: - `-l` lists detailed information about each file, including permissions, owner, size, and modification time. - `-h` displays file sizes in a human-readable format (e.g., kilobytes, megabytes). - `pin-material/s4` specifies the directory whose contents will be listed. - Therefore, this command lists detailed information about the contents of the "s4" subdirectory within the "pin-material" directory.

Answer 70

The `head` command displays the beginning of a file. - `pin-material/s4/s4_meg_sensor_data.txt` specifies the file whose beginning will be displayed. - This command shows the first few lines of the "s4_meg_sensor_data.txt" file located within the "s4" subdirectory of the "pin-material" directory.

Answer 71

It gives us this output

Answer 72

We can see that the first line of the file contains some column headings. We make a note of these as we will need them later on: Column 0: Time; Column 1: Left Mean; Column 2: Left Lower CI; Column 3: Left Upper CI; Column 4: Right Mean; Column 5: Right Lower CI; Column 6: Right Upper CI.

Answer 73

It displays the last 10 lines of a file by default.

Answer 74

The code snippet uses NumPy's "loadtxt" function to load numerical data from a text file. - It attempts to load data from the file "s4_meg_sensor_data.txt" located within the "s4" subdirectory of the "pin-material" directory. - However, it may encounter an issue if the first line of the file contains column names or non-numeric data instead of numerical values. - By default, "loadtxt" expects numeric data and will raise a ValueError if it encounters non-numeric content in the first row.

Answer 75

telling NumPy to skip the first line (the first row), which is the header containing the column names; these are strings (words), not numbers

Answer 76

(400, 7): 400 rows and 7 columns; the data type is float64

Answer 77

np.savetxt.

Answer 78

This code snippet uses NumPy's "loadtxt" function to load numerical data from a text file. - It loads data from the file "s4_meg_sensor_data.txt" located within the "s4" subdirectory of the "pin-material" directory, skipping the first row (which likely contains column names or non-numeric information). - The loaded data is stored in the variable "importedData". - This line extracts a subset of the loaded data ("importedData"). - It selects columns 1, 2, and 3 (exclusive indexing) from the loaded data. - The extracted data, representing the timecourse and confidence intervals, is stored in the variable "ourdata". - The "savetxt" function from NumPy is used to save the extracted data ("ourdata") to a new text file named "my_new_meg_data.txt". - This file is saved in the "/content/pin-material" directory. - After saving the data, the "!ls -lth" command is executed to list the files in the directory, providing information about file sizes and modification times.

Answer 79

- np.savetxt: This function from the NumPy library is used to save data to a text file. - 'my_new_meg_data.txt': Specifies the name of the text file where the data will be saved. - ourdata: Represents the data to be saved. This variable holds the extracted subset of the loaded data, which includes columns 1, 2, and 3. - header='Mean UppcrCI LowerCI': Defines a header string that will be written at the beginning of the file. This header typically contains column names or other descriptive information. In this case, the header string specifies the column names as "Mean", "UppcrCI", and "LowerCI". - fmt='%1.4e': Specifies the format string for writing the data. The %1.4e format specifier formats floating-point numbers with scientific notation (exponential format) and exactly 4 digits after the decimal point. This ensures that the data is written with a precision of 4 decimal places.
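
The load, slice and save steps can be sketched end to end on a tiny made-up file. The file names, the three data rows and the header text are assumptions for illustration; `skiprows`, `header` and `fmt` are real parameters of `np.loadtxt` and `np.savetxt`.

```python
import os
import tempfile

import numpy as np

with tempfile.TemporaryDirectory() as tmp_dir:
    in_path = os.path.join(tmp_dir, "sensor_data.txt")  # hypothetical input
    out_path = os.path.join(tmp_dir, "subset.txt")      # hypothetical output

    # Write a small text file: one header row, then numeric rows.
    with open(in_path, "w") as f:
        f.write("Time LeftMean LeftLowerCI LeftUpperCI\n")
        f.write("-0.1 1.0 0.5 1.5\n")
        f.write("0.0 2.0 1.5 2.5\n")
        f.write("0.1 3.0 2.5 3.5\n")

    # skiprows=1 skips the header, which cannot be converted to numbers.
    imported = np.loadtxt(in_path, skiprows=1)

    # Columns 1, 2 and 3 (the slice end, 4, is excluded).
    subset = imported[:, 1:4]

    # Save with a header line and scientific notation, 4 decimal places.
    np.savetxt(out_path, subset, header="Mean LowerCI UpperCI", fmt="%1.4e")

    with open(out_path) as f:
        saved_lines = f.read().splitlines()

print(imported.shape, imported.dtype)
print(saved_lines[0])
print(saved_lines[1])
```

Note that `np.savetxt` prefixes the header with `# ` by default, which lets `np.loadtxt` treat it as a comment when the file is read back.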

Answer 80

average timecourses measured from some sensors on the left and right side of the head after a 'beep'.

Answer 81

'average' response.

Answer 82

list or as a numpy array.

Answer 83

Since only one set of values is passed, these values will be used as the y-values, and the x-values will be automatically generated as the indices of the data points (0, 1, 2, 3). In this example, we did not supply an 'X' axis. matplotlib assumed that we just wanted 0,1,2,3 and so on. So we really plotted (0, 2), (1, 3), (2, 4) and (3, 5) and joined them up with a straight line. When matplotlib draws a line in this way, it joins the points with straight lines by default.

Answer 84

This is a string which describes how to format the data in the plot. Here we do this by just adding an 'o' to the function call.

Answer 85

The first line plots the data points as circles ('o') without joining lines, creating a scatter plot.

Answer 86

The second line plots the same data points as green circles ('og') without joining lines.

Answer 87

The third line plots the data points as circles with a dashed line ('o--') joining them.

Answer 88

In Matplotlib, markers are symbols used to denote individual data points in plots. Here are some commonly used markers: - '.': Point marker - 'o': Circle marker - 'v': Downward triangle marker - '^': Upward triangle marker - '+': Plus marker - '*': Star marker

Answer 89

For instance, -- means use a dashed line whilst -. means use a dash/dotted line

Answer 90

adjust the colour of our lines or markers using the format string.

Answer 91

but we can override this

Answer 92

1. `import matplotlib.pyplot as plt`: Imports the Matplotlib library and aliases it as `plt` for convenience. 2. `plt.cla()`: Clears the current axes to ensure a fresh plot. 3. `plt.plot([2, 3, 4, 5], 'r+')`: Plots the data points [2, 3, 4, 5] with red plus markers ('r+'). The 'r' indicates the color red, and the '+' specifies the marker style as a plus sign. The datapoints are not joined by a line

Answer 93

- 'b': blue - 'g': green - 'r': red - 'c': cyan - 'm': magenta - 'y': yellow - 'k': black - 'w': white

Answer 94

This code plots the points [2, 3, 4, 5] using blue stars at each data point with a dash-dotted line in between.

Answer 95

This code plots a graph with green star markers at the coordinates (-2, 2), (1.5, 3), (2, 4), and (4, 5).

Answer 96

The absence of a line in the plot is due to the format string 'g*', where 'g' denotes green color and '*' denotes star markers, but no line style is specified.
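
The effect of a format string can be checked by inspecting the line object that `plt.plot()` returns. This is a sketch using the format strings discussed in the cards above ('r+', 'b*-.', 'g*'); the non-interactive "Agg" backend is an assumption so that the code runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # assumption: draw off-screen, no window needed
import matplotlib.pyplot as plt

plt.figure()

# Red plus markers, no joining line; x values default to 0, 1, 2, 3.
(markers_only,) = plt.plot([2, 3, 4, 5], "r+")

# Blue stars joined by a dash-dotted line.
(stars_dashdot,) = plt.plot([2, 3, 4, 5], "b*-.")

# Green stars at explicit x positions, again with no line.
(green_stars,) = plt.plot([-2, 1.5, 2, 4], [2, 3, 4, 5], "g*")

print(markers_only.get_marker(), markers_only.get_linestyle())
print(stars_dashdot.get_marker(), stars_dashdot.get_linestyle())
print(green_stars.get_color(), list(green_stars.get_xdata()))

plt.close("all")
```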

Answer 97

existing figure

Answer 98

create complex plots from multiple components

Answer 99

A simple example would be a line plot with many lines of data, or a scatter plot where we show different types of data with different symbols

Answer 100

#set limits of y axis go from -3 to +4

Answer 101

#even put a beautiful grid on it

Answer 102

This code saves the current figure as an image file named "my_figure.png" with a resolution of 300 dots per inch (dpi).

Answer 103

savefig function

Answer 104

You can either select the figure and then use plt.savefig() or you can use the .savefig() method directly on the figure object.
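
Both ways of saving can be sketched as follows. The output file names and the temporary directory are made up for illustration, and the "Agg" backend is an assumption so that no display is required.

```python
import os
import tempfile

import matplotlib

matplotlib.use("Agg")  # assumption: draw off-screen, no window needed
import matplotlib.pyplot as plt

figureHandle = plt.figure()    # keep a handle to the new figure
plt.plot([2, 3, 4, 5], "o--")  # circles joined by a dashed line
plt.ylim(-3, 4)                # y axis runs from -3 to +4
plt.grid(True)                 # add a grid
y_limits = plt.ylim()          # read the limits back

with tempfile.TemporaryDirectory() as tmp_dir:
    path_a = os.path.join(tmp_dir, "my_figure.png")
    path_b = os.path.join(tmp_dir, "my_figure_copy.png")

    # Option 1: save whichever figure is currently selected.
    plt.savefig(path_a, dpi=300)

    # Option 2: call the method on the figure object itself.
    figureHandle.savefig(path_b, dpi=300)

    sizes = [os.path.getsize(path_a), os.path.getsize(path_b)]

plt.close("all")  # close all existing figures
print(sizes)
```

Saving happens before any call to `plt.show()` here, which avoids the problem of saving an already finished figure.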

Answer 105

figureHandle = plt.figure() # Generate a new figure. Hold its 'ID' or 'Handle' in a variable

Answer 106

Close all existing figures

Answer 107

The plt.legend() function adds a legend to the plot, which provides labels for the plotted lines. In this code, it labels the first line as "A straight line" and the second line as "A wiggly line".

Answer 108

plot means.

Answer 109

In the provided code, the line plt.plot([2, 3, 4, 5]) creates a plot with only y-values, so it auto-generates x-values as sequential integers starting from 0. This line is plotted in the default color, which is blue. The subsequent line plt.plot([-2, 1.5, 2, 4], [2, 3, 4, 5], 'g') plots the provided x and y values in green color as specified by the 'g' argument.

Answer 110

use plt.title to add a title to our plot and plt.text allows us to plot text at arbitrary positions in our figure.

Answer 111

The plt.text() function in this code adds text to the plot at a specified location. In this case, it adds the text "Hello World" at the position (2, 2) on the plot, with the color set to red.

Answer 112

- np.loadtxt() loads data from a text file, skipping the first row. - Time data is stored in the first column (t). - Sensor readings from columns 1 and 4 are extracted (plot_dat). - Printing plot_dat.shape shows the size of the extracted data: (400, 2), i.e. 400 rows and 2 columns. - plt.plot(t, plot_dat) plots the sensor data over time. - A legend and gridlines are added for clarity.

Answer 113

Until this point, each of our plt.plot() calls has only plotted a single line. If there are multiple columns in the data passed to plt.plot() (here, plot_dat), matplotlib will automatically plot multiple lines - magic! We have also passed in values for the x axis (t: the time variable). Notice that we have time before 0s. Time 0 is the time we present the stimulus (in this case a 'beep'). We see that we get a large deviation in the signals shortly after the presentation of the stimulus.

Answer 114

are mean response across multiple presentations of the same 'beep'

Answer 115

plt.fill_between()

Answer 116

(perhaps an 'error envelope' is a better description) to our plots.

Answer 117

filling the area between two curves

Answer 118

Plots the error envelope: the shaded area between left_lower_ci and left_upper_ci along the time axis. - The t parameter defines the x-axis values. - left_lower_ci = data[:, 2] extracts column 2 (the lower CI). - left_upper_ci = data[:, 3] extracts column 3 (the upper CI).

Answer 119

first plotting the mean line in black using plt.plot() (color='k'), then plotting the error envelope over the top (plt.fill_between()). When we plot over the top we have to set the color to be a bit transparent otherwise you will not see the line below. Computers often refer to the transparency or 'solidness' of a color as its 'alpha' value. So if we set 'alpha' to 0.5, it will become 50% see-through. 20% is even more see-through.

Answer 120

plt.plot() and plt.fill_between(), where we use 'shorthand' for colors: 'green' is 'g', 'blue' is 'b', 'black' is 'k' and so on. e.g., plt.plot(data[:,0],color='r') e.g. plt.fill_between(t, left_lower_ci, left_upper_ci,alpha=0.5, color='g')
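
A minimal sketch of a mean line with a semi-transparent error envelope. The sine-wave "mean" and the fixed ±0.2 band are synthetic stand-ins for the real sensor data, and the "Agg" backend is an assumption so that no display is required.

```python
import matplotlib

matplotlib.use("Agg")  # assumption: draw off-screen, no window needed
import matplotlib.pyplot as plt
import numpy as np

# Synthetic data standing in for the time, mean and CI columns.
t = np.linspace(-0.1, 0.3, 50)
mean = np.sin(10 * t)
lower_ci = mean - 0.2
upper_ci = mean + 0.2

plt.figure()

# Draw the mean line first, in black.
(mean_line,) = plt.plot(t, mean, color="k")

# Shade between the two CI curves; alpha=0.5 keeps the line visible.
envelope = plt.fill_between(t, lower_ci, upper_ci, alpha=0.5, color="g")

print(mean_line.get_color(), envelope.get_alpha())
plt.close("all")
```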

Answer 121

1. Specify a single letter, e.g. 'r' = 'red'. 2. Use red, green, blue (RGB) format. 3. Use colour names like aquamarine, mediumseagreen.

Answer 122

numbers are histograms (frequency plots) and boxplots.

Answer 123

- data is a 2-D array with 10 rows and 4 columns, filled with randomly generated numbers from the interval between 0 and 1 (1 itself is excluded). - data[:,0] selects all the rows of the first column (index 0) and plots them on the y axis. - Since no x-axis values are given explicitly, indices are generated for the x axis: 0 to 9, since there are 10 rows in the data array. - The values are plotted against their indices as a red line.

Answer 124

This produces an array containing 10,000 random numbers drawn from the standard normal distribution (mean = 0, SD = 1), unlike rand, which produces numbers from a flat distribution. The majority of numbers fall around the mean (0).

Answer 125

Produces an array containing 10,000 random numbers drawn from the standard normal distribution, but shifted so that the majority of numbers fall around 1.8 rather than 0

Answer 126

- plt.figure(): produces a figure for the histogram. - plt.hist(data1, bins = 50): the x axis covers the range of values in the dataset data1, divided into 50 bins; the y axis is the frequency (count of occurrences) of data points falling within each bin. - The bins argument specifies how lumpy I want the histogram to be.

Answer 127

Specifying the alpha parameter in plt.hist(). The alpha parameter controls the transparency of the histograms.

Answer 128

Alpha value of 0.3 means bars in histogram are somewhat transparent

Answer 129

less transparent the histogram will be

Answer 130

- binEdges = np.linspace(-10,10,1000) produces an array of 1000 evenly spaced bin edges ranging from -10 to 10 inclusive; each adjacent pair of edges defines the boundaries of one bin of the histogram, giving 999 bins. - In plt.hist(data1, bins = binEdges), the bins parameter is used to specify the bin edges to use for the histogram.
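
The relationship between the number of edges and the number of bins can be checked directly, since `plt.hist()` returns the counts and the edges it used. The seed and the use of `np.random.randn` for `data1` are assumptions for the sketch, as is the "Agg" backend.

```python
import matplotlib

matplotlib.use("Agg")  # assumption: draw off-screen, no window needed
import matplotlib.pyplot as plt
import numpy as np

np.random.seed(0)                 # assumption: fixed seed for repeatability
data1 = np.random.randn(10000)    # standard normal: mean 0, SD 1

# 1000 evenly spaced edges from -10 to 10, end points included.
binEdges = np.linspace(-10, 10, 1000)

plt.figure()
# alpha=0.3 makes the bars partly transparent.
counts, edges, patches = plt.hist(data1, bins=binEdges, alpha=0.3)

print(len(binEdges), "edges give", len(counts), "bins")
plt.close("all")
```

Passing an integer such as `bins=50` instead lets matplotlib choose the edges itself from the range of the data.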

Answer 131

skewed distribution.

Answer 132

(25-75% range) as well as the median (50% value). Outlier points (as defined by a multiple of the interquartile range) are plotted as individual points outside the whiskers of the plot.

Answer 133

- data = np.random.rand(1000,3): produces a 1000x3 NumPy array filled with random numbers between 0 and 1 with a uniform distribution. - data[:,1] = data[:,1]*2 + 1: multiplies the second column of the data array (all rows) by 2 and adds 1 to each value, which scales and shifts the values in the second column. - data[:,2] = data[:,2]*1.5 - 1: multiplies the values in the third column of the data array by 1.5 and subtracts 1 from each value, which scales and shifts the values in the third column. - plt.figure(): produces a figure for the plot. - plt.boxplot(data): produces a boxplot using the data array, treating each column as a different dataset. - plt.show(): displays the figure with the boxplot.

Answer 134

Adds labels for the 3 boxplots: the first boxplot is labelled Set 1, the second Set 2 and the third Set 3

Answer 135

zero-referenced system.

Answer 136

The first argument is a list of numbers indicating the different categories. The second argument is a list of strings saying what to call them.
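
A sketch of labelling three boxplots with `plt.xticks()`. By default `plt.boxplot()` places the boxes at positions 1, 2, 3 rather than 0, 1, 2, so those are the positions passed as the first argument. The seed and the "Agg" backend are assumptions added so the example is repeatable and runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # assumption: draw off-screen, no window needed
import matplotlib.pyplot as plt
import numpy as np

np.random.seed(0)  # assumption: fixed seed for repeatability
data = np.random.rand(1000, 3)
data[:, 1] = data[:, 1] * 2 + 1    # scale by 2, shift up by 1
data[:, 2] = data[:, 2] * 1.5 - 1  # scale by 1.5, shift down by 1

plt.figure()
plt.boxplot(data)  # one box per column, at x positions 1, 2, 3

# First argument: the positions. Second argument: what to call them.
plt.xticks([1, 2, 3], ["Set 1", "Set 2", "Set 3"])

tick_labels = [label.get_text() for label in plt.gca().get_xticklabels()]
print(tick_labels)
plt.close("all")
```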

Answer 137

the color of the background, the line thickness, the font.

Answer 138

individual tweaks to plotting layouts, colours etc, or by changing all of its settings in one go.

Answer 139

plt.style.use() function

Answer 140

Technically this might execute but the file_path is not an absolute path as expected. Almost certainly it is missing the initial '/'

Answer 141

The plt.show() command fixes the image so the last two lines do not do anything. Place them before the plt.show() command.

Answer 142

It assumes there is at least one line of data in the file as well as the header. If this is not true, it will throw an exception.

Answer 143

Again - the plt.show() command stops the next line from working. Change their order.

Answer 144

The 'directory' variable hard-codes the '/' separators. This might fail on Windows where the separator should be '\'

Answer 145

Data is defined as a 2 (down) by 10 (across) array. So it will plot 10 random time series with two points each. Change to rand(10,2) to make it work.

Answer 146

- The line with the error is plt.savefig('/plots/parabola.png'): Windows uses \ instead of /. - One fix is plt.savefig('\\plots\\parabola.png'). - Better: use os.path.join to glue together all the bits in a platform-independent manner.
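
A minimal sketch of building the path with `os.path.join`. The directory and file names come from the card; treating 'plots' as a relative directory is an assumption made here.

```python
import os

# os.path.join inserts the correct separator for the current platform:
# "/" on Linux and macOS, "\" on Windows.
plot_path = os.path.join("plots", "parabola.png")

print(plot_path)
```

The resulting string can then be passed to `plt.savefig()` in place of the hard-coded path.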

Answer 147

A The second one ('Group 2'). The offset is defined by the +2. The spread is defined by the *.5. So this one will have a median at +2 which is bigger than any of the others. This is because offset (+2) directly adds a constant value to each data point, it has a more significant impact on the median compared to the spread (*.5).

Answer 148

a: apple.txt, allFruit.csv, allFruit.xls, allFruit.tsv, apple.jpg b: apple.jpg, banana.jpg

(202 cards)