Final Iteration Flashcards

Question

How do you reduce noise?

Answer 1

Overlay multiple copies of the image on top of each other, and then produce an image where each pixel is the average across all the other pixel values in that location. Alternatively, applying filters can reduce noise e.g. mean filtering

Answer 2

The process of applying a filter to an image

Answer 3

The process of applying a filter whose values are determined by a Gaussian function. Higher weight is given to pixels near the source pixel (origin).

Answer 4

Create a small square window which samples the Gaussian function, and normalises the results so that the filter entries add to 1.

Answer 5

Depends upon the variance. A higher variance leads to more values being included that are above the 98% threshold.

Answer 6

A 2D filter which can be split into two 1D filters e.g. a 2D Gaussian filter can be split into two 1D Gaussian filters, a horizontal one and a vertical one

Answer 7

When a faulty sensor registers either an error for a sample (black), or false saturation (white)

Answer 8

Add up all of the pixel values, then divide by the amount of pixels to generate the median value. Then, apply that median value to pixels within it's radius.

Answer 9

Anisotropic - not the same all sides Diffusion - spreading out Anisotropic Diffusion - making each pixel more like neighbouring pixels that it is already similar to.

Answer 10

Calculated by using (D-d)/D, where D is the maximum possible difference, and d is the difference between the two target pixels. S(p,q) is near to 1 - pixels are borderline identical S(p,q) is near to 0 - pixels are almost entirely opposite to each other S(p,q) means the new value at pixel p is based on all its neighbours, called q in this case.

Answer 11

Higher K value - greater smoothing, mostly preserves edges

Answer 12

Works by using two Gaussians. One weighs the value of pixels near the source pixel, whilst one weights the value of pixels similar to that of the target pixel.

Answer 13

An adaptive thresholding technique Assumes histograms are bimodal. Computes the weighted sum of the histogram, and selects the smallest T. This then becomes the new threshold for that area.

Answer 14

Assumes histograms are unimodal. Draws a line from the highest peak, down to the furthest bin's peak. The bin that is the furthest away from that line is set as the threshold.

Answer 15

Turns a binary image into one that is labelled. Any areas that are connected via 1s is assigned a label. Anything with 0s is not labelled.

Answer 16

Dilation - expands the foreground Takes the structuring element's origin, and puts it 'on top' of the target pixel. Any background pixel that the structural element overlaps becomes a part of the foreground. Erosion - Shrinks the foreground Takes the structuring element's origin, and puts it 'on top' of the target pixel. If there are any background pixels within the range of the structural element used, then set the target pixel to the background.

Answer 17

Opening - erode, then dilate Smoothes contours, and eliminates protrusions Closing - dilate, then erode Smoothes contours, fuses small gaps, and fills in small holes

Answer 18

Take the original image Apply either erosion or dilation Then, with the new image, subtract that from the original. The edges will now be displayed

Answer 19

Calculated by subtracting the current element from the one prior to it. Also written as (x+1)-x

Answer 20

(x+1) + (x-1) - 2(x) I.e. element before current + element after current - 2(current element)

Answer 21

f is roughly |Gx|+|Gy|

Answer 22

Gx: 1|0 0|-1 Gy: 0|1 -1|0

Answer 23

Gx: -1|0|1 -2|0|2 -1|0|1 Gy: -1|-2|-1 0|0|0 1|2|1

Answer 24

Take image Apply Gaussian smoothing Subtract the smoothed image from original to obtain unsharp mask Add the unsharp mask to the original image

Answer 25

Produces a stronger reaction to fine details, and has a simple implementation.

Answer 26

Using a Laplacian filter Advantage - simple implementation via convolution

Answer 27

Normal: 0|1|0 1|-4|1 0|1|0 Single Operator: 0|-1|0 -1|-5|-1 0|-1|0

Answer 28

Look for peaks in the 1st derivative, or zero-crossings (i.e. crossing the x-axis) in 2nd derivative

Answer 29

Very efficient - uses only 4 pixels, and only subtracts and adds Very susceptible to noise, and only reacts to very strong edges

Answer 30

Using Marr-Hildreth - convolving the Laplacian of a Gaussian OR applying Gaussian smoothing, followed by a Laplacian

Answer 31

1st derivative peaks - strong response at edges, but also responds to noise. Peak detection and threshold selection need care 2nd derivative zero-crossings - well-defined, easy to detect. Edges must form smooth, connected contours, but tends to found off on corners.

Answer 32

Good Detection Good localisation Minimal response

Answer 33

1st derivative of a Gaussian smoothed image. Most implementations are 2D Gaussian smoothing + Roberts' style derivative

Answer 34

Check if pixel is a local maximum along the gradient direction, and select a single maximum across the width of the edge.

Answer 35

Test each pixel independently. Industry standard allows a band of variation, but assumes continuous edges. User still selects parameters, but at the cost of less precision. Idea is to keep weak edges connecting strong edges if the strong edges are exceptionally strong, and the weak edges are not exceptionally weak.

Answer 36

P(Rk) = Nk Rk = kth grey level Nk = amount of pixels at kth grey level Normalised: P(Rk) = Nk / N N = amount of pixels in the image

Answer 37

A normalised histogram has bins that add up to 1.0 Each bin gives the probability of that grey level appearing in the image.

Answer 38

Low contrast - Very minimal spread, exceptionally high spikes High Contrast - very minimal height, extreme spread across the histogram.

Answer 39

Dark - Very left-hand side is full of extreme spikes Light - very right-hand side is full of extreme spikes

Answer 40

Aims to spread out the values across the histogram, so that the new image has a nearly uniform distribution of pixel values across the histogram i.e. no spikes at certain grey levels. Increases contrast of overall image.

Answer 41

Take the R values and the Nk values. To calculate the Pr(Rk) values, you need to divide each Nk by the sum of all Nk values. Next, using Pr(Rk), you can work out T(r) by taking the value for the first Pr(Rk), and copying it across. Then, for each subsequent value, add it, and any previous values, on top. That is their T(r) value. Next, take the T(r) value, and multiply it by the maximum R value (should be a whole number). Finally, for each of those values, round them to the nearest whole number (Note - always round down, no matter what) Using these values, you can plot a new histogram using specifically Pr(Rk) and the rounded Sk values (the rounded numbers). Design a histogram such that Pr(Rk) is the value on the y-axis, and the rounded value is on the x-axis. If there are multiple values for one rounded value, then add them together, which will result in the final y-axis point for that rounded value.

Answer 42

Works well when the input isn't too noisy, and there aren't any exceptionally bright or dark areas that could overpower the new image. A way to counter these problems is using equalisation of histograms in local areas of the image.

Answer 43

Noise and/or different camera responses can give similar images with very different histograms Histogram resolution - may need many bins to store all the colours, which can get very expensive very quickly (expensive in terms of storage and memory cost) The illumination may be coloured - same object may generate a different histogram under different lighting They ignore spatial information

Answer 44

Using two 'arrays' for histograms, put one above the other. Then, for each pair (one above, one below), take the smallest value. Add all those values together and you get the histogram intersection score. A higher score means more intersection.

Answer 45

Textual query - finds images in the database based on key words associated with each image - can get expensive Content-based retrieval - use the shapes, textures and colours within the query image to search for images that have similar properties.

Answer 46

Divide the image into small windows, and see how much of the target colour is present, by highlighting those pixels in the image that are similar to those in the query.

Answer 47

Clustering - seeks groups of similar pixels, with no regard for where they are, and views images as uncorrelated data. Region-based - starts with a 'seed' (origin pixel), and computes a similarity value for comparison with other neighbours e.g. average grey level value. If the neighbour is close to that value, adds it to the region. Split and merge - Splits up the image into many regions, then merges them based on their similarity to each other.

Answer 48

Trees which represent how the image has been split - used within split and merge. Each time it splits the image/region, it splits it into 4 parts, hence quad

Answer 49

Edges are represented as ridges in an otherwise massive 'valley' where basins are regions in the image. Figuratively, it raises the water level, and when the water overflows a ridge, it detects an edge. In practice, it orders pixels from lowest value to highest, and goes through them one by one, assigning labels to them. if it doesn't have a label, and no neighbours have a label, then give it one. If it has a neighbour with a label, then give it that one. If it has two or more neighbours with different labels, then mark that pixel. It is considered as an edge.

Answer 50

Initialise cluster centres on pixel grid in S steps - image has N pixels and you want K superpixels, where each superpixel is roughly a square region of N/K pixels. Thus, S = square root of (N/K) Move centres to the position in a 3x3 window with the smallest intensity gradient - moving centres away from edges Compare each pixel to all cluster centres with 2S pixels and assign it to the best matching cluser Recompute cluster centres as mean colour and position of the pixels belonging to each cluster Repeat steps 3 and 4 until total change made to position and colour of centres is below a threshold, or for a fixed number of iterations.

Answer 51

Take a straight line, and match it to all possible image orientations and positions, and compute a measure of fit to the edge data - incredibly expensive, don't do this :(

Answer 52

Takes lots of edges, transforms them into lines that exist in m,c space and finds lots of places where they intersect and returns those parameter points

Answer 53

Smoothing - low pass filtering - attenuate high frequency components Sharpening - high pass filtering - attenuate low frequency components Band limiting - set all components to 0 outside a given frequency range

Answer 54

Variable Length Encoding Assigns fewer bits to more commonly used grey levels than to less probable ones - in order to save space

Answer 55

A measure of success of psychovisual redundancy compression. Lower score is better quality, and score is determined by comparing the original image to the newly compressed version of the image.

Answer 56

Transforms input data in a way that facilitates reduction of interpixel redundancies - reversible

Answer 57

Transforms input data in a way that facilitates reduction of psychovisual redundancies - not reversible

Answer 58

Assigns the shortest code to the most frequently occurring output values - reversible

Answer 59

The average information content of an image, a measure of the histogram dispersion.

Answer 60

Map vector values (R, G, B) onto scalar values. Multiple vectors map to each scalar For each pixel in the original image, find the closest colour in the Colour Table. Record the index for that colour. To reconstruct the image, place the indexed colour from the Colour Table at the corresponding spatial location.

Answer 61

Find clusters of pixels that are close/similar in colour, and combine them all to form a 'generic' colour, which represents them all. Replace them all by this single colour.

Answer 62

Code the difference between adjacent pixels. Prediction is that the next pixel value is equal to the current one, and you need the first value to provide a point of reference. It is lossless and is of a lower entropy.

Answer 63

Break into 8x8 blocks, and start from the top left and work your way through as if you were reading a book. For each one, subtract half the maximum intensity value. Once finished, apply the 2D-DCT to this block.

Answer 64

DC components summarise patch intensity AC components are quantised - divide the DCT block by values in a quantisation table, with different tables for luminance and chrominance

Answer 65

Invariant to translation and rotation Change slowly as viewing direction changes Change slowly with object size Change slowly with occlusion

Answer 66

Spatial Psychovisual Coding

Final Iteration Flashcards

(90 cards)