Everything Flashcards
Why CV is hard
AI complete, representations, ML, interfacing with plans, signal-to-symbol converter, signals explicitly express very little info for plans, convert to symbols for manipulation, inverse optics, inverse graphics, little cognitive penetrance, CNNs utilise prior knowledge, poverty of signal data wrt bottom up analysis (edges and foxes), need top down prior knowledge model driven vision; face pixel intensity array unrecognisable as 3D plot; poverty of signal dat;
Ill posed problems
figure ground segmentation; infer 3D arrangement - occlusion; surface properties texture/colour from image stats; volumetric properties from 2D image projections; real time; depth property inference; surface property inference; colour inference invariant wrt illumination; structure from motion – shading, texture shadows; 3D shape from 2D line drawing; pose invariant recognition; understanding objects never seen before; Hadamard: well posed if solution exists; unique; depends continuously on data;
Pixel arrays
- CCD dense independent sensor array charge ~ energy
- CCD local charge coupling; CMOS
- sensing elements only few microns in width
- photon flux limits resolution growth via more dense sensors
- spatial resolution of image determined by sensor density and optical figure of merit of lens
- luminance resolution is no of distinguishable grey levels
- LR det by bits per pixel (digitizer) + SNR of CCD array
- Colour 3 subarrays preceded by RGB filters Bayer pattern twice as many G to reflect cone sensitivity
Data in video streams
- Composite video high frequency chrominance burst colour encoding;
- S-video separate luma/chroma
- Separate RGB ccs
- Colour requires less info than luminance – exploited by coding schemes
- Framegrabber/strobed sampling block high speed ADC discretises video into byte stream sequence of frames
- NTSC 30 fps interlace of alternate lines 60 fields per second; PAl – 25 fps
- Vast flood of data in a video stream even without HDTV
- PAL 11 million pixels/sec … 8 bits per pixel … 264 MB/s - coping with data flux
Image formats and sampling theory
- Rectangular array of sampled intensities
- Separate colour planes
- Redundancy in correlation neighbouring pixels highly compressible
Examples of image formats and encodings
- jpeg controllable Q factor quantised DCT coefficients of tiles frequency dependent depth
- jpeg2000 better v of jpeg smooth Daubechies wavelets avoid block quantisation artefacts
- mpeg stream oriented individual frames jpeg equal amount of temporal redundancy removed via inter frame predictive coding interpolation
- gif sparse binarised images bandwidth limited media
- png lossless compression
- tiff tagged image file formats non compressive randomly embedded tags
- bmp non compressive bit mapped individual pixel values easily extractable
- Colour spaces are used for colour separation
- In compressed formats image payload actually in a transform domain so pixel vals obtained by inverse transform
Information content in an image
- Bit count doesn’t relate to optical properties nor freq analysis
- Nyquist – highest spatial frequency component of information contained = 1/2 the sampling density of pixel array
- 640 cols – 320 cycles/image highest spatial frequency components
- 30 fps – highest temporal frequency is 15 Hz
- RGB-D sensors capture depth
Second order pixel statistics to aid segmentation
- low level metrics useful for segmentation
- NIR – compute pixel variance and mean in local patches imaging ratio sets eyelid boundaries on fire
Neuron properties
Neurones are sluggish but richly interconnected cells having both analogue and discrete aspects, with nonlinear, adaptive features.
What is a neurone
Fundamentally they consist of an enclosing membrane that can separate electrical charge, so a voltage difference generally exists between the inside and outside of a neurone.
Neuronal membrane properties
Bilipid layer capacitance of 10K microfarad /cm2+ pores that are differentially selective to different ions Na+, K+, Cl-
Catastrophic breakdown
- Neuronal membrane differentially selective to diff ions
- Ion species cross through membrane via protein pores (discrete conductances/resistors)
- Resistors for Na+, K+ are voltage dependent
- Na+ flow into neurone, voltage becomes more + on inside further reducing membrane resistance to Na+ so more enters
- Catastrophic breakdown in resistance to Na+ constitutes a nerve impulse
- Within a msec slower but opposite effect involving K+ restores original transmembrane voltage
- Refractory period to restore electro osmotic equilibrium after which we’re ready to fire again
Refractory period duration
2 msec
Prevents clocking faster than 300 Hz about 10^6 times slower than PC clock
Balanced by massive interconnectivity
Nerve impulse propagation speed down axons
100 m/sec
Character of impulse signalling
Impulse signalling can be described as discrete, but the antecedent summations of current flows into a neurone from other neurones at synapses, triggering an impulse, are essentially analogue events.
Synchrony of neural activity
In general, neural activity is fundamentally asynchronous: there is no master clock on whose edges the events occur.
Brain tissue density
10^5 neurones / mm3