L19 Gene Mapping Flashcards
Shotgun sequencing
does not define a genome, requires assembly and annotation
Genome sequence
does not provide all the information
- chromatin information, epigenetic states are important too
Single reference genome does not represent the whole species
variation via:
- mutation
- migration
- selection
- drift
Genetic markers used as landmarks
- to assess genetic diversity
- to gain positional information about genetic basis of traits
Genetic markers predate genome information
first genetic markers: phenotypic markers in drosophila e.g. eye colour
first molecular markers: isozymes
molecular markers often require combining technologies
Genome information
highly valuable at defining a species and gaining information on the structures and function of the organism
Sequencing a genome
see onenote
- Shotgun sequencing e.g. Illumina
Fragments:
- single reads
- paired-end reads
- mate-pair libraries
Assembling a genome
see onenote
- build contigs based on sequence overlap represented in a de Bruijin graph
- scaffold contigs using large inserts/long read technology
Integrating scaffolds into maps
Physical maps = BAC
Genetic maps = linkage maps
Genetic and physical mapsillustrate the arrangement ofgenes andDNA markers on a chromosome. The relative distances between positions on agenetic mapare calculated using recombination frequencies, whereas aphysical mapis based on the actual number of nucleotide pairs between loci.
Annotating a genome
see onenote
- gene prediction models
- expression data
- homologous protein identification
- final model combining multiple sources of evidence
confronting obtained annotation gene set against a biological expectation
Coupling chemical or enzymatic treatment with sequencing
- bisulfite conversion
- chromatin immuno-precipitation
- assay for transposase accessible chromatin
- high throughput chromosome conformation capture
Coupling chemical or enzymatic treatment with sequencing
- bisulfite conversion
- chromatin immuno-precipitation
- assay for transposase accessible chromatin
- high throughput chromosome conformation capture
Genetic markers for population genetics
- simpler genetic landmarks used as a proxy for more complex and less accessible causal source of variation
- explicitly relies on linkage disequilibrium
A good marker is
- polymorphic and abundant
- unambiguous/repeatable
- neutral (not causal) + co-dominant
SNP as the reference marker
- most abundant
- easy to genotype either using microarray or direct sequence
- neutral?