Lecture 1: Organisation of the Human Genome Flashcards

Question

Non-coding RNA (ncRNA) genes: the 8 major classes

Answer 1

1. tRNA (translational machinery; gene cluster on Chr 6 containing an almost complete set)
2. rRNA (translational machinery; 150-200 copies)
3. Short regulatory ncRNA, which includes:
   4. snoRNA (RNA processing/base modification; 97 snoRNAs, >85% single copy)
   5. snRNA (RNA processing/splicing; multiple copies of some)
   6. miRNA/piRNA/tiRNA (gene expression)
7. lncRNA (epigenetic control of chromatin, promoter-specific gene regulation, mRNA stability, X-chromosome inactivation and imprinting)
8. Others? (a very active field of research)

Answer 2

LOOK AT SLIDE 17

Answer 3

1. Sequences related to coding or non-coding genes that have mutated such that expression/function is lost (e.g. stop codons introduced, frameshifts)
2. Derived from genes (coding and non-coding) by duplication or retrotransposition
3. Different types include:
   4. Gene fragments: single exons or multiple exons. Very common
   5. Whole genes: include introns. Splice sites often mutated
   6. Processed pseudogenes: mature mRNA from an expressed gene, reverse-transcribed and integrated into the genome

Answer 4

1. Gene fragments: single exons or multiple exons. Very common
2. Whole genes: include introns. Splice sites often mutated
3. Processed pseudogenes: mature mRNA from an expressed gene, reverse-transcribed and integrated into the genome

Answer 5

1. Missing promoter
2. Missing start codon
3. Frameshift
4. Premature stop codon
5. Missing intron
6. Partial deletion
See the gene segment drawing on slide 18.
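Three of the defects listed above (missing start codon, frameshift, premature stop codon) can be checked directly on a sequence. The following is a minimal sketch, not a real pseudogene-annotation method: the function name `pseudogene_defects` and the example sequences are hypothetical, and it assumes the input is a candidate coding sequence read in frame from its first base.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def pseudogene_defects(cds):
    """Return a list of simple defects found in a candidate coding sequence.

    Toy check only: assumes `cds` should start at the ATG and be read in frame 0.
    """
    cds = cds.upper()
    defects = []
    if not cds.startswith("ATG"):
        defects.append("missing start codon")
    if len(cds) % 3 != 0:
        defects.append("length not a multiple of 3 (possible frameshift)")
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    # A stop codon anywhere before the last complete codon is premature.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            defects.append(f"premature stop codon at codon {index + 1}")
            break
    return defects

# Hypothetical sequences for illustration.
print(pseudogene_defects("ATGGCTGAATAA"))      # intact toy gene
print(pseudogene_defects("ATGTAAGCTGAATAA"))   # stop codon introduced early
print(pseudogene_defects("ATGGCGAATAA"))       # one base deleted
```

A missing promoter or a partial deletion cannot be seen from the coding sequence alone; those need a comparison against the functional parent gene.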

Answer 6

1. Make up 1.5% of the genome, but they are the most studied
2. Produce proteins which perform activities required by the cell (metabolism, transcription, translation, etc.)
3. Can be single copy (e.g. beta globin) or multiple copy (e.g. HLA class I genes)
4. Genes can be grouped into families based on sequence similarity
   5. Often evolved by duplication and divergence, and found in clusters
6. Some families group into superfamilies based on common protein domains (e.g. Ig-SF)
7. Coding genes can be identified by comparing mRNAs (i.e. spliced sequences) with genomic sequences
8. GenBank is a public store of mRNA sequences generated by laboratories worldwide
9. Gene/mutation naming conventions are important for communication of findings
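The idea of identifying a gene by comparing a spliced mRNA with the genomic sequence can be sketched in a few lines. This is a toy illustration under strong assumptions, not how real spliced aligners work: the mRNA must match the genome exactly (no sequencing errors or polymorphisms), every exon must be at least `seed` bases long, and exon boundaries may be shifted by a base or two when the start of an intron happens to match the start of the next exon. The function name `locate_exons`, the `seed` parameter and the sequences are hypothetical.

```python
def locate_exons(genomic, mrna, seed=8):
    """Return (start, end) genomic coordinates of the blocks that make up `mrna`.

    Greedy seed-and-extend: find the next `seed` bases of the mRNA in the
    genome, extend the match as far as it goes, then repeat downstream.
    """
    genomic, mrna = genomic.upper(), mrna.upper()
    exons = []
    g = 0  # genomic position to search from
    m = 0  # next unmatched mRNA position
    while m < len(mrna):
        k = min(seed, len(mrna) - m)
        hit = genomic.find(mrna[m:m + k], g)
        if hit == -1:
            raise ValueError(f"mRNA position {m} not found in genomic sequence")
        length = k
        # Extend while the mRNA and the genome keep agreeing.
        while (m + length < len(mrna)
               and hit + length < len(genomic)
               and genomic[hit + length] == mrna[m + length]):
            length += 1
        exons.append((hit, hit + length))
        m += length
        g = hit + length
    return exons

# Hypothetical two-exon gene: the gap between the blocks is the intron.
exon1, intron, exon2 = "ATGGCCGATTTC", "GTAAGTCCCCCCCCTTTTCAG", "AAGCTGGAGTAA"
genomic = "TTTT" + exon1 + intron + exon2 + "CCCC"
for start, end in locate_exons(genomic, exon1 + exon2):
    print(start, end, genomic[start:end])
```

The gaps between consecutive blocks are the inferred introns, which is why spliced mRNA sequences are so useful for gene finding.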

Answer 7

1. Numerous genes in different orientations (forward and reverse, on opposite strands)
2. Pseudogenes and gene fragments often intermingled (repeat content very dense)
3. Coding genes can overlap in opposite orientations
4. Some genes may contain complete genes within their introns

Answer 8

Draw and label the diagram on slide 20.

Answer 9

1) Promoter: TF and RNA pol binding site (TATA vs TATA-less)
2) Introns and exons (coding)
3) 5’ UTR (regulates translation)
4) Start codon (ATG)
5) Splice sites (GT-AG introns vs the rare AT-AC introns)
6) Splice enhancers (exonic, intronic)
7) Stop codon (TAA, TAG, TGA)
8) 3’ UTR (mRNA stability and localisation)
9) Polyadenylation signal (sequence)
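Several of these features are plain sequence motifs, so they can be located with simple string operations. The sketch below is illustrative only: the function names `classify_intron` and `annotate_mrna` and the example sequences are hypothetical, it assumes the usual convention that an intron begins with its donor dinucleotide and ends with its acceptor dinucleotide (GT...AG for most introns, AT...AC for the rare class), and it assumes translation starts at the first ATG of the mRNA, which is not always true in real transcripts.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")

def classify_intron(intron):
    """Classify an intron by its first two and last two bases."""
    intron = intron.upper()
    donor, acceptor = intron[:2], intron[-2:]
    if (donor, acceptor) == ("GT", "AG"):
        return "GT-AG"
    if (donor, acceptor) == ("AT", "AC"):
        return "AT-AC"
    return "non-canonical"

def annotate_mrna(mrna):
    """Split a spliced mRNA into 5' UTR, coding sequence and 3' UTR.

    Assumption: the first ATG is the start codon; the first in-frame
    stop codon after it ends the coding sequence.
    """
    mrna = mrna.upper()
    start = mrna.find("ATG")
    if start == -1:
        return None
    for pos in range(start + 3, len(mrna) - 2, 3):
        if mrna[pos:pos + 3] in STOP_CODONS:
            end = pos + 3
            return {
                "five_prime_utr": mrna[:start],
                "cds": mrna[start:end],
                "three_prime_utr": mrna[end:],
            }
    return None  # no in-frame stop codon found

# Hypothetical transcript: 5' UTR + coding sequence + 3' UTR.
print(annotate_mrna("GGCACC" + "ATGGCTGAATAA" + "CCAATAAAGG"))
print(classify_intron("GTAAGTTTTCAG"))
```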

Answer 10

Draw and label the diagram on slide 21.

Answer 11

1. DNA is transcribed to RNA in the nucleus
2. The spliceosome removes the introns from the pre-mRNA in the nucleus to yield a mature mRNA, which is then exported to the cytoplasm
3. Specific sequences affect splicing:
   4. Splice acceptor/donor sites occur at intron/exon boundaries
   5. Enhancers/silencers occur within introns and exons and can affect splicing in specific tissues (SR proteins)
   6. SR proteins can direct the inclusion and exclusion of specific exons
   7. The mixture of SR proteins differs from tissue to tissue

Answer 12

Draw and label the diagram on slide 22.

Answer 13

1. Exon skipping
2. Intron retention
3. Alternative 5' donor or 3' acceptor
4. Mutually exclusive exons
5. Alternative promoters
6. Alternative splicing and polyadenylation
Understand all forms and draw the diagrams on slide 23.
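Three of these forms (exon skipping, intron retention, mutually exclusive exons) can be mimicked by choosing which segments of a pre-mRNA end up in the mature transcript. This is a minimal sketch with hypothetical segment names and sequences; it only joins strings and says nothing about how the spliceosome actually chooses splice sites.

```python
def splice(segments, keep):
    """Join the segments named in `keep`, preserving their genomic order."""
    return "".join(seq for name, seq in segments.items() if name in keep)

# Hypothetical pre-mRNA: exons (E*) and one intron (I1) in genomic order.
# E2a and E2b are treated as mutually exclusive alternatives.
segments = {
    "E1": "ATGGCC",
    "I1": "GTAAGTTTTCAG",
    "E2a": "GATTTC",
    "E2b": "CATCTG",
    "E3": "AAGTAA",
}

# Exon skipping: the middle exon is left out entirely.
exon_skipping = splice(segments, {"E1", "E3"})

# Intron retention: intron I1 stays in the mature transcript.
intron_retention = splice(segments, {"E1", "I1", "E2a", "E3"})

# Mutually exclusive exons: each isoform carries exactly one of E2a / E2b.
mutually_exclusive = [splice(segments, {"E1", alt, "E3"}) for alt in ("E2a", "E2b")]

print(exon_skipping)
print(intron_retention)
print(mutually_exclusive)
```

Alternative donor/acceptor sites could be modelled the same way by splitting an exon into a core segment and an optional extension.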

Answer 14

1. Large variation in gene size (2 kb to 2 Mb)
2. Large variation in protein sizes
3. Large variation in UTR lengths (3’ generally longer than 5’)
4. Many genes have alternative first exons with different 5’ UTRs
See the table on slide 24.

Answer 15

1. Approximately 50,000-100,000 genes were originally predicted in the genome
2. The genome sequence (draft published in 2001) showed far fewer: 20,000-25,000
3. Alternative splicing helps explain the difference (some genes can produce >10 different proteins)
4. Common features can be identified in genes with related functions
5. Cell surface receptors, for example, have:
   * Leader sequence (to direct proteins to the cell surface), ~20 amino acids
   * Extracellular domains (of different families; one example is Ig, SH-linked)
   * Number of EC domains can vary
   * A stalk region
   * A membrane anchoring sequence and/or transmembrane sequence
   * An intracellular domain (of different families), which delivers signals

Answer 16

1. Leader sequence (to direct proteins to the cell surface), ~20 amino acids
2. Extracellular domains (of different families; one example is Ig, SH-linked)
3. Number of EC domains can vary
4. A stalk region
5. A membrane anchoring sequence and/or transmembrane sequence
6. An intracellular domain (of different families), which delivers signals

Answer 17

Draw the image on slide 25 (gene segment and features).

Answer 18

1. Cellular processes
2. Metabolism
3. DNA replication/modification
4. Intracellular signalling
5. Cell-cell communication
6. Protein folding and degradation
7. Transport
8. Multifunctional proteins
9. Cytoskeletal/structural
10. Defence and immunity
11. Miscellaneous function
12. Transcription/translation
Look at the graph on slide 26.