Week 3 Flashcards

1
Q

_________ the analysis and comparison of genomes from different species.
Purpose:
-how species evolved
-function of genes and non coding regions

A

comparative genomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

___________ is a measure of relatedness

A

homology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

____________ derived from a common ancestor

A

homology • Homology can also refer to evolutionarily related structures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

IS Sequence similarity alone is homology?

A

No, sequence similarity does not mean common ancestor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

_________ type of homologs, different species

A

orthologs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

________: type of homologs. within the same species

A

paralogs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why does it matter if two sequences are homologous/ similar?

A

similar sequences typically code of a common function. This allow us to construct phylogenetic trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

sequence conservation implies ________________

A
functional conservation. 
Sequence alignment is key to: 
-finding important regions 
-determining function 
-uncovering the evolutionary forces
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Genes aren’t the only conserved sequence:

A
  • lots of non coding regulatory elements

- mutational drift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The greater the conservation, _______________

A

the more relevant the residue/ region is to function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

conserved noncoding sequences probably represent _______________________

A

functional elements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

__________ is reconstructing history of changes, the goal is to infer past history that produce a set of Morden sequences

A

phylogeny

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the law of parsimony

A

the most simplistic is the most likely outcome.

ex: the smallest number of mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

____________ one base is replaced by another

A

pont mutations (substitution)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

_________ one or more bases are inserted. Caused by copying errors

A

insertions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

____________ one or more bases are removed. Caused by copying errors

A

deletions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is polymerase slipping

A

The slipping of DNA polymerase II from the DNA template strand at the report region and the subsequent reattachment to a more distant site

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

________ a directed acyclic graph

A

a tree

  • there exists a single unique path between any pair of nodes
  • the lengths of the path corresponds to the evolutionary distance (extend of sequence similarity) for some types of trees
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Phylogeny standard assumptions:

-sequences diverge by speciation represented usually as _______________

A

bifurcation events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Phylogeny standard assumptions:

sequences are essentially ___________ once they diverge from their common ancestor

A

independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Phylogeny standard assumptions:

The probability of observing one nucleotide at the same site in the ______________

A

future depends only on the current nucleotide at the site

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Phylogeny standard assumptions: different sites (character within a sequence) evolve __________

A

independently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

phylogenetic trees can be _________________

A

between or within species

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Phylogenetcis applied to the 1990 HIV dentist case

A

compared to viral sequences from the dentist three of his HIV+ patient and two HIV+ local controls. It showed that 2 of 3 HIV patients were closer to the dentist than other local controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Human to human genetic variation is estimated to b around __________________
0.5% of the genome
26
Human genetic variation: | -Every human has essentially the __________
same set of genes
27
But there are different forms of each gene known as ______________
alleles ex: blue vs brown eyes genetic diseases such as cystic fibrosis or Huntingtons disease
28
___________ single nucleotide polymorphisms
SNPs
29
___________ are the most simple form and most common source of genetic polymorphism in the human genome
SNPs single nucleotide polymorphisms
30
There are two types of nucleotide base substitution results from SNPs:
- transition | - transversion
31
_________ substitution between purine (A,G) or between pyrimidines (C,T). Constitutes two thirds of all SNPs
Transition
32
__________ substitution between purine and a pyrimidine
transversion
33
To be defined as a single nucleotide polymorphism it has to ___________
have a population frequency >1%
34
SNPs occur every
-200 bases on average and most have no phenotypic effect
35
What are synonymous with SNPs
signal that doesn't cause changes in the amino acid | non synonymous does cause changes in amino acid
36
_________ SNP that is an amino acid change
missense mutation
37
_______________ SNP that is an amino acid change
nonsense mutation
38
If 4% of the human SNPs that have been found fall within a coding region yet only about 3% of all DNA is coding does this mean genes are "mutation magnets"
No because this is based on research that focuses primarily on coding regions so there might proportionally be more SNPs discovered here than in non coding regions
39
What is the primary difference between a polymorphism and a mutation?
polymorphism and mutation occur in different prevalence in a population. Polymorphisms myst occur in at least 1% of population
40
What are the similarities between polymorphism and mutations?
polymorphisms start as mutations during evolution. There exists both heterozygous and homozygous polymorphisms or mutations
41
________ can occur anywhere in the genome. The longer the repeat, the more polymorphic it is. Are the basis for DNA fingerprinting, forensic testing and paternity testing.
polymorphic repeats
42
___________ DNA finger printing uses repetitive sequences that are highly variable called variable numbers tandem pets VNTR. VNTR loci are very similar between closely related humans, but so variable that unrelated individuals extremely
polymorphic repeats
43
polymorphic repeats are found less frequently in ___________
genes | -this is because they have been implicated in several inherited genetic disorders such as myotonic dystrophy.
44
Why are most repetitive elements within genes found in factor of 3?
They occur in the coding regions. Codons occur in 3s.
45
SNPs can be found that are linked to any ________
disease alleles
46
SNPs can be found as markers for __________
genetic diseases and diagnosis
47
__________ is the global study of gene expression at the RNA level
transcriptomics
48
Gene expression. distinguishes:
- gender and age - physiological status - tissue and cell type - response to stimuli - health and disease
49
Gene expression is often measure as the
amount of RNA that is exported out of the nucleus
50
mRNA s often used as a proxy to study ____________
protein levels
51
highly. expressed mRNA's correlated well with ________________
protein levels
52
Evolution of technology: | Northern Blot: ____________
single genes
53
Evolution of technology: | RT-PCR: ______________
multiple genes
54
Evolution of technology: | Microarrays: _______________
whole transcriptome
55
Northern plot is a laboratory technology that is sued to detect a specific RNA sequence in a blood or tissue sample using ______________
radioactive labeling
56
Evolution of technology: | Next generation sequencing _______________
whole transcriptome
57
What. are the two types of microarrays
- spotted or cDNA arrays | - oligonucleotiide array
58
What are the microarray principles?
- array design - hybridization - imaging
59
What are the basics of microarrays: What are the essential components
- probes | - targets
60
What are the 4 steps of Microarray analysis
- experiment and data acquisition - data normalization - data analysis - biological interpretation
61
What is the main application of cDNA microarrays?
compare gene expression in two samples of cells
62
cDNA Microarrays
1. Array Printing 2. Sample Preparation 3. Analysis (hybridize and scan)
63
What is the purpose of dye shaping in Microarray analysis
controls for dye bias. For example sometimes the green dye is just brighter than the red
64
Why do we have hundreds of copies of a single probe in microarray analysis
It Allows for the cDNA to hybridize in a dynamic range and between quantify gene expression levels
65
Miccrodsarrays __________ to show the presence of one, both, or none of the genes being tests
color mix
66
oligonucleotide arrays: | ______________ indicates only hybridization intension, high density, multiple probes per gene/mRNA
single color
67
In Affymetrix exon arrays: | Several probes for each exon allow for the detection of ____________
alternative splicing
68
sports are fried with software and _______________
normalized vs background
69
__________ represents the amount of hybridization. pixel intensity
spot intensity
70
_______ of two dyes are often used to compare two samples
ratios between Cy5 and Cy3
71
to find the red intensity:
=R foreground - R background
72
to find the green intensity
=G foreground - G background
73
Relative expression. of intensity
Log_2 (Red intensity/ Green intensity)
74
___________: is the non specific hybridization and the other chemical
background intensity
75
_________ focuses on small regions around the spot to determine the median/ mean pixel values in this regions. Most common approaches
local background detection
76
What are three possible reasons for experimental noise?
- non uniform spotting - efficency of dye incorporation into CDNA - Efficency of hybridization - dirt/dust - vibration - differences
77
Image processing noise if the image is __________ i.e. spot locations are not properly satisfied
incorrectly processed
78
_______ is correlation between duplicate spot intensities. Percentage of spots with negative signals, Distribution
array
79
_________ is the signal or noise ration
Spot
80
_______ is a statistical measurement of a relationship between two variables. Values range from +1 to -1
correlation
81
_________ accounts for experimental noise, Lab to Lab variation, scanner to scanner variation, accounts for all variation
normalization
82
Normalization allows comparison of __________________________ __________________________
expression levels across experiments (arrays) | -a major goal of microarray experiments
83
There are several methods of normalization
- global mean method - linear regression method - intensity dependent methods - variance model methods
84
Normalization using mean global intensity assumes that the total number of mRNA copies for all genes should one the same in _______________
the experiment and the controls
85
Normalization assumptions:
1. the tool number of mRNA copies per cell is constant | 2. Changes in gene expression are roughly symmetric at all average intensities
86
What are the procedures of normalization
1. Draw the M-A plot to check data distribution 2. Fit the M-A plot with lowest curve (locally weighted scatterplot smoothing) 3. Transform data using the formula (m'=M-c(A)
87
what are the advantages of normalized data
works well for normalizing skewed or nonlinear data
88
what are the disadvantages of normalizing data
more complex and difficult to calculate
89
What do the outliers represent in the lowest curve
These genes are often what we are going after. They are the genes that are differentially expressed. Meaning they are heavily expressed in one sample and not another
90
__________ is a curve that follows the trends of the data
lowest curve