Week 3 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Genome

A

all DNA and identification of all DNA elements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Transcriptome

A

All transcripts expressed (list plus analysis of expression)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Proteome

A

all proteins expressed (list plus analysis of modification)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Bacterial Genome to bacterial proteome

A

Large scale ORF; Looking for open reading frames in the bacteria.

Take the protein files and build a database which we would call the proteome of the bacteria.

Simple for bacteria because of the fact that DNA contains the coding region that is not interrupted.

So you can go from DNA to the protein coding capacity of that DNA very simply.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Bacterial Genome to bacterial proteome

A

Large scale ORF; Looking for open reading frames in the bacteria

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Eukaryotic genome to eularyotic proteome

A

Large scale ORF finder we can’t go directly from the genome to the proteome.

Because of splicing.

we need the transcriptome to get to the proteome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Transcriptome?

A

All expressed RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

major problem with annotating the genome particularly determining the proteome

A

A major problem with annotating the genome particularly determining the proteome (total coding capacity of the genome), is the modification that RNA undergoes in eukaryotes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Eukaryotic mRNA

A

Extensively processed

5’ prime cap (sevenmethylguanine, sugar, 5’ to 5’ triphosphate)

AUG first ORF codon.

Presence of the poly-A-tail helps us annotate the proteome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Reverse transcriptase

A

The DNA copy is made reverse transcriptase which requires a DNA primer.

Use an oligo dT primer that hybridizes with the poly-A-tail.

Therefore the total transcriptome is not represented.

Before nanopore this was the only way to sequence rna.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Proteome comes form

A

Translation of mRNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Another processing event that causes a large amount of problems with annotation is

A

RNA polymerase transcribes DNA into the primary RNA transcript, that is not the translated transcript.

Intronic sequences removed through splicing.

Those intronic sequences are removed and we have the production of a mature messenger RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

cDNA is simply a

A

complementary copy of the mature mRNA after the intronic sequences are removed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A large amount of the genome is not expressed

A

intragenic regions which are not transcribed and intronic regions that are transcribed and spliced out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Spliced cDNA sequence

A

A DNA sequence of the expressed sequence that sequence will align to the genomic sequence in a broken pattern.

determine what the cDNA sequence encodes to send this information to the proteome databse.

we can’t easily get this information from the genomic sequence because the sequence is encode within these three exons and is seperated from one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Alternative Splicing

A

Genes undergo alternative splicing.

When you align different cDNA sequences to the genome you find that some genes that these alignment are quite different from one cDNA to another.

Indicating that they came from transciripts that have undergone alternative splicing

This gene produces six distinct messenger rna transcripts.

That encode three distinct polypeptides.

When you align this sequence to drosophola DNA you ifnd six different patterns of alignments due to six different splicing patterns of the mRNA transcripts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Alternative splicing benefit

A

increases the number of proteins that can be encoded by a single gene.

one gene that produces one premessenger RNA that can be spliced in multiple ways, all three exons are included in the mRNA, but one of 12 exon 4 can be added.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Alternative poly A sites

A

Different poly A sites result in Exon 1 be spliced to either exon 2 or 3.

3’ end is cleaved at alternate positions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Alternative promoters

A

Results in certain exons being included or excluded from transcription

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Exon included/excluded/mutually exclusive

A

exon can be included or excluded

you can have one exon or the other just not both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Alternative 5’/3’ splice site

A

Earlier or later splicing at the 5’ or 3’ end.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Retained Intron

A

In some messages splicing occurs such that the intron remains in the mature mRNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Major goals of RNA seq Analysis

A

Count the relative number of transcripts in the sample.

Determine the structure of the transcripts in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Differential Cell Expression

A

Distinctive set of mRNA expressed by a cell.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Goals of sc RNA seq

A
  1. Deterine the poly A+ transcriptome of individual cells.
  2. Useful in the study of development and human disease
  3. Determine cell type by determining the genes and the phenotype/function they perform
25
Q

sc RNA sequencing

A
  1. Suspension of cells + microparticle + lysis buffer are mixed in a microfluoridics apparatus with oil.
  2. Oil encapsulates one cell and microparticle into a droplet.
  3. Cell is lysed by lysis buffer
  4. Polyadenylated RNA is hybridized to the oligoT primer on the microparticle
  5. Barcoded primer bead: PCR handle/cell barcode/UMI
  6. Reverse transcribe using mRNA as the template.
  7. Amplify the STAMPs through PCR and then sequence the (paired end read)
  8. Align cDNA to genome to determine the genes present
  9. Group results by cell
  10. Count unique UMIs for each gene in each cell
26
Q

Changing pattern of gene expression development

A

When you start off as a single cell you have one transcriptome, but as the cells specialize during developement you start to ge epxression od different patterns of genes in each cell.

27
Q

Proteome

A

Catalogue of proteins expressed by an organism

What proteins are unique to the organism

What is the function of the protein

28
Q

What Proteins Are Unique to the Organism or shared?

A

All life is related therefore many genes are shared.

29
Q

Shared genes

A

Homologous genes

30
Q

Orthologs

A

Homologous genes between different species.

Have the same common ancestor.

31
Q

Paralogs

A

Homologous genes within the same species.

Result of a duplication of a gene.

32
Q

What is the function of the proteome?

A

if an orthologous protein is well characterized in one organism then it may be reasonable to propose that all the orthologous proteins share this function.

33
Q

Conserved domain

A

Can suggest the biochemical function of the protein in the proteome,

33
Q

Conserved domain

A

Can suggest the biochemical function of the protein in the proteome,

34
Q

Interactome

A

determine all the interactions between protein.

35
Q

Interactome

A

An interact is the result of a systematic analysis of the proteome.

36
Q

Two Major methods

A

Yeast two hybrid screen

Affinity purification and Mass spectomery

37
Q

Yeast Two Hybrid Screen

A
  • Completely genetic Method of detecting interactions passed on the transcription factor GAL4
  • GAL4 binds to a UAsgal4 and can drive gene expression of reproter gene (LacZ encodes beta-galactosidase)
  • Results in blue yeast cells
  • GAL4 is made up of two domains (DNA binding and activation domain)
  • To get expression of the lacz reporter gene the DNA binding domain and the activation domain have to be in the same molecucle. ( both are necessary)
38
Q

Yeast two hybrid screen check

A

Seperate the DNA binding domain from the DNA activation domain.

Fuse the DNA binding domain to the bait protein.

Activation domain is fused to the prey protien.

If the two proteins do not interact the reporter gene is not expressed colonies are white.

In the case of interaction we get expression for the reporter gene resulting in the blue colony.

39
Q

Affinity Purification and Mass spectometry

A

Set up strains which express one protein with an AP tag.

Purify protein attached to the tag (tag is genetically fused to the protien)

Thousands of proteins are fused to the AP tag.

Purify protein that is attached to the AP tag if it’s attahed to another portein in a protein complex then that protein will be purified along with it.

40
Q

How can genomes vary?

A

Genome Size
Genome content/genome number
Genome structure

41
Q

Genome structure variation

A

Type

Shape (can have a complex mixture of linear and circular genomes)

Number of pieces

42
Q

Chromosome number

A

Varies alot even within a species (fusion) or individuals.

43
Q

Why does genome structure matter?

A

Advantage of circular genomes; start from one point and around.

Linear genomes are hard to replicate because they have telomeres, 3’ ends are difficult we get shorter chromosomes.

44
Q

Exteneded Telomeres

A

Telomerase is an RNA template which is used to provide the sequence for eleongaring the ends, created a set of repeats.

45
Q

Another solution to shortening of DNA during replication

A

Chromosome ends are enclosed single stranded loops. Helps overcome end replication problem and protects telomere.

45
Q

Another solution to shortening of DNA during replication

A

Chromosome ends are enclosed single stranded loops. Helps overcome end replication problem and protects telomere.

46
Q

Genome Size

A

Total length of genetic instruction with a biological compartment.

47
Q

Units of Genome Size

A
Nucleotide (Single stranded)
Base (single and double stranded)
Base pair (double stranded)
48
Q

What is genome size?

A

One full haploid set, never measure duplicated information.

48
Q

What is genome size?

A

One full haploid set, never measure duplicated information.

49
Q

Complexity Bacteria/Virus/Humans

A

There are many different genome sizes, consider wherether the simplicity of viruses and complexity of hymans is reflected in genome parameters

50
Q

Genome Size and Complexity

A

Complexity is associated with either genome size or gene number.

51
Q

Genome size and the number of genes (bac and arch)

A

of genes in bacteria and archea vary depending on the genome being exmaines.

linear relationship between the number of genes and genome size

52
Q

Biggest Genomes (Eukaryotes)

A

Plants have immense variation in genome size.

So do single cell amoeba.

Genome size is not associated with complexity.

Immense amount of variation at the level of organelle and virus genomes.

Eukaryotic microbes are immensenly variable.

Genome size is not associated with complexity

53
Q

What do big genomes have that little genomes do not have (discrepancy between size and complexity)

A

Non coding DNA

Difference in gene (coding) to base ratio

As genome size increases the number of noncoding bases increase.

only conding sequence ends up in mature mRNA.

54
Q

Lungfish and parasite

A

Lungfish (130 Mbp): 99.9% non coding

parasite (2 Mbo) : >90% coding

55
Q

% Non coding

A

Varies within a genome

Chromsome 19 has 25 genes per Mbs.

Y has 2 gene per Mbs.

56
Q

Why lots or little non-coding DNA

A

Non coding DNA occasionally has a function.

Some organisms who are in a race to replication lose noncoding sequence in ording to replicate more quickly.

Mobile DNA has spread throughout the genome increases its length and the number of non-coding genes.

57
Q

Complexity is

A

Not reflected in teh number genes (some genonems might be duplications of the same gene)

No vast increase in the number of protein domains with the number of genes in the genome.