PART II: GENES & GENOMES: SEQUENCE VARIATION AND ANALYSIS Flashcards

1
Q

What are examples of regulatory sequences like promoters and RBS respectively?

A
  • TTGACA-17bp-TATAAT

- GGAGG near the start codon of a gene encoding a protein

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What two things does gene annotation encompass?

A
  1. Predicting and MARKING the position of genes and OTHER elements on a genome sequence
  2. Predicting protein function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is involved in 1. Predicting and MARKING the position of genes and OTHER elements on a genome sequence (in particular with RNA features and protein coding genes) ?

A
  • RNA features: Predict function and location SIMULTANEOUSLY
  • Protein coding genes: Predict the location of genes on genome (Gene Finders)
  • Translate the encoded protein and predict function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is involved in 2. Predicting protein function?

A
  • Similarity to the characterised proteins

- “Hypothetical Proteins” (not similar to any characterized protein)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are features of protein coding genes in prokaryotes?

A
  • Contained in an ORF (Sequence b/w INITIATION and STOP codons, >50bp are ideal)
  • Initiation codon, Ribosome binding site, Minimum length
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Can prokaryotes contain introns?

A

-Yes but they are rare and can be polyscistronic (one gene: many proteins)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is usually the initiation codon in prokaryotes?

A
  • ATG (90%) –> Met
  • GTG (8%) –> Val
  • TTG (1%) –> Leu
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why aren’t ALL ORFS marked as genes?

A
  • Because if there are multiple potential ORFs with lots of stop codons (top) BUT there is a clear gene ‘overlap’ where a region ACTUALLY CODES for a gene, that will be the coding one
  • Gene overlap is VERY RARE so generally only the largest one will be coding ORF marked as a gene
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Is gene overlap for coding genes common?

A
  • NO it is RARE
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are Gene Finders?

A
  • Programs such as GeneMarkS, GLIMMER, Prodigal (find all ORFs in gene)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do Gene Finders do?

A
  • Identify a potential START codon (ATG but also GTG and TTG)
  • They check for CONTEXT with the RIBOSOME BINDING SITE
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can we predict protein function? (2 tools)

A
  1. Databases–> For protein sequences and function (GenBank)
  2. Using sequence SIMILARITY to predict function–> Proteins with almost same sequence are likely to have the same/similar function
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is BLASTp used for?

A
  • Protein query versus protein database
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is BLASTx used for?

A
  • Nucleotide query versus protein database (translated into 6 peptide sequences for each of the 6 reading frames)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the ‘Expect’ section in BLAST mean?

A

-That the likelihood of the match happening by chance is very close to zero (0.1 cut off generally)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What do the identities and positives mean?

A

-23% identical residues and 43% similar residues respectively

17
Q

What does it mean if there is a less than 10% identity between query sequence and database?

A
  • Similarity occurs by CHANCE (NOT RELATED)
18
Q

What does it mean if there is a 10-35% identity

A
  • MIGHT have related function but still low..
19
Q

What does it mean if there is a less than >35% identity between query sequence and database?

A
  • Probably have a related function
20
Q

Can the difference of just ONE amino acid change, cause a strain of P.multocida to be virulent?

A
  • YES!
  • Change from Leucine to Serine
  • changes Fis gene–> makes protein capsule –> contributes to virulence
  • Can prove it by complementation (putting the virulent strain into the avirulent strain and observing restoration of phenotype (capsule)