Lecture 33 Flashcards

1
Q

What is genomics?

A

The study of entire genomes, including the areas of:
Organisation and structure of genomes
Comparisons of genomes among organisms
interactions among genes within a genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is bioinformatics?

A

The management, organisation and analysis of large volumes of biological data to answer questions never anticipated by the investigator or data collector

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the key differences between gene sequencing and genome sequencing?

A

Gene sequencing uses lab bench biology with PCR while genome sequencing uses automation and robots to give high throughput with computational processing of short sequence reads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the features of whole genome shotgun sequencing?

A

The sequencing is done first, mapping is done after this using the short sequences obtained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the process of genome shotgun sequencing?

A

Break the genome into random locations
Clone the randomly selected fragments
Sequence the clones (using primers that give overlapping sequence reads)
assemble these into contigs
Assemble contigs into scaffolds and use these to estimate the complete sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a contig?

A

A segment of sequence obtained by overlapping two existing known segments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is ordered clone sequencing done?

A

Mapping is done first then the sequence is obtained
Library is searched for sequenced clones with similarities in restriction enzyme recognition sequences
Overlaps are arranged to form a physical map
the minimal number of overlapping clones required to give the minimal tiling pathway is found
Those clones are then sequenced to allow assembly of the genome sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is massively parallel sequencing?

A

New sequencing technologies that produce gigabases of data in very short times
These include the screening of genomes from several individuals, characterising the transcribed RNA in a cell and surveying the microorganisms in an environmental sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What occurs on the computational side of pyrosequencing?

A

Converting the light sequences to DNA sequences is a major computational task
There is a relatively high error rate so it is critical to filter data and correct errors.
This is performed by a computer where it deletes unusually long or short reads
The largest known sequence is then aligned to a known genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the four methods of finding genes?

A

Open reading frames, Statistical predictions, Expressed genes and codon usage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do open reading frames help us find genes?

A

An open reading frame is the genomic region from an initiation point to the stop codon in prokaryotes (This method cannot be used in eukaryotes due to introns)
As the start codon may also code for an internal methionine the longest distance must be found
Important considerations in this method is that both strands need to be considered and each strand has 3 possible reading frames so there are 6 total possibilities
Open reading frames may overlap due to chance occurrence of initiation and STOP codons in other reading frames

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can statistical predictions be used to find genes?

A

Building a statistical model to describe gene features such as the location of exon/intron splice sites
This can then be applied to an unknown sequence to predict the locations of the feature. The model will give the probability or likelihood that the feature is at any particular location in the sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can expressed genes be used to help find genes?

A

Exons can be identified through cDNA which when aligned with genomic DNA will reveal exons
Or expressed sequence tags which are fragments of mRNA in a cell type or tissue representing a part of the transcribed genome which will reveal the location of exons through alignment with genomic DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can codon usage help find genes?

A

As there is degeneracy in the genetic code, not all codons for the same amino acid are used with equal frequency within genes
In this way a different frequency of the three base strings is expect in exons than seen in different reading frames or non-transcribed regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can gene function be predicted?

A

As there is shared ancestry among organisms we can use sequence comparisons to find shared genes, shared gene products, and shared gene functions
We can then make an inference of occurrence, location, function of a gene through comparison

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is BLAST?

A

A tool which will search public data bases of sequences for sequences similar to the query or unknown sequence
Proteins can maintain a simialr structure with as little as 35% of sites having identical amino acids allowing this tool to be a powerful predictor of homologs and function