Chapter 12 Flashcards

1
Q

What is a genome

A

The entire complement of genetic information. Includes genes, regulatory sequences, and noncoding DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is genomics

A

Discipline of mapping, sequencing, analyzing, and comparing genomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is shotgun sequencing

A

A technique used by virtually all genomic sequencing projects. Random fragments of entire genome are cloned or amplified and sequenced to produce millions of sequence “reads”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can you sequence with shotgun sequencing

A

The organisms genome is broken up into random fragments, there will be so many fragments that every part of the genome will be sequenced. Computer algorithms assemble the overlapping reads into contiguous regions in order to produce the entire genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some difficulties with shotgun sequencing

A

Assembly of complete circular chromosome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When sequencing a circular chromosome how are gaps between contigs filled

A

They can be filled using PCR or cloning techniques or by sequencing methods that will produce very long reads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a closed genome

A

A genome that is complete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a draft genome

A

A fast and cheap method of sequencing but will contain gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is annotation

A

Converting raw sequence data into a list of genes present in the genome. The majority of genes encode proteins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a functional ORF

A

An open reading frame that encodes a protein. The ORFs are bounded by start/stop codons, Shine-Dalgarno sequences upstream of ATG start site.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How are ORFs used in genomics

A

You can compare new ORFs that have been identified by a computer to those previously found in other genomes, finding homologs suggests the gene is real

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is bioinformatics

A

Science that applies computational tools to DNA and protein sequences. Analyze, annotate, and compare homologs. And identify genes for metabolic pathways by comparison to those well-studied model organisms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is comparative analysis and what is it used for

A

Identifying genes by comparing similarities between sequences with previously identified genes in other organisms. Allows for predictions of metabolic pathways and transport systems. Can be used to determine how to grown an organism in a lab

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are homologous genes

A

Genes that may have a common evolutionary ancestor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are hypothetical genes and proteins

A

Only about 70% or less of total ORFs are detected and these are the uncharacterized ORFs. They encode for proteins that likely exist but whose function is unknown (nonessential genes). All essential genes have been identified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe a metagenome

A

The total gene content of all organisms present in an environment. Allow a variety of bioinformatic analyses to identify organisms, types of metabolisms, etc. Can be obtained by shotgun sequencing of total DNA from an environmental sample. Goal is to determine DNA sequences of organisms in a population and identify all of them. Sequencing the rRNA is primarily used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Describe a transcriptome

A

Provides data on expressed genes by measuring RNA. The entire complement of RNA produced under a given set of growth conditions (temperature, nutrients, other organisms)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are microarrays

A

Hybridization technique based on genomic sequence data for target organism. Thousands of gene probes are arrayed in a known pattern on solid-state chips or glass slides

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is RNA-seq

A

Direct shotgun sequencing of total RNA samples

20
Q

What can be learned from transcriptomes

A

Gene expression patterns for every gene in genome under different conditions and in different mutants. And expression of specific groups of genes under different conditions can yield clues to possible roles of unknown genes.

21
Q

What is proteomics

A

The study of all proteins present in cells/tissues under different conditions (proteins produce phenotypes). It combines genomic info with quantitative data on protein amounts

22
Q

What are the techniques used in proteomics to obtain data of all the proteins present

A

2D gel electrophoresis, liquid chromatography, and mass spectrometry

23
Q

Describe the 2D gel electrophoresis

A

On the horizontal acis they have been separated by their isoelectric point (the point where they have no charge) and on the vertical axis they have been separated by size.

24
Q

Describe mass spectrometry

A

Can be used to cut out individual spots and determine what protein is in there

25
Q

What are proteins with >50% sequence similarity

A

Typically have similar functions, and are probably homologs

26
Q

What are proteins with >70% sequence similarity

A

Almost certainly have similar functions and are LIKELY homologs

27
Q

What are protein domains

A

Distinct structural regions or functional modules within proteins (transcription repressor protein would have a DNA-binding domain and a co-repressor binding domain). Have characteristic functions that can reveal a protein’s role/function, even in the absence of complete sequence homology

28
Q

What is an interactome

A

Complete set of interactions among proteins. Proteins that physically interact are likely to be involved in the same process or pathway. Is the combination of genome info and experimental data on protein-protein interactions.

29
Q

What is a metabolome

A

The complete set of metabolic substrates, products, and intermediates, and other small molecules in an organism

30
Q

What is used to identify and measure metabolites

A

Mass spectrometry

31
Q

What are homologous chromosomes

A

They are related in sequence to an extent that implies common genetic ancestry

32
Q

What are gene families

A

Groups of gene homologs. The whole gene or part of a gene is homologous and related to other genes. They can all transport molecules but they use different molecules for transporting.

33
Q

What are paralogs

A

Homologous genes within an organism, often the result of gene duplication. Ex. Cyanobacteria contain multiple psbA gnes: psbAI, psbAII, psbAIII. Often they all have the same function in a single species of bacteria but one gene may function better in the light while another gene may function better in the darkness

34
Q

What are orthologs

A

Homologous genes found in different organisms. They are found in different organisms but will have the same function

35
Q

What are gene duplications

A

They are the mechanism for the evolution of new genes.

36
Q

Describe the mechanism of gene duplication in evolution

A

Two gene copies can evolve independently: One copy keeps the original function and the other copy can evolve to acquire a new function.

37
Q

What is deletion

A

Deletions can eliminate genes no longer needed, genes that do not provide a selective advantage

38
Q

Where are housekeeping genes found and what does that tell us

A

Many housekeeping genes are found in all three domains of life showing that all organisms have common evolutionary roots (these housekeeping genes were originally found in LUCA)

39
Q

What is horizontal gene transfer

A

The transfer of genetic information between organisms, as opposed to vertical inheritance (from parental organisms). Can also occur between different species and different genera

40
Q

What does crossing phylogenetic boundaries mean

A

Bacteria can pick up genetic material from something that is very different as long as the codon usage is not so different that the gene is not expressed

41
Q

How do you detect horizontal gene flow

A

Presence of genes typically found only in distantly related species (finding a gene normally in G+ bacteria in just one strain of a G- bacteria). Presence of a DNA region with GC content or codon bias that differes significantly from remainder of genome. or presence of DNA sequence signatures that indicate mobile DNA element such as a transposon or virus

42
Q

What are transposons and insertion sequences

A

Transposons can trasfer DNA between different organisms. Transposons and insertion sequences (IS) may also mediate large-scale chromosomal changes within a single organism. Multiple ISs are often found in a genome. Recombination between related IS elements can result in large-scale chromosomal rearrangements

43
Q

What are chromosomal islands

A

Region of bacterial chromosome of foreign origin (HGT) that contains clustered genes. They usually contribute specialized functions not essential for growth

44
Q

What are pathogenicity islands

A

Chromosomal islands containing genes for virulence

45
Q

What are some specialized functions that are not essential for growth that chromosomal islands are responsible for

A

Virulence, biodegradation of unusual compounds and symbiosis