Chapter 21: Genomics, Bioinformatics, and Proteomics Flashcards

Question

proteomics

Answer 1

The study of the expressed proteins present in a cell at a given time.

Answer 2

A microarray-based method for the analysis of copy number variations in genomic DNA or in specific cell types, such as tumor cells.

Answer 3

A cloning vector in the form of a yeast artificial chromosome, constructed using chromosomal components including telomeres (from a ciliate) and centromeres, an origin of replication, and marker genes from yeast. YACs are used to clone long stretches of eukaryotic DNA.

Answer 4

Used model systems. Screened for natural and induced mutants. Mapping studies: linkage analysis to map genes. Required at least one mutant per gene to find it. Studies were difficult to perform and labor intensive. Some mutants are lethal, so the genes associated with them will not be found.

Answer 5

Move to molecular methods (1980s), away from classical methods. Genomic library clones pieced together. Clones sequenced.

Answer 6

In the clone-by-clone method, a genomic library is prepared, and clones are organized into genetic and physical maps by observing the inheritance pattern of genetic markers in heterozygous families. After the clones are arranged into physical maps, they are broken into smaller, overlapping clones that cover each chromosome. Each smaller clone is sequenced, and the genomic sequence is assembled by stringing together the nucleotide sequence of the clones.

Answer 7

In the shotgun method, a genomic library is constructed from fragments of genomic DNA. Clones are selected from the library at random, and sequenced. The sequence is assembled by looking for sequence overlaps between clones from different libraries. This is usually done by computer, using assembler software designed for genomic analysis.
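
The random sampling step of the shotgun method can be sketched in a few lines. This is a toy illustration, not a real sequencing pipeline: the genome string, read length, and read count are invented, and the helper names `shotgun_reads` and `coverage` are hypothetical.

```python
import random

def shotgun_reads(genome, read_len, n_reads, seed=0):
    """Sample n_reads random substrings of length read_len from genome."""
    rng = random.Random(seed)          # fixed seed so the sketch is repeatable
    last_start = len(genome) - read_len
    starts = [rng.randint(0, last_start) for _ in range(n_reads)]
    return [(s, genome[s:s + read_len]) for s in starts]

def coverage(genome_len, reads):
    """Count how many reads cover each genome position."""
    depth = [0] * genome_len
    for start, seq in reads:
        for i in range(start, start + len(seq)):
            depth[i] += 1
    return depth

# Toy genome (invented for illustration)
genome = "ATGCGTACGTTAGCCGATAGGCTTAACGGATCCGTA"
reads = shotgun_reads(genome, read_len=8, n_reads=20)
depth = coverage(len(genome), reads)
mean_depth = sum(depth) / len(genome)  # average number of reads per base
```

Because reads are chosen at random, some positions are covered many times and others possibly not at all, which is why real projects sequence to several-fold average coverage before assembly.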

Answer 8

National Center for Biotechnology Information. Repository of sequence and annotation data: genome sequence databases, protein databases, and bioinformatic tools, e.g., BLAST for sequence similarity searches.

Answer 9

A field that focuses on the design and use of software and computational methods for the storage, analysis, and management of biological information such as nucleotide or amino acid sequences.

Answer 10

Analyze and store vast amounts of data. Visualize data. Access data. Data mining. Many companies have developed bioinformatic software.

Answer 11

Genome sizes vary. Most are circular, but not all. Importance of plasmids, some essential: when is a plasmid a chromosome? Gene density is high, with little “wasted genome.” Not all operons contain genes from the same biochemical pathway, which was unexpected. Overlapping genes in eubacteria, also unexpected.

Answer 12

The origin and terminus of replication. The outer circle of bars represents genes transcribed in a clockwise direction, and the inner circle represents genes transcribed in a counterclockwise direction.

Answer 13

The Vibrio cholerae genome is contained in 2 chromosomes. The larger chromosome (chromosome 1) contains most of the genes for essential cellular functions and infectivity. Most of the genes on chromosome 2 (52 percent of 115) are of unknown function. The bias in gene content and the presence of plasmidlike sequences on chromosome 2 suggest that this chromosome was a megaplasmid captured by an ancestral Vibrio species.

Answer 14

This operon contains genes for protein synthesis (gatC), for DNA recombination (recA and recJ), for a motility protein (pilU), for nucleotide biosynthesis (cmk), and for lipid biosynthesis (pgsA1). This organization challenges the conventional idea that genes in an operon encode products that control a common biochemical pathway.

Answer 15

Organisms are typically extremophiles. Structurally similar to eubacteria, metabolically more similar to eukaryotes. Have histone chromosomal proteins; chromosomes may be organized into chromatin. Introns in tRNA genes.

Answer 16

A mosaic of organization patterns. Usually linear chromosomes. Mitochondrial genomes; chloroplast genomes in plants. Genome size highly variable. Low gene density. Introns present. Repetitive elements (up to 80% of the genome in maize).

Answer 17

Very different from other eukaryotic genomes. ~25% of genes in polycistronic “operons.” Large number of introns. Genes within introns.

Answer 18

In C. elegans, however, about 25% of all genes are part of operons.

Answer 19

A genetic unit consisting of one or more structural genes encoding polypeptides, and an adjacent operator gene that regulates the transcriptional activity of the structural gene or genes.

Answer 20

Small genome. Many gene duplications. Used as a model organism. Higher plants have larger genomes but approximately the same number of genes.

Answer 21

In large-genome plants, genes are located in clusters, separated by long, gene-empty spaces of repetitive DNA sequences. Within the gene clusters, the intergenic spaces contain many transposons. In the Arabidopsis genome, gene-empty regions have been lost, and transposable elements have been lost or reduced. The result is a much smaller genome with genes at a much higher density throughout the genome.

Answer 22

Only ~20,000 to 23,000 coding genes (5% of genome). 50% of the genome is repeat elements. Gene clusters and gene deserts. Wide range of intron numbers. Bacterial-derived genes in the human genome. Human genes tend to be large and contain multiple introns. Gene distribution is not even across chromosomes. Duplicated regions found on some chromosomes. Function of ~2/3 of genes determined.

Answer 23

These are based on similarity to proteins of known function. Among the most common genes are those involved in nucleic acid metabolism (7.5% of all genes identified), receptors (5%), protein kinases (2.8%), and cytoskeletal structural proteins (2.8%). A total of 12,809 predicted proteins (41%) have unknown functions, reflecting the work needed to fully decipher our genome.

Answer 24

The regions already sequenced are shown in red adjacent to each chromosome. Some of the disease genes identified on each chromosome are shown below the chromosome.

Answer 25

Human chromosome 21 and chimp chromosome 22 share 179 genes with coding sequences of identical length. These shared genes are 99.29% similar at the nucleotide level and 99.18% similar at the amino acid level.
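
Percent similarity between two coding sequences of identical length is simply the share of matching positions. The sketch below shows the calculation on two invented 20-base strings; they are not human or chimp data, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(a, b):
    """Percentage of positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must have identical length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)

# Invented toy sequences that differ at a single position
seq_a = "ATGGCCAAGGATTCGTAACC"
seq_b = "ATGGCCAAGGATTCGTAGCC"
identity = percent_identity(seq_a, seq_b)
```

The restriction to equal-length sequences mirrors the card's wording: comparing only coding sequences of identical length avoids having to align around insertions and deletions.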

Answer 26

There is much to learn as we sequence multiple genomes and move away from heavily studied “MODEL” organisms. Results may challenge our models of generalization of gene organization, mechanisms, etc.

Answer 27

Compare genomes to gain insight into genome evolution. Bacteria: 3.5 billion years ago. Eukaryotes: 1.4 billion years ago.

Answer 28

(a) In E. coli, gene density is high, and there are very few repetitive sequences. In eukaryotes (b-d), gene density is lower, and portions of the genome are occupied by repetitive DNA sequences. (b) A 50-kb region from chromosome III of yeast contains over 20 genes and little repetitive DNA. (c) A 50-kb region from human chromosome 11 contains 6 genes and stretches of repetitive DNA. (d) A 50-kb region of the maize genome surrounding the Adh locus. This gene is surrounded by long stretches of repetitive DNA.

Answer 29

Important in the origin and evolution of eukaryotic genomes. Increases genetic diversity. Results in multigene families, which arise by unequal crossover or by replication errors.

Answer 30

Ancestral duplication of an oxygen transport gene ~800 mya produced 2 sister genes. One became modern-day myoglobin (muscle oxygen carrier); the second became the ancestral globin gene. This second gene duplicated again ~500 mya, resulting in the alpha and beta globin gene families. Further duplication of alpha globin resulted in 3 alpha genes on human chromosome 16. Further duplication of beta globin resulted in 5 beta globin genes on chromosome 11.

Answer 31

About 700–800 million years ago (mya), a duplication event in an ancestral gene gave rise to two lineages. One led to the myoglobin gene, which in humans is located on chromosome 22. The other lineage underwent a second duplication event about 500 mya, giving rise to the ancestors of the alpha and beta subfamilies. Duplications about 200 mya produced the alpha and beta globin subfamilies. In humans, the alpha-globin genes are located on chromosome 16 and the beta-globin genes are on chromosome 11.

Answer 32

Zeta expressed only in the embryo. Alpha1 expressed in the fetus. Alpha2 expressed in adults.

Answer 33

3 genes expressed prior to birth. Delta and beta globin expressed after birth.

Answer 34

~20,000 genes in the human genome. How do we obtain the incredible diversity of immune system genes? Immunoglobulin gene subunit diversity. Differential splicing to increase diversity. Break & nibble mechanism to increase diversity (pages 616-618 in text). Result: an incredible amount of diversity in immune system genes to produce antibodies for antigens.

Answer 35

Antibodies (immunoglobulins): IgM, IgD, IgG, IgA, IgE; found on plasma B cells. Proteins produced by vertebrates as a defense against infection. Millions of different forms, each with a different binding site that specifically recognizes another molecule (antigen).

Answer 36

A molecule, often a cell-surface protein, that is capable of eliciting the formation of antibodies.

Answer 37

The molecule is Y-shaped and contains 4 polypeptide chains. The longer arms are H chains and the shorter arms are L chains. The chains are joined by disulfide bonds. Each chain contains a variable region and a constant region. The variable and hypervariable regions of a pair of L and H chains form a combining site that interacts with a specific antigen. Different combinations of chains create different Ig classes, e.g., IgE (kappa2 epsilon2 or lambda2 epsilon2).

Answer 38

Involved in fighting parasitic infections. Involved in allergic responses. Tetramer: [kappa2 epsilon2] or [lambda2 epsilon2].

Answer 39

Somatic recombination occurs in maturing B cells. Each mature B cell makes ONE type of light chain (kappa or lambda) and ONE type of heavy chain. An antigen stimulates a particular B cell with an antibody for that antigen. This produces a population of plasma cells with antibody for that particular antigen.

Answer 40

Lymphomas: different types, depending on the stage of B-cell development at which the cancer arises.

Answer 41

One set of L-V regions is joined to one of the joining (J) regions during B cell maturation. The joining event is imprecise: it happens over a six-base region, and bases are added or removed at the recombination region (“break & nibble”). In germ-line DNA, 70–100 different L-V (leader-variable) segments are present. These are separated from the J regions by a long noncoding sequence. The J regions are separated from a single C segment by an intron that must be spliced out of the initial mRNA transcript. Following translation, the amino acid sequence derived from the leader RNA is cleaved off as the mature polypeptide chain passes across the cell membrane.

Answer 42

One set of L-V regions is joined to one of the joining (J) regions during B cell maturation. The joining event is imprecise: it happens over a six-base region, and bases are added or removed at the recombination region (“break & nibble”). Transcription removes the other “J” regions and links to the constant (C) region. Finally, splicing removes intervening regions. In germ-line DNA, 70–100 different L-V (leader-variable) segments are present. These are separated from the J regions by a long noncoding sequence. The J regions are separated from a single C segment by an intron that must be spliced out of the initial mRNA transcript. Following translation, the amino acid sequence derived from the leader RNA is cleaved off as the mature polypeptide chain passes across the cell membrane.
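
The combinatorial part of this diversity is simple arithmetic: each L-V segment can in principle pair with each J segment. The sketch below uses the 70–100 L-V range from the card. The number of J segments is not given in the card, so `N_J_ASSUMED` is an assumption made purely for illustration, and the extra variation from imprecise joining is not counted.

```python
def vj_combinations(n_v, n_j):
    """Number of distinct V-J pairings when any V can join any J."""
    return n_v * n_j

# ASSUMPTION: the card does not state how many J segments exist;
# 5 is used here only as an illustrative value.
N_J_ASSUMED = 5

low = vj_combinations(70, N_J_ASSUMED)    # lower end of the 70-100 L-V range
high = vj_combinations(100, N_J_ASSUMED)  # upper end of the 70-100 L-V range
```

Imprecise joining (“break & nibble”) multiplies these counts further, because each pairing can produce several different junction sequences.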

Answer 43

Study of gene products: when and where they are produced, post-translational modification, and cellular localization. Proteome: the complete set of proteins expressed during a cell’s lifetime.

Answer 44

Description of protein-protein interactions within an organism. Aids in understanding pathways and interactions. May provide insight for therapeutic interruption of a pathway in disease treatment.

Answer 45

Interactome: protein interactions with each other. Kinome: interaction of kinase proteins, important in cancer research. Ionome: study of ions in organisms, e.g., Fe, Mn, Mg, K, Cu, Ca, Ni, S, etc.

Answer 46

Mice created by a process in which a normal gene is cloned, inactivated by the insertion of a marker (such as an antibiotic resistance gene), and transferred to embryonic stem cells, where the altered gene will replace the normal gene (in some cells). These cells are injected into a blastomere embryo, producing a mouse that is then bred to yield mice homozygous for the mutated gene.

Answer 47

The introduction of a null mutation into a gene that is subsequently introduced into an organism using transgenic techniques, whereby the organism loses the function of the gene. Often used in mice.

Answer 48

E.g., the nuclear pore complex. Example of genomics combined with proteomics to determine structure and function. Genomic information determined the genes involved in the pore complex. Proteomics determined the molecular architecture of the pore complex. Next: study protein-protein interactions (interactome) to determine what interacts with the nuclear pore complex.

Answer 49

Therefore, the chromosome must be broken into fragments before any sequencing can take place.

Answer 50

A plasmid is a small, circular DNA molecule found in bacteria in addition to the bacterial chromosome. Each time a bacterium reproduces, it replicates each of its plasmids. To clone DNA using plasmids, molecular biologists insert DNA fragments into plasmids and then introduce the plasmids into bacteria. Because bacteria reproduce so rapidly, they can make more than a million copies of a DNA fragment in less than 24 hours.

Answer 51

Overlap enables the computer to match up the fragments and determine how they fit together.

Answer 52

Genome sequences are assembled by lining up overlaps in sequence between fragments. The resulting sequence is said to be contiguous, and the assembled fragments are called a contig.
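
Assembly by overlap can be illustrated with a small greedy merger. This is a minimal sketch under strong assumptions (error-free fragments, exact overlaps, a single strand, no repeats, no fragment contained inside another); real assembler software is far more elaborate. The function names and toy fragments are hypothetical.

```python
def overlap(a, b, min_len=3):
    """Length of the longest suffix of a that equals a prefix of b."""
    max_len = min(len(a), len(b))
    for k in range(max_len, min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0

def greedy_assemble(fragments, min_len=3):
    """Repeatedly merge the pair of fragments with the largest overlap."""
    frags = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    while len(frags) > 1:
        best = (0, None, None)
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best[0]:
                        best = (k, i, j)
        k, i, j = best
        if k == 0:
            break  # no remaining overlaps: leave separate contigs
        merged = frags[i] + frags[j][k:]
        frags = [f for idx, f in enumerate(frags) if idx not in (i, j)]
        frags.append(merged)
    return frags

# Toy fragments (invented) that tile one short sequence
fragments = ["ATGCGTAC", "GTACGTTA", "GTTAGCCG"]
contigs = greedy_assemble(fragments)
```

When every fragment overlaps a neighbor, the result is a single contig; fragments with no detectable overlap remain as separate contigs, which corresponds to gaps in a real assembly.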

Answer 53

A genomic library is a collection of large DNA fragments from one genome cloned into vectors, such as bacterial artificial chromosomes (BACs).

Answer 54

Genomic libraries contain fragments too large for one sequencing reaction. Therefore, before they can be sequenced, they must be cut into smaller fragments and recloned, in a process called subcloning.

Answer 55

Once an entire genome sequence has been determined, computer algorithms can be used to analyze it for important sequences, such as open reading frames, introns, and regulatory sequences. This process is called genome annotation.
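
One of the simplest annotation steps, scanning for open reading frames, can be sketched as follows. This is an illustrative toy, not a real gene finder: it looks only at the three forward-strand reading frames, ignores introns, and uses an invented sequence. `find_orfs` and `min_codons` are hypothetical names.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(dna, min_codons=3):
    """Return (start, end, frame) for ATG...stop ORFs on the forward strand.

    Coordinates are 0-based and end-exclusive; the stop codon is included.
    min_codons counts the codons before the stop codon, including ATG.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None                   # close it and keep scanning
    return orfs

# Invented toy sequence with one short ORF
seq = "CCATGAAATTTGGGTAACC"
orfs = find_orfs(seq)
```

Real annotation pipelines also scan the reverse complement and combine ORF evidence with other signals, such as regulatory sequences and similarity to known genes.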

Answer 56

A recombination map is made by determining how often chromosomal locations, or loci, recombine into new combinations during meiosis. Although recombination frequency depends on the number of nucleotides separating two linked loci, different parts of the genome have different rates of recombination. This means that a recombination map is only an estimate of the true (physical) map. Despite this caveat, recombination maps are useful for ordering large clones into a physical map if loci on the clones can be identified and lined up with a known recombination map.
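
The arithmetic behind a recombination map can be sketched briefly: recombination frequency is the fraction of recombinant progeny, and 1% recombination is conventionally treated as one map unit (centimorgan). The progeny counts and pairwise distances below are invented for illustration, and the function names are hypothetical.

```python
def map_distance_cM(recombinants, total):
    """Map distance in centimorgans: percent of progeny that are recombinant."""
    return 100.0 * recombinants / total

def order_three_loci(dist):
    """Order three loci from pairwise distances.

    dist maps frozenset({x, y}) -> distance; the pair that is farthest
    apart flanks the remaining (middle) locus.
    """
    outer_pair, _ = max(dist.items(), key=lambda kv: kv[1])
    loci = set().union(*dist.keys())
    middle = (loci - outer_pair).pop()
    left, right = sorted(outer_pair)
    return [left, middle, right]

# Invented counts: 170 recombinant progeny out of 1000
distance = map_distance_cM(170, 1000)

# Invented pairwise distances (cM) among loci A, B, and C
pairwise = {
    frozenset("AB"): 5.0,
    frozenset("BC"): 12.0,
    frozenset("AC"): 17.0,
}
order = order_three_loci(pairwise)
```

Because recombination rates differ across the genome, the distances estimate relative order better than they estimate physical spacing in nucleotides, which is the caveat the card describes.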

Answer 57

A nonsense mutation (a change from an amino acid codon to a stop codon) after the antibody binding site. A nonsense mutation (a change from an amino acid codon to a stop codon) before the antibody binding site. An altered amino acid codon in a protease site that prevents cleavage of the precursor protein.

Answer 58

A gene encoded in genomic DNA is expressed when RNA polymerase transcribes it into a primary RNA transcript. Transcription begins at a site on the DNA called the start of transcription and requires specific DNA sequences upstream of that site. These sequences recruit and stabilize the transcription complex, which includes RNA polymerase II.

Answer 59

Many eukaryotic genes include a TATA box (a sequence of nucleotides with the consensus sequence TATAAA) as part of the sequences that recruit the transcription complex.
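
Searching a sequence for the TATAAA consensus is a straightforward pattern match. The sketch below finds exact matches only; real promoters often deviate from the consensus, so this is a simplification, and the example sequence is invented.

```python
import re

TATA_CONSENSUS = "TATAAA"

def find_tata_boxes(dna):
    """Return 0-based start positions of exact TATAAA matches."""
    # A lookahead is used so that overlapping matches would also be reported.
    pattern = "(?=" + TATA_CONSENSUS + ")"
    return [m.start() for m in re.finditer(pattern, dna.upper())]

# Invented toy sequence containing two consensus matches
promoter_region = "GGCTATAAAGGCGCGTATAAATT"
positions = find_tata_boxes(promoter_region)
```

A match alone does not prove a functional promoter; its position relative to the start of transcription also matters.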

Answer 60

In the nucleus, introns are spliced out of the primary RNA transcript, and exons are joined together into a mature messenger RNA (mRNA) molecule.
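
Splicing can be pictured as joining the exon intervals of the primary transcript and discarding everything between them. In this sketch the exon coordinates are given rather than discovered, and the transcript is invented; `splice` is a hypothetical helper.

```python
def splice(pre_mrna, exons):
    """Join exon intervals (0-based, end-exclusive) in order; introns are dropped."""
    return "".join(pre_mrna[start:end] for start, end in sorted(exons))

# Invented primary transcript: exon 1 + intron + exon 2.
# The toy intron begins with GU and ends with AG, as most introns do.
exon1, intron, exon2 = "AUGGCC", "GUAAGUUUUCAG", "GAUUAA"
pre_mrna = exon1 + intron + exon2
exons = [(0, 6), (18, 24)]

mature_mrna = splice(pre_mrna, exons)
```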

Answer 61

The mRNA, once exported from the nucleus, will dock on a ribosome and be translated into an amino acid sequence beginning with the start of translation (a methionine codon, AUG).
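
Translation from the start codon can be sketched with the standard genetic code. As a simplification, this example starts at the first AUG it finds, which is not always the one a ribosome would use; the mRNA string is invented and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, with codons enumerated in U, C, A, G order
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(mrna):
    """Translate from the first AUG up to (not including) the first stop codon."""
    start = mrna.find("AUG")
    if start == -1:
        return ""                      # no start codon found
    protein = []
    for i in range(start, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":                  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# Invented toy mRNA: 5' leader, AUG GCC GAU, stop codon UAA, trailer
peptide = translate("GGAUGGCCGAUUAAGG")
```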

Answer 62

Because multiple AUG sequences may be present, the correct reading frame is favored by the interaction of sequences in the 5′ untranslated region with the ribosome.

Answer 63

Aligning an mRNA sequence to a genomic sequence reveals exon/intron boundaries. Aligning an mRNA sequence to a genomic sequence indicates the start of transcription of a gene.

Answer 64

The beginning of the mRNA corresponds to the first transcribed nucleotide, or start of transcription. A TATA box, if present, is typically 24-25 nucleotides upstream of the start of transcription. Also, mRNA has had introns removed, and comparing it to a genomic sequence reveals where exon/intron boundaries are.
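
The idea of locating exon/intron boundaries by comparing an mRNA to genomic DNA can be sketched with greedy exact matching. This is a toy under stated assumptions: error-free sequences, both written as DNA on the same strand, and exons long enough to contain the seed. Greedy extension can misplace a boundary by a base or two when the first bases of an intron happen to match the next exon, which real alignment tools resolve with splice-site signals. All names and sequences here are hypothetical.

```python
def map_exons(genomic, mrna, seed=4):
    """Greedy exact mapping of mrna onto genomic.

    Returns a list of (start, end) exon intervals in genomic coordinates
    (0-based, end-exclusive).
    """
    exons = []
    g = m = 0
    while m < len(mrna):
        hit = genomic.find(mrna[m:m + seed], g)   # locate the next exon
        if hit == -1:
            raise ValueError("mRNA segment not found in genomic sequence")
        length = 0
        while (m + length < len(mrna) and hit + length < len(genomic)
               and mrna[m + length] == genomic[hit + length]):
            length += 1                            # extend the exact match
        exons.append((hit, hit + length))
        m += length
        g = hit + length
    return exons

# Invented toy gene: upstream flank + exon 1 + intron + exon 2 + downstream flank
exon1, intron, exon2 = "ATGGCCAAG", "GTAAGTCTCTCAG", "CATTCGTAA"
genomic = "CCGG" + exon1 + intron + exon2 + "CC"
mrna = exon1 + exon2

exons = map_exons(genomic, mrna)
introns = [(exons[i][1], exons[i + 1][0]) for i in range(len(exons) - 1)]
transcription_start = exons[0][0]  # first genomic base matched by the mRNA
```

The first matched position marks the start of transcription, and the gaps between consecutive matched blocks are the introns.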

Answer 65

cleaving genomic DNA with restriction enzymes, separating the resulting fragments on an electrophoresis gel, transferring the separated fragments to a membrane (blotting), and exposing the membrane to a labeled nucleotide probe.
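
These steps can be mimicked in a toy simulation: cut at a restriction site, sort fragments by size as a gel would, and flag the fragments a probe would label. The sketch uses the EcoRI site GAATTC (cut after the first G) and models hybridization as an exact substring match on one strand, which is a deliberate simplification. The DNA and probe are invented, and the function names are hypothetical.

```python
def digest(dna, site="GAATTC", cut_offset=1):
    """Cut dna at every occurrence of site; return the fragments in order."""
    fragments, prev = [], 0
    pos = dna.find(site)
    while pos != -1:
        cut = pos + cut_offset          # EcoRI cuts between G and AATTC
        fragments.append(dna[prev:cut])
        prev = cut
        pos = dna.find(site, pos + 1)
    fragments.append(dna[prev:])
    return fragments

def southern_blot(dna, probe, site="GAATTC"):
    """Return (fragment length, probe_binds) pairs, largest fragment first."""
    fragments = digest(dna, site)
    by_size = sorted(fragments, key=len, reverse=True)  # gel separates by size
    return [(len(f), probe in f) for f in by_size]

# Invented toy DNA with two EcoRI sites
dna = "AAAAGAATTCCCGGGTTTACGTGAATTCTT"
bands = southern_blot(dna, probe="GGGTTT")
```

Only the fragment containing the probe sequence would appear as a labeled band; the other fragments are on the membrane but remain invisible.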

Answer 66

The type III allele promotes insulin gene expression in the thymus, increasing the probability that insulin-reactive T cells generated there will bind with self cells displaying insulin before those T cells are released. Lower insulin gene expression in the thymus (such as that seen with the type I allele) increases the probability that self-reactive T cells will escape the thymus and be carried by the blood to the pancreas, where they can target insulin-producing islet cells. The destruction of these pancreatic islet cells leads to the loss of the ability to produce insulin (type 1 diabetes).

Answer 67

the study of the entire genomes of organisms.

(93 cards)