Lecture 16 Genome Content Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Structural genomics:

A

organization and sequence of genetic information contained within a genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a genetic map?

A

locates genes or markers based on genetic recombination frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a physical map?

A

locates genes or markers based on the physical length of DNA sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the limitations to genetic mapping?

A

Recombination frequencies vary from one region of the chromosome to another so it is approximate.
Also has lower resolution bc recombination is difficult to observe between loci that are very close to each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What map has greater accuracy and resolution?

A

Physical maps based on DNA sequences or restriction maps, accurate down to a single base pair of DNA sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What was the 1st step in mapping the human genome?

A

detailed physical and genetic maps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define contig

A

A set of overlapping fragments that form a continuous stretch of DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are some of the ethical concerns arising out of the information produced by the Human Genome Project?

A

Privacy of a person’s genetic info and how such info will be used
Info used to discriminate against individuals with polymorphisms
Denied health insurace, limit opportunity for career etc
Genetic Information Nondiscrimination Act

“designer” babies, select embryos with desirable traits

Given that genes interact with each other and environment, and complex interaction lead to incomplete penetrance and expressivity, will ppl receive adequate counseling to properly interpret their genetic profiles?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a single-nucleotide polymorphism?

A

SNPs are single base-pair differences in the sequence of a particular region of DNA from one
individual compared to another of the same species or population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How are SNPs used in genomic studies?

A

SNPs are useful as molecular
markers for mapping and pedigree analysis and may themselves be associated with phenotypic
differences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a haplotype?

A

A haplotype is a particular set of neighboring SNPs or other DNA polymorphisms observed on a single chromosome or chromosome region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do different haplotypes arise?

A

They tend to be inherited together as a set because of
linkage. Meiotic recombination within the chromosomal region can split the haplotype and create new recombinant haplotypes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is linkage disequilibrium?

A

Linkage disequilibrium is the association of certain genetic variants (alleles) with each other via linkage on the same chromosome, so that these allelic combinations occur more frequently in the population than expected based on random assortment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does it result in haplotypes?

A

The particular combinations of alleles at different loci within a region of the chromosome are called haplotypes. Distinct haplotypes occur because modern human populations have not undergone enough recombination to thoroughly shuffle the allelic combinations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is copy-number variations?

A

The number of copies of DNA sequences varies from person to person
Copy number variation is the occurrence of more or less than the usual diploid copy number for a DNA sequence of longer than 1000 bp.
Most contain multiple genes and potentially affect the phenotype
ex. autism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do copy-number variations arise?

A

Such copy number variations could arise through deletion or

duplication of a segment of chromosomal DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are expressed-sequence tags?

A

ESTs
Markers associated with DNA sequences that are expressed as RNA
(reverse transcription to produce a set of cDNA fragments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Bioinformatics =

A

molecular biology + computer science

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Ab initio approach:

A

Scans sequence looking for features usually found within a gene

20
Q

Comparative approach:

A

Looks for similarity between a new sequence and sequences of known genes

21
Q

The ab initio approach finds genes by looking for:

A

common sequences found in most genes

22
Q

Computer algorithms (and humans) recognize or predict genes within genomic sequences by analysis of the following:

A
  1. the presence of matching cDNA or EST sequences in a database
  2. Open reading frames
  3. Sequences characteristic of promoters upstream of a start codon of an open reading frame
  4. Homology (sequence similarity) to previously characterized genes in the same or other species
23
Q

Metagenomics

A

Sequencing genomes of entire communities of organisms

24
Q

Synthetic biology

A

the creation from scratch novel organisms

25
Q

Functional genomics

A

characterizes what the sequences do - the Encode project

26
Q

Transcriptome:

A

all the RNA molecules transcribe from a genome

27
Q

Proteome:

A

All the proteins encoded by the genome

28
Q

Homologous sequences:

A

Genes that are evolutionarily related, derived from a common ancestor

29
Q

Orthologs:

A

Homologous genes in different species that evolved from the same gene in a common ancestor

30
Q

Paralogs:

A

Homologous genes arising by duplication of a single gene in the same organism

31
Q

What is the difference between orthologs and paralogs?

A

Orthologs are in different organisms; paralongs are in the same organism

32
Q

Describe how homology and sequence similarity is used to infer the function of a gene

A

For protein-coding genes, the DNA sequence is translated
conceptually into the amino acid sequence of the protein. The amino acid sequence of the protein then may yield clues to its function if it is similar to another protein of known function. For example, it is quite easy to recognize histones because their amino acid sequences are highly conserved among
eukaryotes. Even if the whole protein is not similar, it may have regions, or domains, that are similar to other domains with known functions or properties. Finally, the amino acid sequence may contain
small motifs or signatures that are characteristic of proteins with certain enzymatic activities or properties or subcellular localizations.

33
Q

Describe how phylogentic profile is used to infer the function of a gene

A

The coordinated absence or presence of clusters of genes in various species
implies that the genes in the cluster have related functions. For example, genes required for nitrogen fixation would all be present in nitrogen-fixing species but absent in other species.

34
Q

Describe how gene fusions are used to infer the function of a gene

A

In some species, genes of related function have undergone a fusion event to form a single multifunctional polyprotein. Then similar but separate component genes in other species can be presumed to have similar functions.

35
Q

Describe how gene clusters or operons are used to infer the function of a gene

A

In bacteria, genes in metabolic or functional pathways are often clustered together into an operon. Therefore, all the genes that are co-transcribed into a single polycistronic mRNA should have related functions.

36
Q

What is microarray?

A

A microarray consists of thousands of DNA fragments spotted onto glass slides in an ordered grid
(gene chips) or even proteins or peptides arrayed onto glass slides (protein chips). The identity of the
DNA or peptide at each location is known.

37
Q

How is microarray used to obtain info about gene function?

A

Gene chips are typically used in hybridization
experiments with labeled mRNAs or cDNAs to survey the levels of transcript accumulation for
thousands of genes, or even whole genomes, at one time. Peptide or protein chips can be used to identify protein-protein interactions or enzymatic activities or other properties of proteins.

38
Q

Reporter sequences:

A

Encoding an easily observed product used to track the expression of a gene of interest

Used with colored product, or antibody, or the reporter itself may be fluorescent

39
Q

How is mutagenesis screen carried out?

A

After random mutagenesis with chemicals or transposons, the mutant progeny are screened for phenotypes of interest. The mutant gene can be identified by cosegregation with molecular markers
or by sequencing the position of transposon insertion. To verify that the mutation identified is truly responsible for the phenotype, the same mutation can be introduced into a wild-type copy of the gene
via site-directed mutagenesis and the phenotype observed in the offspring.

40
Q

DNA content varies considerably among different multicellular organisms. Is this variation
closely related to the number of genes and the complexity of the organism? If not, what accounts for
some of these differences?

A

The “complexity” of an organism is not well-defined and thus difficult to quantify. DNA content can vary widely among related species, so there appears to be little relation between the “complexity” of an organism, the number of genes, and the DNA content. Large differences in DNA content may arise from differences in the frequency and size of introns, the abundance of DNA derived from transposable elements, and duplication of the whole or substantial parts of the genome in the evolutionary history of the species.

41
Q

Colinearity:

A

Presence of many genes in the same order in related genomes due to evolution for an common ancestral genome

42
Q

Segmental duplication:

A

A duplication involving a relatively large (greater than 1000 bp) stretch of chromosomal DNA
sequence.

43
Q

The human genome does not encode substantially more protein domains than do invertebrate
genomes, yet it encodes many more proteins. How are more proteins encoded when the number of
domains does not differ substantially?

A

The human genome contains proteins with many more combinations of domains, often featuring
multiple domains on a single protein.

44
Q

What is comparative genomics?

A

Comparative genomics analyzes the similarities and differences among genomes to achieve insights into genomic or organismal evolution or relationships between groups of genes and physiological
functions.

45
Q

How does proteomics differ from genomics?

A

Whereas genomics is the analysis of whole genome DNA sequences and their organization,
expression, and function, proteomics focuses on the complete set of proteins made by an organism

proteome varies within a species from cell
type to cell type, with stage of development, and with time in response to signals and other environmental stimuli. Proteins also undergo numerous modifications that affect protein localization
and function.