Week 2 Flashcards

1
Q

_______ is the science of genes, heredity, and the variation of organisms.

A

Genetics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

_______________ attempts the study of large scale genetic patterns across the genome for a given species. It deals with the systemic use of genome information to provide fide answers in biology, medicine, and industry

A

Genomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

___________ estimated cost 3 billion dollars and it took 15 years. It began in 1988 and the first draft was announced in 2000 with the more complete eversion released in 2003

A

Human genome project

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

____________

is the reeecniquea used for the human genome project

A

Shotgun sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the first step of shotgun sequencing?

  • break the genomic DNA into pieces
  • cloning into plasmid vectors
  • collection of clones=library
  • transformation to E coli
A

Library generation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Question? Why do we need to introduce ecoli into the library regeneration

A

To replicate or amplify the library i.e. asexual reproduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the second step of shotgun sequencing?

A

Template preparation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why do we use antibiotic for select plasmids during template preparation

A

it makes the cell it is injected with resistant to antibiotics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the third and final step of shotgun sequencing?

A

DNA sequencing

*chain termination DNA sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The DNA sequencing in shot gun sequencing is similar to _________________

A

PCR reaction

The reaction mix requires: template DNA, Taq polymerase, dNTPs, ddNTPs, and a primer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How hod ddNTPs work as terminators of DNA sequencing?

A

Because they lack hydroxyl groups, it is missing the 3’ hydroxyl group. Without this group, they can not be extended.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

___________: containing all four of the standard dNTPs and the DNA polymerase template DNA and primer

A

sequencing reactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

During DNA replication there needs to be a higher concentration of dNTPS as apposed to ddNTPS why?

A

You dont want your sequence to terminate prematurely. Higher levels of dnTPS allow for more extension of DNA molecules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the two conditions that must be satisfied to have DNA products of all sizes

A
  • more dNTPS than ddNTPS

- adequate materials (primers, polymerases)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

_________ is used on DNA fragments to determine size.

A

Gel electrophoresis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Traditionally the mix of sequencing reactions (DNAs) is run on a denaturing ___________ which can resolve bands differing in size by _________

A

page gel

1 nucleotide

*based on their size they will be in different locations of the Gel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is is called if you did not cover all the nucleotides? What is this called? How is it fixed?

A

This is called a sequencing error. This is fixed using an arbitrary value/ variable such was N. Then we use statistical modeling to find N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the fourth step of shotgun sequencing?
-needs overlapping sequence,
requires 7-9x coverage of the human genome (3 billion bases -> needed 21-27 billion bass to provide adequate fragment overlap)

A

Assembling fragments

19
Q

What is the fifth step shotgun sequencing?
Generation of open reading Frame -> homology searches
-> putative ID frameshift detection -> function assignment metabolic pathways gene families -> DNA molds regulatory elements repetitive elements -> comparative egenoimisc

A

Annotation

*comparative genomics

20
Q

_____________ shows a table of different gnome and how they differ between species

A

genome size comparison

21
Q

___________ the era of science that we are currently in is large scale sequencing which has become quite routine. The next challenge in genomics in deciphering the sequencing data

A

post sequencing era

22
Q

Different types of gene finding

A
  • RNA genes: tRNA, rRNA, snoRNA, snRNA, microRNA

- protein coding genes:

23
Q

__________:

contains no introns, simpler regulatory features

A

prokaryotic protein coding genes

24
Q

__________:

  • exon intron structure
  • complex regulatory features
A

eukaryotic

25
Finding Eukaryotic genes computationally:
- content based methods - feature based methods - comparative methods - combined methods
26
_____________: GD content, hexameter repeats, composition statistics, codon frequencies
content based methods
27
__________: donor sites, acceptor sites, promotor sites, start/stop codons, polyA signals, lengths
Feature based methods
28
____________ is using sequencing homology and set searches
comparative methods
29
content based methods calculate the statistical probability of a region being _____________________________ based on genomic sequences
protein coding
30
________________ cytosine nucleotide followed by a guanine nucleotide
CpG sites
31
At a dinucleotide site C and G are
next to each other not base pairs
32
the p in CpG notation refers to ________________
the phosphodiester bond between C and G nucleotides.
33
_____________ are regions of the genome with a higher frequency of CG dinucleotides (not base pairs) than the rest of the genome
CpG islands
34
there is a higher GC content in the ____________
coding regions
35
___________ is the percentage of G's and C's in a DNA molecule
GC content
36
__________ unequal usage of synonymous codons in the coding regions is a universal feature of the genomes
codon bias
37
___________ are linked to a taxonomy tree to allow comparative analysis of the codon usage frequencies
codon usage trees
38
the ________ gives the relative frequency of usage per thousand
the 4th column
39
_________ vary in length in species
intron and exon lengths
40
As GC content increase ________________ remains constant while __________________ decreases sharply
intron length remains constant exon length sharply decreases
41
________ can be used to calculate the probability later
intron composition
42
________ is based on identifying gene signals.
feature based methods
43
_____________ of x and y is the score of an optimal alignment of x and y under a score function S. We denote it by F(x, y)
Alignment Score
44
The ______________ is the alignment with the higher alignment score
optimal alignment