World of Genomics Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the two types of grooves in dna?

A

Major grooves and minor grooves.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What bonds are between the bases holding the two dna strands together?

A

hydrogen bonds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What bonds are between bases on a single strand of DNA?

A

strong covalent bonds.

This means the dna strands can be melted and annealed back together. This is done in PCR.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is DNA?

A

DNA is a polymer of nucleotides consisting of adenine (A), cytosine (C), guanidine (G) and thymine (T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

DNA molecules are…

A

…double-stranded and the two strands ‘pair’ together with complementary sequences forming the pairs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can we sequence DNA?

A

using dideoxynucleotides we can sequence DNA using the chain termination method
the use of robotics has automated this so that millions of base pairs can be analysed per day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What has sequencing dna led to?

A

we have access to large amounts of DNA sequence – primary data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how does information flow in the cell?

A

information flows from DNA in the genome to RNA in the gene transcript to protein which is generally the functional end of information flow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What sources do we get primary data from?

A

We can get primary data then from three sources:

1 – the DNA sequence itself – sequencing genes - the genome

2 – the mRNA – the transcriptome

3 – proteins – via protein sequencing – the proteome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the Transcriptome?

A

the sum total of all transcripts in a cell/organism/tissue/organ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the proteome?

A

the sum total of all proteins in a cell, organism, tissue, or organ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

DNA is transcripted to…

A

…DNA transcriped to mrna which is translated to rna and proteins.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data from genome sequencing is converted into…

A

…text containing strings of
A, C, G and T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In genome sequences/sequencing, what is the main thing we are concerned with?

A

It is the order of the letters (A,C,G,T) that we are concerned about here

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Where is data generated from genome sequencing projects deposited?

A

the data generated from genome sequencing projects is deposited in databases which can be ‘searched’ by users of the database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

One of the main key points of modern genomics is…

A

…studying gene expression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Eahc cells contain the same genes but…

A

…differ by which genes are turned on or off. This causes cell differentiation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Knowing the sequence does not…

A

…tell us alot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

why do we need to study gene expression?

A
  • Studying gene expression allows us to understand what particular genes encode and what they do.
  • If you can understand what cells have the genes turned on or off, you can deduce what the gene does.
  • For example, if a gene is only turned on in brain tissue, it is likely to be a gene to do with the brain.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

In cells, some genes will be…

A

…transcribed and others turned off

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Some genes will have high expression, and some will…

A

…be expressed at low levels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Describe differential gene expression.

A
  • Some genes transcribed, some not transcribed. (turned on or turned off)
  • Some genes having high levels of expression, some genes having low levels of expression.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

It is differential gene expression which defines each cell in terms of:

A

1 – it’s particular function

2 – it’s developmental stage

3 – the response to environmental cues

4 – it’s genotype (wildtype or mutant)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is qualtiative?

A

On or off

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Describe how development state/stage changed expression.

A

During develpotment of red blood cells, when immature, particular genes are on, when mature, particular genes are off. Development state changes expression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Whats an example of how we can study gene expression?

A

studying gene expression using northern blotting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

WHat is northern blotting?

A

When people invented a way to detect rna, they called it a northern blot. This is used to determine RNA levels/molecules of a gene.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is a southern blot?

A

first person to develop a way to detect dna molecules by radial dna was guy called ed southern. This plot was called a southern blot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Whats a western plot?

A

When people invented a way to detect proteins, they called it a western blot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Why would we want to study gene expression?

A

imagine we have a gene that we are interested in and we want to know which tissues it is expressed in (i.e. we want to know about its regulation)

one way that we can do this is to use a technique which examines the quantity of mRNA molecules from that gene in different tissues

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Describe studying gene expression.

A

we can isolate RNA molecules

we can separate them with gel electrophoresis

we can then identify them on a gel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

How does gel electrophoresis work?

A

In gel electrophersis, dna separate out according to molecular weight when an electrical current is run through it, because the negative charge of the dna is attracted to the positive charge of the current.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Why do we use northern blots instead of gel electrophoresis when studying gene expression?

A

Gel electrophoresis does not tell us a lot on its own since we cannot distinguish the different dna. This is where northern blots are used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Describe northern blots.

A

RNA is transferred to a nylon or nitrocellulose membrane

specific RNA molecules can be detected by hybridisation as long as you have a DNA probe that is detectable

it is detectable if labelled with radioactive nucleotides or labelled with a molecule that can be detected by antibodies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

what is the purpose of the β-tubulin lane in northern blots?

A

works on cytoskeleton microtubules.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

When is a northern blot used?

A

if we just want to look at individual genes

37
Q

When can northern blots only reasonably be done?

A

if you have a hypothesis which suggests that these genes may be up-regulated – otherwise which genes do you test your northern blot with?

38
Q

When might we want to look at the whole transcriptome / more than just one gene?

A

when we want to look at the response of all genes to a certain condition

39
Q

Give an example of when we want to look at multiple genes ?

A

this is a good example – there are mutant mice that are obese – this stems from a mutation in a particular gene

in order to see what effect this gene has on the mouse it is necessary to see what other genes are turned up or turned down – how could we do this?

40
Q

if we were interested in a given tissue type and wanted to know what were the genes expressed in that tissue does that mean we need to examine each gene individually?

A

we can examine all genes together using microarrays

41
Q

What turns rna into dna?

A

Rna is turned into dna by reverse transcriptase.

42
Q

What can microarrays help us understand?

A

Using this, we can understand what genes are turned on or off in certain diseases. This allows us to determine how diseases such as cancer comes about. Then we can look at developing new potential treatments.

43
Q

What is an array in microarrays?

A

an array is basically a large number of ‘spots’ of a DNA probe that is covalently linked to a substrate such as a glass slide or a silicon chip

44
Q

What are arrays constructed by?

A

the arrays are generally constructed by robots using either small (25bp) oligonucleotide probes or larger cDNA probes

45
Q

How is data dealt with?

A

in order to deal with this amount of information we need to use computers and sophisticated software

this is where the science of bioinformatics comes in

46
Q

we can use DNA sequence data to …

A

…construct relationships of organisms

47
Q

we can use phylogenetic trees to …

A

…examine relationships of proteins to each other within the genome

48
Q

we all know about the human genome project but…

A

…what is less known is that many other species now have their whole genomes sequenced

49
Q

as sequencing of DNA gets cheaper …

A

…more genomes can be sequenced

50
Q

the cost of the human genome project was …

A

…in terms of billions of dollars yet only 10 or so years later the price has dropped to a few thousand dollars per genome

51
Q

What is Moore’s Law?

A

this states that processor speeds, or overall processing power for computers will double every two years

52
Q

Why might a persons genome be sequenced?

A

A persons genome can be sequenced to see what drugs a patient will respond best to.

53
Q

What does Gene Ontology give us?

A

it gives us a logical means of searching databases – all genes are assigned GO terms

54
Q

the genes that are differentially expressed in a human breast cancer are arranged according to …

A

their biological function – in many cases these genes encode proteins common to signalling/enzymatic pathways

55
Q

What are paralogues?

A

Genes that are members of a gene family

56
Q

the different globins are …

A

…expressed at different times during development

57
Q

Foetal globins have…

A

…higher oxygen affinity.

58
Q

Why do genes duplicate?

A

in many cases duplication arises due to errors in either meiosis or DNA replication

59
Q

In meiosis there can be a…

A

…mismatch during alignment at the metaphase plate and in DNA replication slippage may occur

60
Q

In meiosis and dna replication what type of duplication occurs?

A

tandem duplication occurs where the copy sits on a chromosome next to the original gene

61
Q

because of the presence of gene duplication in more complex genomes it gives space for …

A

…genes to evolve new functions – this in part has been cited as one of the reasons for increasing complexity over time

62
Q

What happens as the size of prokaryotic genomes increase?

A

as the size of prokaryotic genomes increases then more of their genes tend to be members of gene families

63
Q

in more complex eukaryotic genomes the percentage of genes in gene families is …

A

… even larger

64
Q

Percentage of genes in gene families in more complex eukaryotic genomes is larger than…

A

…the percentage of genes in gene families in prokaryotes

65
Q

through evolutionary history there is a tendency for organisms to …

A

…become more complex

66
Q

Gene duplication may be important in…

A

…the increase in complexity

67
Q

‘Genecards’ is a …

A

…simple entry point to databases

68
Q

having the genome sequence in itself doesn’t …

A

…give too much information.

  • the genome needs to be annotated
69
Q

the genome sequence needs to be annotated:

A

1 – gene prediction – computer programs/BLAST/experimental

2 – gene annotation (structural and functional) - experimental

70
Q

What are computer programs able to predict?

A

computer programs are able to predict the presence of genes from raw genome sequence data

71
Q

What do computer programs use to predict the presence of genes from raw genome sequence data ?

A
  • sequence that appears to encode a protein that is homologous to known proteins
  • known intron/exon junction sites
  • known promoter sequences/regulatory elements
72
Q

Genome sequence is stored in…

A

…databases that are publicly accessible.

these can then be searched for potential genes or potential similarity to known genes

73
Q

What is the usual way to search for homology?

A

the usual way for searching for homology is to use an algorithm called BLAST

74
Q

What does BLAST stand for?

A

(Basic Local Alignment Search Tool)

75
Q

How are results shown on BLAST?

A

using NCBI’s BLAST tool the results are shown graphically with colour coding for matches.

the results are also shown as an ‘E value’

76
Q

What is the E value?

A

the Expect value (E) is a parameter that describes the number of hits you can “expect” to see by chance when searching a database of a particular size

77
Q

How does E value decrease?

A

decreases exponentially as the Score (S) of the match increases

78
Q

What does E value describe?

A

Essentially, the E value describes the random background noise.

79
Q

an E value of 1 assigned to a hit can be …

A

…interpreted as meaning that in a database of the current size one might expect to see 1 match with a similar score simply by chance.

80
Q

The lower the E-value, or the closer it is to zero…

A

…the more real the match.

81
Q

What is the E value the likelihood of?

A

Likelihood of getting a match “as good as this match” based on random chance alone.

82
Q

What are E values very dependant on?

A

E-values are very dependent on the query sequence length and the database size.

83
Q

Short identical sequence may have a …

A

…high E-value and may be regarded as “false positive” hits.

84
Q

Describe E-value < 10e-100

A

identical sequences - long alignments across the entire query and hit sequence

85
Q

Describe 10e-50 < E-value < 10e-100

A

almost identical sequences - a long stretch of the query protein is matched to the database.

86
Q

Describe 10e-10 < E-value < 10e-50

A

closely related sequences - could be a domain match or similar

87
Q

Describe 1 < E-value < 10e-6

A

could be a true homologue but it is a grey area

88
Q

Describe E-value > 1

A

proteins are most likely not related

89
Q

Describe E-value > 10

A

hits are most likely junk unless the query sequence is very short