Sequence Alignment (short ver) Flashcards

1
Q

aligns two or more sequences to highlight their similarity, inserting a small number of gaps into each sequence (usually denoted by dashes) to align wherever possible identical or similar characters.

A

Basic Sequence Alignment Algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why compare biological sequence?

A

to obtain functional or mechanistic insight about a sequence by inference from another potentially better characterized sequence
to find whether two (or more) genes or proteins are evolutionary-related
to find structurally or functionally similar regions within sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

practical applications of sequence alignment include…

A

similarity searching of databases
assembly of sequence reads
mapping sequencing reads to a known genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Protein structure prediction, annotation, etc…

A

similarity searching of databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

reads into a longer construct such as a genomic sequence

A

assembly of sequence reads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

looking for differences from reference genome - SNPs, indels

A

resequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

mapping transcription factor binding sites via

A

ChiP-Seq (chromatin immuno-precipitation sequencing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

arguably the most fundamental operation of bioinformatics

A

Pairwise sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

sequence comparison is most informative when it detects _

A

homologs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

sequences that have common origin; they share a common ancestor

A

homolog

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

homologous sequences may either be:

A

orthologs or paralogs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or False: orthologs and paralogs are an example of all or nothing relationships

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Any pair of sequences may share a certain level of:

A

identity and/or similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Orthologs tend to have ____ while Paralogs tend to have ______

A

Orthologs - similar function
paralogs - slightly different function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

homolog produced by speciation that have diverged due to divergence of the organisms they are associated with.

A

orthologs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

homologs produced by gene duplication

A

paralogs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

ortho is a greek word that means

A

straight; implies direct descent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

they represent genes derived from a common ancestral gene that duplicated within an organisms and then subsequently diverged by accumulated mutation

A

paralogs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

greek word meaning along side of

A

para

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Why is determining orthologs vs. paralogs can be a complex problem?

A

gene loss after duplication
lack of knowledge of evolutionary history
weak similarity because of evolutionary distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

True or False: Homology implies exact same functions

A

False;

homology does not imply exact same functions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

may have similar function at very crude level but play a different physiological role

A

homology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

three types of sequence change that can occur during evolution

A

mutations/substitutions
deletions
insertions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

great tools to visualize sequence similarity and evolutionary changes in homologous sequences

A

alignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

represent mutations/substitutions

A

mismatches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

represent insertions and deletions

A

gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

one way to judge alignment is to:

A

compare their number of:
matches
insertions
deletions
mutationd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

reflect biological or statistical observations about the known sequences, and are frequently represented by scoring matrices

A

alignment scoring schemes

29
Q

if x1 and y1 is a match

30
Q

if x1 and y1 is a mismatch

31
Q

if either x1 and y1 is a gap

32
Q

the gap penalty is denoted as

33
Q

true or false: biologist often prefer parsimonious alignment

34
Q

number of postulated sequence changes is minimized

A

parsimonious alignment

35
Q

True or false: there may be more than one optimal alignment and these may not reflect the true evolutionary history of our sequences

36
Q

two commonly quoted matrices for pairs of aligned sequences

A

Sequence identity
sequence similarity

37
Q

typically quotes the percent of identical characters in the aligned region of two sequences

A

sequence identity

38
Q

typically, the score resulting from optimal pairwise alignment (there is dependence in parameters used like scoring scheme)

A

sequence similarity

39
Q

True or False: homology is not an all or nothing relationship, you can not have a percent homology

A

False;

homology is an all or nothing relationship

40
Q

True or False; homology is an all or nothing relationship, you can have a percent homology

A

False

you can not have a percent homology

41
Q

frequently used as an indicator of homology

A

high sequence similarity

42
Q

use to find genes and/or proteins with potentially similar or identical function

A

high sequence similarity

43
Q

can query a database of sequence by performing a series of pairwise alignments

A

high sequence similarity

44
Q

way of arranging the primary sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequence

A

sequence alignment

45
Q

aligned sequences of nucleotide or amino acid residues are typically represented as ____ within a matrix

46
Q

inserted between the residues so that residues with identical or similar characters are aligned in successive columns

47
Q

True or False; alignment can also be not an exact match

48
Q

True or False: Alignment can be based on edit distance

49
Q

True or False: Alignment is usually based on a similarity measure

50
Q

the number of changes requires to change one sequence into another is called the __

A

edit distance

51
Q

every character in the query (source) string lines up with a character in the target string

A

Global alignment

52
Q

attempt to align every resiue in every sequence, are most useful when the sequences in the query set are similar and of roughly equal size

A

global alignment

53
Q

may require gap (space) insertion to make strings the same length

A

global alignment

54
Q

a general global alignment based on dynamic programming

A

Needleman Wunsch algorithm

55
Q

Needleman-Wunsch algorithm is based on

A

dynamic programming

56
Q

an internal alignment or embedding of a substring into a target string

A

local alignment

57
Q

general local alignment method based on dynamic programming

A

Smith-Waterman algorithm

58
Q

Pairwise sequence alignment methods

A

brute force alignment
dot matrices
dynamic programming

59
Q

its objective is to arrange two sequences in such a fashion that pairs of matching characters between the two sequences are maximized

A

pairwise sequence alignment

60
Q

function that ranks or scores the characters being compared

A

substitution matrix

61
Q

simplesst case of pairwise sequence

A

brute force alignments

62
Q

make the brute force method unusable for all but the shortest sequence

63
Q

pair of related sequences often have insertions or deletions relative to one another, we therefore require

A

gapped pairwise alignment

64
Q

simple graphical method for pairwise alignment;

A

dot plot/dot matrix approach

65
Q

no scoring, so difficult to compare alternative alignments

A

dot plot/dot matrix approach

66
Q

can give visual sequence structure but requires human interaction

A

dot plot/dot matrix approach

67
Q

provides optimal solutions (but not necessarily unique solutions)

A

dynamic program algorithm

68
Q

much faster, widely used for database searches, may miss some pairws with low similarity

A

heuristic word or k-tuple approaches