Bioinformatics lecture Flashcards

1
Q

The extent to which two sequences are the same

A

Identity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Lining up two or more sequences to search for the maximal regions of identity in order to assess the extent of biological relatedness of homology

A

Alignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The relatedness of sequences

A

Similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A fixed set of commands in a computer program

A

Algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A space introduced in alignment to compensate for insertions or deletions in one of the sequences being
compared

A

Gap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Similarity attributed to descent from a common ancestor

A

Homology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The sequence presented for comparison with all other sequences in a selected database.

A

Query

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The genetic sequence database sponsored by the National Institutes of Health.

A

GenBank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

describes the number of matches
to the query by chance when searching a database of a
particular size.

A

E- value (Expect value)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

study on evolutionary relatedness among species by comparing homologies and differences in gene
sequences

A

Phylogenetics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  • A field which uses computers to store and analyze
    molecular biological information.
  • application of tools of computation and analysis to the capture and interpretation of biological data.
A

BIOINFORMATICS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  • Allow the storage and management of large biological data sets
  • Data is being generated at a much greater pace than its
    analysis (Human Genome Project)
A

CREATIO OF DATABASES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Determine relationships among members of large data users

A

DEVELOPMENT OF ALGORITHMS AND STATISTICS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
  • Transcriptomics
  • Microbiomics
  • Metabolomics
  • Genomics
  • Proteomics
A

BRANCHES OF BIOINFORMATICS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
  • Retrieving DNA sequences from databases
  • Computing nucleotide compositions
  • Identifying restriction sites
  • Designing polymerase chain reaction (PCR) primers
  • Identifying open reading frames (ORFs)
  • Predicting elements of DNA/RNA secondary structure
  • Finding repeats
  • Computing the optimal alignment between two or more DNA
    sequences
  • Finding polymorphic sites in genes (single nucleotide
    polymorphisms, SNPs)
  • Assembling sequence fragments
  • Creation and visualization of 3D structure models for
    biological molecules of significance.
A

BIOINFORMATICS APPLICATIONS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q
  • Microbial genome applications
  • Molecular medicine
  • Personalized medicine
  • Gene therapy
  • Drug development
  • Antibiotic resistance
  • Evolutionary studies
  • Waste cleanup
  • Biotechnology
  • Climate change studies
  • Alternative energy sources
  • Crop improvement
  • Forensic analysis
  • Bio-weapon creation
  • Insect resistance
  • Improve nutritional quality
  • Veterinary science
A

BIOINFORMATICS APPLICATIONS IN VARIOUS FIELDS

17
Q

THREE EARLIEST DNA SEQUENCE AND PROTEIN DATABASES

A
  • DDBJ (DNA DataBank of Japan)
  • EMBL (European Molecular Biology Lab)
  • Genbank (USA)
18
Q
  • Contain original data in the form of primary sequence data
    or structural data as submitted by the scientific community.
  • Examples: GenBank, EMBL, DDBJ, SWISS-PROT and PIR
A

PRIMARY DATABASES

19
Q

Contain information that has been
process and derived from the raw data available in primary
database

A

SECONDARY DATABASES

20
Q
  • A way of rearranging sequences of DNA, RNA or protein to identify regions of similarity.
A

SEQUENCE ALIGNMENT

21
Q

To understand functional, structural, or
evolutionary relationships between the sequences

A

identify regions of similarity

22
Q

TYPES OF SEQUENCE ALIGNMENT

A
  • Pairwise - compare two sequences
  • Multiple- compare more than two sequences
23
Q

compare more than two sequences
o MUSCLE
o MAFFT
o CLUSTAL Omega

24
Q

compare two sequences
o EMBOSS WATER
o BLAST

25
Matching the residues (bases or amino acids) of two sequences across their entire length o matches the identical sequences o The two sequences are treated as potentially equivalent
Global alignment
26
matching of two sequences from regions which have more similarity with each other o The two sequences may or may not be related o Purpose ▪ To see whether a substring (a part) in one sequence aligns well with a substring (a part) in the other sequence o Applications: ▪ Searching for local similarities in large sequences (e.g., newly sequenced genomes) ▪ Looking for conserved domains of motifs in two proteins
Local alignment
27
multiple sequence alignment tool that arranges the sequences of DNA, RNA or protein to identify regions of similarity
MUSCLE
28
* finds regions of local similarity between sequences. * the amino acid sequences of proteins or the nucleotides of DNA sequences. * compare a query sequence with a library or database of sequences, and identify library sequences that resemble the query sequence above a certain threshold. * can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families.
- BLAST BASIC LOCAL ALIGNMENT SEARCH TOOL (BLAST)
29
* In BLAST o you supply one or more query sequences and it compares nucleotide or protein sequences to sequence databases * In a multiple alignment o you supply multiple sequences to be aligned to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences
BLAST DIFFER FROM MULTIPLE SEQUENCE ALIGNMENT
30
you supply one or more query sequences and it compares nucleotide or protein sequences to sequence databases
BLAST
31
you supply multiple sequences to be aligned to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences
multiple alignment