2: Introduction to Bioinformatics (FINALS) Flashcards

1
Q

→ A field which uses computers to store and analyze molecular biological information

→ It is about finding and interpreting biological data online

Marriage between biology and informatics

→ Science of collecting and analyzing complex biological data

A

Bioinformatics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

→ A field where biology, mathematics, statistics, computer science, information technology, and other health sciences are merged into a single discipline to process biological data

→ Uses complex machines to read biological data at a much faster rate than before

A

Bioinformatics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 3 principal components of bioinformatics?

A
  1. Creation of Databases
  2. Development of Algorithms and Statistics
  3. The use of these tools for Analysis and Interpretation of various types of biological data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

3 Principal Components of Bioinformatics:

→ are like repositories or banks of biologic informations and are designed to collect archive, visualize, and arrange biologic data

→ Allowed the storage and management of large biological data sets

→ Enable scientist to have an intelligent data description, interpretation or retrieval of dat

A

Databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

3 Principal Components of Bioinformatics:

T or F

Data is being generated at a much greater pace than its analysis

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Example of bioinformatics:

→ made in the 1990’s

→objective is to sequence the entire human genome

→ Consist of about 3.2 billion base pairs

→ Finished in 2003

A

Human Genome Project

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

3 Principal Components of Bioinformatics:

Determine relationships among members of large data sets

A

Development of algorithms and statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

3 Principal Components of Bioinformatics: Development of algorithms and statistics

Large set of data are organized so relationships can be determined

A

Algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

3 Principal Components of Bioinformatics:

A concept under this is biological data

A

The use of these tools for analysis and interpretation of various types of biological data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

3 Principal Components of Bioinformatics: The use of these tools for analysis and interpretation of various types of biological data

→ Including DNA, RNA, and protein sequences, protein structures, gene expression profiles, and biochemical pathways

A

Biological data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sciences that attempt to describe a living organism in terms of “omics”

A

Branches of Bioinformatics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Branches of Bioinformatics:

Involve the description of sequences of entire genome

A

Genomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Branches of Bioinformatics:

Study of all RNA molecules in a living organism

A

Transcriptomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Branches of Bioinformatics:

→ Description of the entire complement of proteins in a living organism

→ Entire proteins found in a living organism

→ Study of the Sequence, 3D Structures, and other Properties of all Proteins

A

Proteomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Branches of Bioinformatics:

→ Pertains to microbes like viruses, fungi, parasites, bacteria

→ Genomes of microorganisms are described within a specific environmental niche?

A

Microbiomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Branches of Bioinformatics:

→ Involved description of chemical processes involving metabolites

A

Metabolomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Branches of Bioinformatics:

→ Pertains to microbes like viruses, fungi, parasites, bacteria

→ Genomes of microorganisms are described within a specific environmental niche?

A

Microbiomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Familiarize the DNA/RNA Bioinformatics Applications

A
  1. Retrieving DNA sequences from databases
  2. Computing nucleotide compositions
    Identifying restriction sites
  3. Designing polymerase-chain reaction (PCR) primers
  4. Identifying open reading frames (ORF)
  5. Finding repeats
  6. Computing the optimal alignment between 2 or more DNA sequences
  7. Finding polymorphic sites in genes (SNPs)
  8. Assembling sequence fragments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Familiarize other applications in bioinformatics given

A
  1. Sequence alignment and analysis
  2. Mapping and analyzing DNA, RNA, Protein, Amino acid, and Lipid sequences
  3. Creation and Visualization of 3D structure models for biological molecules of significance
  4. Genome annotation
  5. Genetic diseases
  6. Designer medicine
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Familiarize Applications in Various Fields

A

tignan niyo nlng sa ppt to huhuhu

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Why do we use Bioninformatics?

A

Saves time when doing real experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Importance of Bioinformatics:

T or F

Study should end by simulated experiment on computer instead of a real environment

A

F (Study might START by simulated experiment on computer instead of a real environment)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Importance of Bioinformatics: Identify the process

Simulated experiment on computer = ?

Primer optimized and used in amplification reaction =

A

Simulated experiment on computer = In Silico

Primer optimized and used in amplification reaction = Wet Lab

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Importance of Bioinformatics: Identify whether “In Silico” or “Wet Lab”

Target Identification

A

In Silico

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Importance of Bioinformatics: Identify whether "In Silico" or "Wet Lab" Primer Characterization
Wet lab
26
Importance of Bioinformatics: Identify whether "In Silico" or "Wet Lab" Assay Optimization
Wet lab
27
DNA/RNA Bioinformatics Applications: → sequence with start codon (AUG), until a stop codon UAG, UGA, UAA → predicting elements of DNA/RNA secondary structure
open reading frames (ORF)
28
Three earliest DNA Sequences and Protein Databases?
1. Nucleic acids 2. Protein 3. Other databases?
29
Three earliest DNA Sequences and Protein Databases: What is the database for Nucleic acids
International Nucleotide Sequence Database
30
Three earliest DNA Sequences and Protein Databases: Composition of International Nucleotide Sequence Database
1. DDBJ (DNA DataBank of Japan) 2. EMBL (European Molecular Biology Lab) 3. GenBank (USA)
31
Three earliest DNA Sequences and Protein Databases: What is the database for Protein?
Worldwide Protein Data Bank
32
Three earliest DNA Sequences and Protein Databases: Familiarize the other databases
1. Ensembl 2. Human metabolome Database 3. Gene Expression Databases 4. Phenotypic Database 5. RNA Databases 6. Amino acid/protein Databases 7. RNA Databases 8. Protein-Protein and other Molecular Interactions 9. Signal Transduction Pathway Databases 10. Bacterial DNA Databases
33
T or F In Gene Analysis Application, changes the sequence of the gene binge expressed always result to normal and healthy person
F (A DISEASE MAY ARISE due to changes the sequence of the gene binge expressed)
34
Gene Analysis Application T or F Sickle cell anemia results from point mutation of tyrosine to valine in beta-acid chain
F (substitution of GLUTAMIC ACID to VALINE)
35
This refers to genetic characteristics
Genotype
36
This refers to Physical Characteristics
Phenotype
37
Gene Analysis Application: → leads to sickle cell anemia → A recessive trait
Single Nucleotide mutation
38
Gene Analysis Application: Single Nucleotide mutation Normal Sequence: G-A-G (Glutamic Acid) Mutated: G-U-G (Valine) Which amino acid became mutated?
A
39
Gene Analysis Application: Single Nucleotide mutation If the Father and Mother are Heterozygous for sickle cell gene, how many are: a. children who will manifest the disease b. normal children c. children who are carriers
a. ¼ of children develop sickle cell disease b. ¼ are normal c. ½ are carriers
40
What are the 2 Bioinformatic Actvities?
1. Finding DNA/Protein Sequence 2. Sequence Alignment
41
To find gene or protein sequences online, what websites should be used?
1. Genbank 2. Protein Data Bank
42
To find gene sequences online, what website should be used?
Genbank
43
To find protein sequences online, what website should be used?
Protein Data Bank
44
→ A way of rearranging sequences of DNA, RNA, or protein to identify regions of similarity
Sequence Alignment
45
Sequence Alignment: What are the 2 factors involved where sequence alignment is made
Reference and Unknown Sequence
46
Sequence Alignment: Reference sequence is also known as what?
Known, Subject sequence
47
Sequence Alignment: Unknown sequence is also known as?
Query sequence
48
Sequence Alignment: Familiarize Importance of identifying regions of similarity
1. To understand functional, structural or evolutionary relationships between the sequences 2. help identify dissimilar regions of the DNA sequence useful for designing primers
49
Sequence Alignment: Familiarize Importance of identifying regions of similarity If sequences are similar, what does it mean?
they have similar functions or structure
50
Sequence Alignment: Familiarize Importance of identifying regions of similarity Can either mean belonging to the same group or distant relationship
Evolutionary relationship
51
Sequence Alignment: Familiarize Importance of identifying regions of similarity T or F Identifying similar regions of DNA sequence is useful for designing primers
F (Identifying DISSIMILAR REGIONS of DNA sequence is useful for designing primers)
52
2 Types of Sequence Alignment
1. Pairwise 2. Multiple
53
Types of Sequence Alignment: Compare two sequences
Pairwise
54
Types of Sequence Alignment: Compare more than two sequences
Multiple
55
What are the websites used in pairwise sequence alignment?
1. EMBOSS WATER 2. BLAST
56
What are the websites used in multiple sequence alignment?
1. MUSCLE 2. MAFFT 3. CLUSTAL Omega
57
What are the Types of Pairwise Sequence Alignments?
1. Global alignment 2. Local alignment
58
Types of Pairwise Sequence Alignments → Matching the residues (bases or amino acids) of two sequences across their entire length → The whole of DNA is aligned
Global Alignment
59
3 Types of Pairwise Sequence Alignments → Matching of two sequences from regions which have more similarity with each other → The two sequences may or may not be related → to see whether a substring (a part) in one sequence aligned well with substring (a part) in other sequence
Local alignment
60
Sequence Alignments → There multiple sequences being aligned → The residues are colored that differences can easily be seen
Multiple Sequence alignment: Clustal Omega
61
What type of Pairwise Sequence Alignments is appropriate for the given application: Comparing two genes or proteins with the same function
Global Alignment
62
What type of Pairwise Sequence Alignments is appropriate for the given application: Searching for local similarities in large sequences
Local alignment
63
What type of Pairwise Sequence Alignments is appropriate for the given application: Looking for conserved domains of motifs in two proteins
Local Alignment
64
What type Sequence Alignments is appropriate for the given application: Determines if all of the sequences are identical by presence of ASTERISK
Multiple Sequence alignment: Clustal Omega
65
T or F In Multiple Sequence alignment: Clustal Omega, absence if asterisk means the sequence is similar
F (absence of asterisk means sequence is DISSIMILAR/VARIATION)
66
Pairwise Sequence Alignment: Emboss Water Straight line indicates that the sequences are?
Similar
67
Pairwise Sequence Alignment: Emboss Water Straight line indicates that the sequences are?
Similar
68
Pairwise Sequence Alignment: Emboss Water Dot/period indicates that the sequences are?
Dissimilar
69
Pairwise Sequence Alignment: Emboss Water Meaning of Y in the sequence?
Any pyrimidine
70
Pairwise Sequence Alignment: Emboss Water Meaning of R in the sequence?
Any purine
71
Pairwise Sequence Alignment: Emboss Water Meaning of N Ain the sequence?
Any bases
72
Pairwise Sequence Alignment: Website → Finds regions of local similarity between sequences → The amino acid sequences of proteins or nucleotide of DNA sequences → Compare a query sequence with a library or database of sequence → Identify library sequences that resemble the query sequence above certain threshold → To Identify uncharacterized genes
Basic Local Alignment Search Tool (BLAST)
73
Query, Database, Remarks of the program BLASTn?
Query: Nucleotide Database: Nucleotide Remarks: for high scoring matches
74
Query, Database, Remarks of the program BLASTp?
Query: Protein Database: Protein Remarks: uses substitution matrices
75
Query, Database, Remarks of the program BLASTx?
Query: Nucleotide (trans) Database: Protein Remarks: for novel DNA seqs and EST analysis
76
Query, Database, Remarks of the program TBLASTx?
Query: Protein Database: Nucleotide Remarks: for STS and EST assignments in databases
77
What are the 2 possible results of BLAST?
1. Graphic Summary 2. Sequences producing significant alignment
78
supply multiple sequences to be aligned to identity regions of similarity that may be a consequence of functional, structural, or evolutionary relationships | what program
Multiple Sequence Comparison by Log Expectation (MUSCLE)
79
Multiple Sequence Comparison by Log Expectation (MUSCLE): T or F Each sequence should have definition line preceded by the">" greater than symbol
T
80
Multiple Sequence Comparison by Log Expectation (MUSCLE): T or F Choose part of sequence that is similar for primer
F (DO NOT choose part of sequences that is similar; dapat di sila similar bb)
81
What appropriate alignment sequence should be used? what website? Aligning the envelope genes of the 4 dengue virus
Multiple Alignment Sequence: MUSCLE
82
BLAST or Multiple alignment You supply one or more query sequences
BLAST
83
BLAST or Multiple alignment Compares nucleotide or protein sequences to sequence databases
BLAST
84
BLAST or Multiple alignment Uses to infer functional and evolutionary relationships between sequences
BLAST | parangboth
85
BLAST or Multiple alignment Uses to infer functional and evolutionary relationships between sequences
BLAST (both dapat)
86
BLAST or Multiple alignment Help identify members of gene families
BLAST
87
BLAST or Multiple alignment You supply multiple sequences to be aligned to identity regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences
Multiple alignment
88
DEFINITION OF TERMS IN BIO INFORMATICS: → a text-based, bioinformatic data format used to store nucleotide or amino acid sequences (e.g. Deoxyribonucleic Acid [DNA] or Ribonucleic Acid [RNA]). → pronounced "Fast A" ("fast-aye") because the name is a shortening of "FAST-All".
FASTA
89
DEFINITION OF TERMS IN BIO INFORMATICS: Can be present in one of the sequences wherein one or more amino acid residues have been deleted from the sequence
Gap
90
DEFINITION OF TERMS IN BIO INFORMATICS: The input sequence that is being compared to others in the database aka sequence of interest
Query Sequence
91
DEFINITION OF TERMS IN BIO INFORMATICS: The sequence you are comparing to
Subject Sequence
92
93
DEFINITION OF TERMS IN BIO INFORMATICS: A diagram that depicts the lines of evolutionary descent of different species, organisms, or genes from a common ancestor
Phylogenetic tree
94
DEFINITION OF TERMS IN BIO INFORMATICS: A series of digits that are assigned consecutively to each sequence record processed by NCBI
GI number
95
DEFINITION OF TERMS IN BIO INFORMATICS: A unique identifier assigned to a record in sequence databases such as GenBank
Accession number
96
DEFINITION OF TERMS IN BIO INFORMATICS: The process of deriving the structural and functional information of a protein or gene from a raw data set using different analysis, comparison, estimation, precision, and other mining techniques
Genome annotation
97
DEFINITION OF TERMS IN BIO INFORMATICS: → A set of values for qualifying the set of one residue being substituted by another in an alignment → Calculated by adding substitution scores, defined for each aligned pair of letters, and gap scores for each run of letters in one segment aligned with null characters inserted into the other
Score
98
DEFINITION OF TERMS IN BIO INFORMATICS: A parameter that describes the number of hits one can “expect” to see by chance when searching a database of a particular size
Expect Value
99
Full name of BLAST?
Basic Local Alignment Search Tool
100
Full name of MUSCLE?
Multiple Sequence Comparison by Log Expectation