Bioinformatics Flashcards

Question 1

Q

interdisciplinary
field that combines biology, computer science,
statistics, mathematics, and engineering to analyze
and interpret biological data, particularly data from
large datasets like genomes or protein sequences

Answer

A

Bioinformatics

Question 2

Q

It is a widely-used format for
representing nucleotide or protein sequences.

Question 3

Q

It consists of a header line starting with ‘>’, followed by the sequence data on subsequent lines.

Question 4

Q

in sequence alignment, a ________ represents a position where one sequence has an insertion or
deletion relative to another sequence.

Question 5

Q

____________ are
introduced to optimize alignment and account for
evolutionary changes

Question 6

Q

___________ are
introduced to optimize alignment and account for
evolutionary changes.

Question 7

Q

It is the
sequence for which you are searching for similarities
or matches within a database

Answer

A

Query sequence

Question 8

Q

It’s the sequence you
are using as a reference

Answer

A

Query sequence

Question 9

Q

it is the
sequence(s) in a database against which the query
sequence is compared during sequence alignment or
similarity searches

Answer

A

Subject sequence

Question 10

Q

it is a branching
diagram that depicts the evolutionary relationships
among a set of organisms, genes, or species

Answer

A

Phylogenetic tree

Question 11

Q

It
shows the inferred evolutionary history and
relatedness based on genetic or sequence data

Answer

A

Phylogenetic tree

Question 12

Q

it is a
unique numerical identifier assigned to each
sequence entry in the NCBI (National Center for
Biotechnology Information) databases.

Answer

A

GI number

Question 13

Q

It provides a
stable and unique way to refer to a specific sequence
entry.

Answer

A

GI number

Question 14

Q

It is a
unique identifier assigned to a sequence record in a
public sequence database (like GenBank, EMBL, or
DDBJ)

Answer

A

Accession number

Question 15

Q

Typically consist of letters
and numbers and are used to reference specific
sequence entries.

Answer

A

Accession number

Question 16

Q

Involves
identifying and labeling the features of a genome such as genes, regulatory sequences, and other
functional elements.

Answer

A

Genome annotation

Question 17

Q

This process helps in
understanding the biological significance of the DNA
sequence.

Answer

A

Genome annotation

Question 18

Q

In sequence alignment or similarity searches, it is a numerical value that quantifies the level
of similarity or quality of alignment between two
sequences.

Question 19

Q

Higher scores generally indicate more
significant similarity.(T or F)

Question 20

Q

It is a statistical
measure that estimates the number of different
alignments with scores equivalent to or better than a
given score that would occur by chance in a database
search.

Answer

A

Expect value (E-value)

Question 21

Q

A ___________ indicates a more significant
match or similarity.

Answer

A

lower E-value

Question 22

Q

A field which uses computers to store and analyze
molecular biological information

Answer

A

BIOINFORMATICS

Question 23

Q

It is about finding and interpreting biological data
online

Answer

A

BIOINFORMATICS

Question 24

Q

It is a field in which biology, mathematics, statistics, computer
science, information technology, and other health sciences are
merged into a single discipline to process biological data

Answer

A

BIOINFORMATICS

Question 25

Q

It uses complex machines to read biological data at a much
faster rate than before.

Answer

A

BIOINFORMATICS

Question 26

Q

There is a marriage between biology and informatics. (T or F)

Question 27

Q

The science of collecting and analyzing complex
biological data

Answer

A

BIOINFORMATICS

Question 28

Q

Allows the storage and management of large biological data sets

Answer

A

THE CREATION OF DATABASES

Question 29

Q

Data is being generated at a much greater pace than
its analysis (e.g. Human Genome Project)

Answer

A

THE CREATION OF DATABASES

Question 30

Q

These are repositories so it’s like a bank of biologic
information and are designed to collect, archive, visualize, and
organize biologic data.

Answer

A

Databases

Question 31

Q

This is to enable scientists to have an
intelligent data description, interpretation, or retrieval.

Answer

A

Databases

Question 32

Q

There is
much data that has been generated especially since the
completion of the

Answer

A

Human Genome Project

Question 33

Q

When was Human Genome Project launched?

Question 34

Q

Objective of human genome project

Answer

A

To sequence
the entire human genome which consists of about 3.2 billion
base pairs.

Question 35

Q

It was completed in 2003 because of this there’s a
large amount of data that have to be interpreted or analyzed.

Answer

A

Human Genome Project

Question 36

Q

Aside from the human genome, many other organisms were
completely sequenced. So there is again an enormous amount
of data that has to be understood that is why databases have
been created. (T or F)

Question 37

Q

PRINCIPAL COMPONENTS OF BIOINFORMATICS

Answer

A

*THE CREATION OF DATABASES
*THE DEVELOPMENT OF ALGORITHMS AND STATISTICS
*THE USE OF THESE TOOLS FOR THE ANALYSIS AND
INTERPRETATION OF VARIOUS TYPES OF
BIOLOGICAL DATA

Question 38

Q

Determine relationships among members of large
data sets

Answer

A

THE DEVELOPMENT OF ALGORITHMS AND
STATISTICS

Question 39

Q

The large set of data are organized so that relationships can
be determined that is called

Answer

A

Algorithm

Question 40

Q

Algorithm is applied in ________

Answer

A

Statistics

Question 41

Q

including DNA, RNA and protein sequences, protein
structures, gene expression profiles, and biochemical
pathways

Answer

A

THE USE OF THESE TOOLS FOR THE ANALYSIS AND
INTERPRETATION OF VARIOUS TYPES OF
BIOLOGICAL DATA

Question 42

Q

Sciences that attempt to describe a living organism
in terms of ‘omics’

Answer

A

BRANCHES OF BIOINFORMATICS

Question 43

Q

BRANCHES OF BIOINFORMATICS

Answer

A

Genomics
Transcriptomics
Proteomics
Microbiomics
Metabolomics

Question 44

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

involves the description of sequences of
the entire genome of an organism

Question 45

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

study of all RNA molecules in a
living organism

Answer

A

Transcriptomics

Question 46

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

the description of the entire
complement of proteins in a living organism.

Answer

A

Proteomics

Question 47

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

They
study the sequence, 3D structures, and other
properties of proteins.

Answer

A

Proteomics

Question 48

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

It is the entire proteins found in a living organism.

Answer

A

Proteomics

Question 49

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

Pertains to microbes, viruses, fungi,
parasites, bacteria.

Answer

A

Microbiomics

Question 50

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

The genomes of these
microorganisms are described within a specific environmental niche

Answer

A

Microbiomics

Question 51

Q

IDENTIFY THE BRANCH OF BIOINFORMATICS

involves description of the chemical
processes involving metabolites.

Answer

A

Metabolomics

Question 52

Q

DNA/RNA BIOINFORMATICS APPLICATIONS

Answer

A

● Retrieving DNA sequences from databases
● Computing nucleotide compositions
● Identifying restriction sites
● Designing polymerase chain-reaction (PCR) primers
● Identifying open reading frames (ORFs).
● Predicting elements of DNA/RNA secondary structure
● Finding repeats
● Computing the optimal alignment between two or
more DNA sequences
● Finding polymorphic sites in genes (single nucleotide
polymorphisms, SNPs)
● Assembling sequence fragments

Question 53

Q

Identifying open reading frames (ORFs) - Open reading frames means that you have a sequence
which includes the

Answer

A

start codon until a stop codon

Question 54

Q

WHY DO BIOINFORMATICS?

Answer

A

● It serves to save time when doing real experiments.
design primers
● You might want to do a simulated experiment on a
computer (‘ in silico’) instead of a real environment.

Question 55

Q

Bioinformatics is very convenient for a scientist because it
serves to

Answer

A

Save him time when he wants to do a real
experiment. As the experiment or the research study may start by
simulating it in a computer first.

Question 56

Q

When you do simulated
experiments in a computer, that is described as “in silico” so it
is done in a computer rather than a real environment. For
example, when you do PCR and you want to amplify a
particular DNA fragment, you design primers using
bioinformatic tools or software. (T or F)

Question 57

Q

Once you have designed a
primer, then you can do your actual laboratory experiment, we
call it the ____________

Question 58

Q

Where the primer would be optimized and
eventually used in the amplification reaction.

Question 59

Q

APPLICATIONS OF BIOINFORMATICS

Answer

A

● Sequence alignment and analysis
● Mapping and analyzing DNA, RNA, Protein, Amino
Acid, and Lipid sequences
● Creation and visualization of 3-D structure models for
biological molecules of significance, e.g., proteins
● Genome annotation
● Genetic diseases
● Designer Medicine

Question 60

Q

APPLICATIONS IN VARIOUS FIELDS

Answer

A

● Microbial genome applications
● Molecular medicine
● Personalized medicine
● Gene therapy
● Drug development
● Antibiotic resistance
● Evolutionary studies
● Waste cleanup
● Biotechnology
● Climate change studies
● Alternative energy sources
● Crop improvement
● Forensic analysis
● Bio-weapon creation
● Insect resistance
● Improve nutritional quality
● Veterinary science

Question 61

Q

The earliest databases
for DNA sequences and proteins were developed by three
groups of scientists from different parts of the world:

Answer

A

● Nucleic Acids (International Nucleotide Sequence
Database)
● Protein (Worldwide Protein Data Bank)

Question 62

Q

IDENTIFY THE DATABASE

DDBJ (DNA Data Bank of Japan)

Answer

A

Nucleic Acids (International Nucleotide Sequence
Database)

Question 63

Q

IDENTIFY THE DATABASE

EMBL (European Molecular Biology Lab)

Answer

A

Nucleic Acids (International Nucleotide Sequence
Database)

Question 64

Q

IDENTIFY THE DATABASE

EMBL (European Molecular Biology Lab)

Answer

A

Nucleic Acids (International Nucleotide Sequence
Database)

Answer 51

A

Nucleic Acids (International Nucleotide Sequence Database)

Answer 52

A

Protein (Worldwide Protein Data Bank)

Answer 53

A

Protein (Worldwide Protein Data Bank)

Answer 54

A

● Ensembl
● Human metabolome Database (HMDB)
● Gene Expression Databases - Mostly Microarray data
● Phenotypic Databases
● RNA Databases
● Amino Acid/Protein Databases
● Protein-Protein and other Molecular interactions
● Signal Transduction Pathway Databases
● Metabolic Pathway and Protein Function Databases
● Bacterial DNA Databases

Answer 55

A

● A disease may arise due to changes the sequence of
the gene being expressed
● Single Nucleotide Mutation: Sickle Cell Anemia

Answer 56

A

Sickle cell anemia

Answer 57

A

FALSE (Valine NOT VASELINE)

Answer 58

A

Glutamic acid

Answer 59

A

Phenotype

Answer 60

A

Sickle-Cell Anemia

Answer 61

A

SEQUENCE ALIGNMENT

Answer 62

A

a known sequence (reference sequence)
and unknown sequence (query sequence)

Answer 63

A

Known sequence

Answer 64

A

Unknown sequence

Answer 65

A

Pairwise
Multiple

Answer 66

A

○ EMBOSS WATER
○ BLAST

Answer 67

A

○ MUSCLE
○ MAFFT
○ CLUSTAL Omega

Answer 68

A

Global alignment
Local alignment

Answer 69

A

Global alignment

Answer 70

A

Global alignment

Answer 71

A

Global alignment

Answer 72

A

Global alignment

Answer 73

A

Global alignment

Answer 74

A

Local alignment

Answer 75

A

Local alignment

Answer 76

A

Local alignment

Answer 77

A

Local alignment

Answer 78

A

Local alignment

Answer 79

A

Clustal omega

Answer 80

A

MUSCLE
MAFFT
Clustal Omega

Answer 81

A

Multiple Sequence Comparison by Log
Expectation

Answer 82

A

Multiple Alignment using Fast Fourier
Transform

Answer 83

A

MUSCLE (Multiple Sequence Comparison by Log Expectation)

Answer 84

A

NCBI: Basic Local Alignment Search Tool (BLAST)

Answer 85

A

NCBI: Basic Local Alignment Search Tool (BLAST)

Answer 86

A

NCBI: Basic Local Alignment Search Tool (BLAST)

Answer 87

A

NCBI: Basic Local Alignment Search Tool (BLAST)

Answer 88

A

MULTIPLE ALIGNMENT

Answer 89

A

MULTIPLE ALIGNMENT

Answer 90

A

MULTIPLE ALIGNMENT