Lecture 08 - Bioinformatics Flashcards

1
Q

What is bioinformatics

A

the collection, classification, storage and analysis of biochemical and biological information using computers especially as applied to molecular genetics and genomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the INSDC

A

international nucleotide sequencing database collective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Who makes up the INSDC

A

DDBJ, NCBI, ENA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the DDBJ

A

DNA data bank of Japan

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the NCBI

A

national center for biotechnology information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the ENA

A

european nucleotide archive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the FASTA format

A

accession number, identifier, what kind it is, then sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is FASTQ used for

A

next generation sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is found in the GenBank Header

A

locus, definition, accession, version, keywords, source, references

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What information is found in the locus

A
  • locus name
  • length of sequence
  • molecule type
    -genebank division (3 letter code)
  • date last modified
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is in the definition

A

a brief description of the sequence (may include source organism and gene name)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the accession number

A

a unique identifier for the sequence within the database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the version number

A

it denotes any change to the sequence since it was first submitted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a reference sequence

A

high quality sequences that the NCBI have curated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the format for refSeq accession numbers

A

have an underscore
NM_
NC_
NG_
NR_
NZ_

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the source

A

free-format information including an abbreviated form of the organsims name

17
Q

What is the organism

A

the formal scientific name for the source organism

18
Q

What does direct submission mean

A

the person who added the sequence and the date that the sequence was added to the database

19
Q

What does CDS mean

A

coding sequence

20
Q

What is an exon

A

definitive region of genome that codes for a portion of spliced mRNA, rRNA, or tRNA, may contain 5’UTR, all CDSs and 3’UTR

21
Q

What is an intron

A

a segment of DNA that is transcribed but removed from within the transcript by splicing together the sequences on either side of it

22
Q

What is a gene

A

a region of biological interest identified as a gene and for which a name has been assigned

23
Q

What is a location

A

a site between two adjoining nucleotides such as a restriction enzyme site that is indicated by listing the two points separated by ^

24
Q

What is a sequence span

A

indicated using the starting base number and the ending base number separated by two periods

</> symbols may be used with the starting and ending numbers to indicate that an end point is beyond the specified base numbers

25
Q

What is a location operator

A

a prefix that specifies what must be done to the indicated sequence to find or construct the location corresponding to the feature

26
Q

What are common operators

A

complement
join

27
Q

What is complement

A

find the complement of the sequence and then present it in 5’ to 3’

28
Q

What is join

A

the indicated elements should be joined to form one contiguous sequence

29
Q

What is complment join

A

the indicated elements should be joined to form a contiguous sequence and then take the complement and place it in 5’ to 3’ orientation