Módulo 1 Flashcards

António

1
Q

O que são genes parálogos?

A

Genes present in a particular organism that are related to each other through a gene duplication event (come from the same gene but have different functions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is MSA?

A

Multiple Sequence Alignment. Método computacional que dá highlight a sequência mais longa possível que seja semelhante entre múltiplas proteínas.

O alinhamento múltiplo é a principal ferramenta de pesquisa de semelhanças entre elementos da mesma família.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

O que são padrões conservados (Conserved patterns)?

A

Sequências que surgem do alinhamento e podem ser utilizadas para definir assinaturas que caracterizam uma família ou domínio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the following string represent?

ATOM 400 N ALA A 53 36.594 24.706 31.023 1.00 15.56 N

A

Protein Data Bank PDB format (text)

400 Atom number
N The actual atom
ALA A Amino Acid chain
53 Amino acid number
36.594 X coordinate
24.706 Y coordinate
31.023 Z coordinate

1.00 15.56 N Factor Beta (Error in the position of the atoms)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

O que é Alternative Splicing?

A

O splicing alternativo é um processo celular no qual os exões do mesmo gene são unidos em diferentes combinações, levando a transcrições de mRNA diferentes, mas relacionados. Estes mRNA podem ser traduzidos para produzir diferentes proteínas com estruturas e funções distintas – tudo a partir de um único gene

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What can Alternative Splicing be used for?

A

It’s a way to use the same gene to encode proteins that are more or less related.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

NEXT

A

NEXT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why are there more proteins in the cell than there are genes for said proteins? (Example, why does a cell have 100 proteins but only 10 genes?)

A

Through Alternative Splicing, a gene can encode for different proteins by removing different parts of itself, producing different proteins.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a domain?

A

They are areas of the protein that are more or less different or independent from each other. Sufficiently stable to fold independently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the difference between a domain and a subunit?

A

A domain is a discrete function and/or structural section of a polypeptide. This differs from a subunit, which is a single polypeptide in a protein which is in turn composed of multiple polypeptides. It is important to note that subunits can have domains

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is it possible for a protein to have 2 subunits but one domain?

A

For example, if both subunits have the same function, it can be considered that the protein has only one domain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why does automatic genome annotation work?

A

Because similar sequences imply similar functions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the basis for our understanding how structural domains evolve in proteins?

A

Similar sequences imply similar structures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What happens when we have a lower sequence similarity?

A

Higher structure dissimilarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why is the function of an enzyme easy to estabilish?

A

Because they catalize the same reaction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does %ID measure? What about RMSD?

A

%ID: Sequence Similarity
RMSD: Structure Similarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What happens to the RMSD when proteins are similar?

A

Tends to be lower

18
Q

What are some measurements of RMSD?

A

Posições mais semelhantes de aa
Distância entre alfa-C nas cadeias peptídicas, média do quadrado da distância

19
Q

What is the seed alignment?

A

The alignment we use first. A hand-curated alignment of known members of the family

20
Q

What is the structural unit of the protein?

A

The domain.

21
Q

The number of possible domain folds known so far is theoretically finite but in reality it’s practically infinite. True or false?

A

False. Very large but seems to be finite

22
Q

O que são domain folds?

A

O número de formas diferentes de empacotar os elementos da estrutura secundária nos domínios.

23
Q

Sequence based predictions predict structural features based on the secondary structure, the aminoacid sequence. True or false?

A

False. It’s the primary structure, (aa sequence)

24
Q

Give an example of a false positive in a deterministic pattern

A

The sequence verifies the pattern although it has the rarest possibilities in all of the amino acids

25
Give an example of a false negative in a deterministic pattern
The sequence verifies everything in the pattern except one completely unexpected amino acid.
26
What are some downsides of DP?
Extremely high rigidity, which means it doesn't accept any deviations to the pattern.
27
What is a Confusion Matrix?
A Table with two rows and two columns that reports the number of true positives, false negatives, false positives, and true negatives
28
Num classificador binário, a sensibilidade é dada pela taxa de verdadeiros positivos e a especificidade é dada pela taxa de falsos negativos. Verdadeiro ou falso.
FALSO. A especificidade é dada pela taxa de verdadeiros negativos.
29
A consensus sequence is likely a true positive. True or false?
True.
30
Name one big advantage of PSSM.
It has a lot less false negatives because it can put any amino acid in any position
31
What's PSSM and what's its deal?
Position specific scoring matrix attributes a value to each amino acid, and depending on the score they receive, they conclude if the sequence belongs or not to a certain family or domain.
32
PSSM attributes a score of 0 to any gap/transition in the sequence. True or false?
False. It does not attribute scores to gaps/transitions.
33
PSSM doesn't attribute a score to any gaps in the sequence, but it does attribute weight. True or false?
True
34
In a weight matrix, the higher the score between two aminoacids, the higher the chances of them being interchangable. True or false?
True
35
What is homology modelling?
The use of information from homologous proteins from databases and other sequences to model a protein.
36
What are the three mains steps of homology modelling?
Alignment and superposition with the reference structure, improvement and preliminary structure, and optimization of energy.
37
Swiss model is unreliable only in the position of loops and the side groups of amino acids, and is only reliable for the position of the backbone (alfaCarbon)
True
38
What is the Rosetta Method?
Splitting a sequence into several shorter sequences, and trying to fill the gaps with "randomly generated" sequences, until something fits.
39
Alphaphold is so good it's similar to structures provided by...
X-Ray crystallography
40
Num classificador binário, a sensibilidade é dada pela taxa de verdadeiros positivos e a especificidade é dada pela taxa de verdadeiros negativos. Verdadeiro ou falso
Verdade
41
Num classificador binário, a especificidade é dada pela taxa de verdadeiros positivos e a sensibilidade é dada pela taxa de verdadeiros negativos. Verdadeiro ou falso
Falso. Sensibilidade é dada pela taxa de verdadeiros positivos Especificidade é dada pela taxa de verdadeiros negativos
42
O que significa pontuação (local) de QMEAN4
O QMEAN4 é uma função de scoring usada assessar a qualidade da previsão de modelos de estrutura proteica, que compões outras 4 métricas, como a interação entre os Cb, a interação entre todos os átomos, exposição ao solvente e a torsão entre 3 aa consecutivos. Uma pontuação local iria significar a qualidade de previsão de um aa ou local em específico, tendo em conta as métricas.