Deep Learning Flashcards
Deep learning
Neural network with 3 or more layers
Issues with vanishing/exploding gradients and they need a lot of data
Long Short Term Memory
Gated memory cells
Allow long range dependencies to be learned
Good at language modelling
Protein homology detection
LTSM uses
Protein secondary structure prediction (not better than PSI pred)
Protein homology detection (One hot encodings, no BLOSOM, faster than BLAST, multiple alignment, learns profile via error propogation)
Deep Belief nets
Trained layer at a time
Fix previous weights as each layer added
Train next layer
Tune with back propogation
Protein model quality/stability prediction
Convolutional Neural Nets
Inspired from the animal visual cortex
Filters
Weight sharing
Translational invariance
Computer Vision (MNIST, medical imagery (classification and segmentation), gene expression)
Autoencoders
Trained to reproduce the input
Builds encoding layer
Lower dimensional representation can include good deal of semantic meaning
Word2vec,prot2vec,DNA2vec
Estimating protein model quality/stability prediction
Variational autoencoders
Reconstruct data from estimate distribution
Trained via reconstruction loss, KL divergence
Protein encoding/Protein design
methyINet
Attention
In a sentence, some words more important
Bidirecional LSTMs to generate encoding
Weighted sum of hidden states
Accurate in sentinment analysis and translation tasks
Transformer
Use of Attention (Google Brain)
Self-attention
Encoder section (N=6)
ResNets
Builds on convolutional networks
Uses skip connections, can go deeper
Used for protein structure prediction
Sequence alignment profiles
Showed significant improvement
CASP
Critical Assessment of protein structure prediction
Competition for best prediction of protein structures
Evaluated on alpha C prositions
GDT_TS scores from 0-100
Molecular modelling
Used traditionally for proteins structure prediction
Very computer intensive
Coevolution
HMM SVMs with multiple sequence alignments (up to CASP 2010)
Sequence alignments give evolutionary information
Look for correlated changes in the protein sequence
Information about potential interactions allowed 3d structure to be inferred
Rosetta
Combination of physical calculations
Monte Carlo assembly
PSI BLAST Alignment
Up until CASP 13
AlphaFold
CASP13-2018
Free modelling category
AlphaFold2 builds on a lot of previous research (Use of resnets)