Lecture 11 Flashcards by Ana Sofía Mendoza

What method can be used for promoter analysis? What does it do?

Gibbs Sampler (motif identification)

Searches for statistically most probably motifs in unaligned sequences

How well did you know this?

Not at all

Perfectly

What methods are used for determining transcription factor binding site in silico?

Word counting methods
Gibbs sampling
Phylogenetic footprinting/ comparative genomics

How well did you know this?

Not at all

Perfectly

Describe the steps of Gibbs Sampling

Set width for motif
Choose random position for the start of the motif in all but one sequences
Estimate the amino acid (or nt) frequencies in the motif columns of all but left-out sequence
Estimate background frequencies (nt0): frequencies of nt (or aa) in positions that are not considered the motif
Scan out the left out sequence and estimate probability of finding the motif at any position
- Calculate odds score ratio for each position (a= pobserved / p background)
Add up all above odds score and divide the odds score for each position by the total to obtain probability that motif is at that position
These probabilities used as weights to decide a probable location of the motif in the left out sequence
Repeat > 100 times

How well did you know this?

Not at all

Perfectly

What is the goal of Gibbs sampling?

Find the most probable patterns common to all of the sequences by sliding them back and forth until the ratio of the motif probability to the background probability is a maximum

How well did you know this?

Not at all

Perfectly

How was Gibbs Sampler modified?

search multiple motifs
seek pattern in only fraction of input sequences (bc not all genes regulated by same TF or regulatory mechanism)
Look motifs of different widths
Avoid suboptimal solution by shifting current alignments a certain number of positions to right and left

How well did you know this?

Not at all

Perfectly

For what is the hypergeometric p-value used?

To ask if there are GO categories enriched in my cluster

How well did you know this?

Not at all

Perfectly

Name 4 methods to study protein-protein interactions

Classic biochemical (chromatographic) methods
yeast two hybrid followed by clone sequencing
Affinity purification (TAP tagging then mass spectrometry)
interologs or BioID

How well did you know this?

Not at all

Perfectly

How does Y2H work?

Attach Gal4 binding domain to bait protein
Separate Gal4 activation domain and attach to prey protein
if bait and prey interact in vivo in nucleus of yeast, Gal4 fxn is reconsittuted and drive expression of reporter gene:blue yeast colonies

How well did you know this?

Not at all

Perfectly

Problems of Y2H:

-Assay done in yeast, so might not get modifications necessary for protein funtion
-overexpressing protein and targetting to nucleus
-not well for membrane proteins

How well did you know this?

Not at all

Perfectly

Is Y2H useful for membrane proteins?

How well did you know this?

Not at all

Perfectly

Is Y2H useful for transient interactions?

Yes for transient binary interactions

How well did you know this?

Not at all

Perfectly

What is TAP?

Tandem affinity purification

How well did you know this?

Not at all

Perfectly

How does TAP tagging work?

immunoprecipitation-based purification technique for studying protein–protein interactions. The goal is to extract from a cell only the protein of interest, in complex with any other proteins it interacted with. TAP uses two types of agarose beads that bind to the protein of interest and that can be separated from the cell lysate by centrifugation, without disturbing, denaturing or contaminating the involved complexes. To enable the protein of interest to bind to the beads, it is tagged with a designed piece, the TAP tag.

The original TAP method involves the fusion of the TAP tag to the C-terminus of the protein under study. The TAP tag consists of three components: a calmodulin binding peptide (CBP), TEV protease cleavage site, and two Protein A domains, which bind tightly to IgG (making a TAP tag a type of epitope tag).

How well did you know this?

Not at all

Perfectly

What are other types of arrays used for protein study?

Protein Microarrays
Glycan Arrays (arrayed 100s carbohydrates onto slides as a tool to understand carbohydrate biology)

How well did you know this?

Not at all

Perfectly

Why study glycosylation?

-regulatory mechanisms
-carbs key structural support in plant biology

How well did you know this?

Not at all

Perfectly

How did Moller et al. studied glycan arrays?

Study These Flashcards

Developed monoclonal antibodies to crude extracts of cell wall polymers from aribidopsis (rat or mouse)
-use mAb to probe fixed samples to understand distribution of glycans in different cells

how did Moller et al. determined specificity of individual mAbs?

Study These Flashcards

64 different plant glycans were arrayed onto nitrocellulose, each mAb hybrdized to array and detected using anti-mouse or anti-rat secondary antibodies linked to alkaline phosphatase
-arrays scanned, specificities determined by CLUSTER ANALYSIS

What are 3 gene expression databases?

Study These Flashcards

ArrayExpress
GEO (Gene Expression Omnibus)
SRA (Sequence Read Archive)

For what organisms we can find specific gene expression databases?

Study These Flashcards

Human (RefExA)
Mouse
Worm
Fly
Arabidopsis

For gene expression data sets what information should be available?

Study These Flashcards

source of tissue, age, microarray element identifiers/ identifier annotation, library and fragmentation protocols, etc.

Genomic analysis of AtPERKs research question?

Study These Flashcards

although PERK1 induced rapidly upon pathogen attack in B. napus, no visible phenotype for individual AtPERK mutants.

What was a novel thing of genomic analysis of AtPERKs

Study These Flashcards

did not do any experiments, just used gene expression databases–>equivalent to 6k northern blots

What consideration should be made for designing higher-order mutants?

Study These Flashcards

similar gene expression profiles and sequence similarity (high for both)

What is a consequence of coexpression analysis?

Study These Flashcards

Coexpressed genes that have a vague functional description might be involved in similar biological process of genes whose funciton is known, so guilt-by association to assign funciton
*RGL2 whose role in floral biology was unknown

What was the research question behind seed coexpression network analysis?

Since no master regulator of dormany or germination had been identified, can we use coexpression analysis to identify crucial hubs in networks?

What was the process of seed co-expression network analysis?

1. Array database (seed microarray database, 175 published samples) 2. Coexpression calculations: calculate genes coexpression scores in samples that are dormant or can germinate 3. Database queries: filter databse of interactions, >4.5 M interactions 4. Visualize and analyze network

What is SeedNet?

Coexpression network based on seed samples

What does a node represent in SeedNet?

Node is a gene, and lines between genes denote coexpression between genes - red colouring : upregulation in dormant samples -blue colouring: increased expression in gemrinating samples Node size is proportional to number of connections gene has/ number of coexpression partners the gene has

What could the authors identify with SeedNet?

Novel transcriptional regulators of dormany or germination using guilt-by association

Lecture 11 Flashcards

(29 cards)