papers Flashcards

1
Q

the format for metadata storage in arrayexpress?

A

MAGE-TAB format it has 2 spread sheets: Investigation Description Format (IDF) file and the Sample Data Relationship Format (SDRF) file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is investigation description format in arrayexpress?

A

The IDF contains an overview of the whole experiment, including the title, the submitter’s contact details, publication information, protocols and the experimental variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The SDRF format in arrayexpress?

A

The SDRF describes all the sample characteristics (e.g. cell type) or any treatment that the sample has been subjected to (e.g. growth in low oxygen conditions), and links each sample to its corresponding data file. The structure of the SDRF, i.e. the order of the columns, reflects the experimental workflow from source material, through intermediate steps (e.g. labelling of nucleic acids, preparation of sequencing libraries, running of sequencing assays) to raw and processed data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are CEL files?

A

the raw data for microarray experments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how to specify the exp factor like cold and the organism in the arrayexpress search?

A

efv: cold AND organism: “Oryza sative”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what are the dyes for control and treatment in the microarray?

A

CY3 : control

cy5: treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

DREB/CBF is a part of which type of TFs?

A

ERF

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what does the green text in the linux shell imply?

A

the computer is ready to accept our commands

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is working directory in linux?

A

One important concept to understand is that the shell has a notion of a default location in which any file operations will take place. This is its working directory. If you try to create new files or directories, view existing files, or even delete them, the shell will assume you’re looking for them in the current working directory unless you take steps to specify otherwise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are the slashes in the folder addresses?

A

directory seperators

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

the function for creating a list in python?

A

list()

if no arguments are passed it would return an empty list

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

the code for installing packages in R?

A

install.packages(‘limma’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what’s the code for converting the data into a dataframe?

A

x=data.frame()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what’s the main difference b/w lists and tuples?

A

lists are mutable tuples are not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is a csv file

A

comma separated file, is a text file which contains a list of data which is separated by comma

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what does tuple unpacking mean?

A

splitting tuple elements into individual variables

a,b=(1,2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

the code for importing matplotlib.

A

import matplotlib.pyplot as plt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

the code for getting a metabolite by id in cobrapy?

A

model.metabolites.get_by_id(‘metabolte id’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what are the types of boundary reactions and why they’re called pseudo reactions?

A

All of them are unbalanced pseudo
reactions, that means they fulfill a function for modeling by adding to or removing metabolites from the model
system but are not based on real biology.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

an exchange, demand and sink reaction definition?

A

An exchange reaction is a reversible reaction that adds to or removes
an extracellular metabolite from the extracellular compartment. A demand reaction is an irreversible reaction that
consumes an intracellular metabolite. A sink is similar to an exchange but specifically for intracellular metabolites,
i.e., a reversible reaction that adds or removes an intracellular metabolite.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

codes for printing out the exchange sink and demand rxns of a model.

A

print(“exchanges”, model.exchanges)
print(“demands”, model.demands)
print(“sinks”, model.sinks)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

the code for reading excel files in python?

how to read only a single column?

A

import pandas as pd

df = pd.read_excel (‘C:\Users\Ron\Desktop\name of the file.xlsx’, usecols=[‘gene ID’])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

how to convert a dataframe to a list? how to do it with specific values?

A

genes_list =genes.values.tolist()[0:9]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

how to generally convert something to the list?

A

model_gene_list=list(rice_model.genes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

how to convert the list items to strings?

A

for i in list:

i=’‘.join(i or i.id)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

the CAM metabolism overall definition?

A

In plants
performing CAM photosynthesis, the stomata open at night and
CO 2 is fixed and stored in the vacuole in the form of a carboxylic
acid such as malate, citrate, or isocitrate (Maclennan et al., 1963;
Lüttge, 1990; Gawronska and Niewiadomska, 2015; Igamberdiev
and Eprintsev, 2016). During the hot, dry daytime hours, the
stomata can remain closed to minimize water loss, and the stored CO 2 is remobilized for fixation by Rubisco in the chloroplast,
accompanied by the accumulation of storage carbohydrates.
Although this cycle is energetically expensive, it conserves pre-
cious water and is an efficient alternative to direct daytime CO 2
fixation by Rubisco as in C 3 photosynthesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

how to know where the jupyter notebooks are being saved?

A

on a notebook we type pwd and it shows the working directory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

what does io in python mean?

A

import output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

what is an iterable?

A

Iterators are objects that allow you to traverse through all the elements of a collection

another def: iterator doesn’t give all the values but one value at a time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

how an iterator can be created

A

You can create an iterator object by applying the iter() built-in function to an iterable.
You can use an iterator to manually loop over the iterable it came from. A repeated passing of iterator to the built-in function next()returns successive items in the stream. Once, when you consumed an item from an iterator, it’s gone. When no more data are available a StopIteration exception is raised.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

what’s the iterator protocol?

A

The iterator objects are required to support the following two methods, which together form the iterator protocol:
iterator.__iter__()
Return the iterator object itself. This is required to allow both containers (also called collections) and iterators to be used with the for and in statements.
iterator.__next__()
Return the next item from the container. If there are no more items, raise the StopIteration exception.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

what is an iteration?

A

the act of going over a collection is called iteration. collections are like lists, tuples etc..
iterator is an object which can be used to iterate over a collection. the iter method gives the iterator and the next method is going to give us the next value using this iterator.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

how to get a list of methods pertained to an object?

A

dir(the name of the variable the object was assigned to)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

how to understand if sth is iterable?

A

it should have a method called __iter__

we can find it out by the dir() function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

how to make a pandas dataframe?

A

df=pd.DataFrame(data, columns=[‘name’,’age’])

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

how to concatenate dataframes using pandas?

A

first we need to make a list of all the dataframes.
frames=[df1,df2,df3]
result=pd.concat(frames)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

what is pandas.concat() function?

A

Concatenate pandas objects along a particular axis with optional set logic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

how to have continuous indices while merging 2 dataframes?

A

we should add an additional argument:

df=pd.contact(data,ignore_index=True)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

how to get a textual index when merging pandas dataframes and in what way it can be useful?

A

df=pd.concat(data, keys=[‘India’,’US’])
we can’t use this code while having the ignore_index argument.
we can call the subset of the dataframe pertaining to each key by using: df.loc[‘India’]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

how to stick different pandas dataframes horizontally (adding to columns instead of rows)?

A

df=pd.concat(data,axis=1)

the default value is 0 which adds to the rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

how to add a new column as series to our pandas dataframe?

A

s=pd.Series([‘humid’,’dry’], name=’event’)

df=pd.concat([data,s],axis=1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

the different types of merging?

A

inner join: only takes shared values between two dataframes.
outer join: considers the whole values.
left join: takes the values of the 1st dataframe and the shared values,
right: takes the values of the 2nd dataframe and the shared values,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

the code for reading an SBML model in cobrapy?

A

my_model=read_sbml_model(‘path to the file’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

what are different rice gene IDs?

A

Rice(Oryza sativa) has more than one form gene ID for the genome. The two main gene ID for rice genome are the RAP (The Rice Annotation Project, , and the MSU(The Rice Genome Annotation Project, . All RAP rice gene IDs are of the form Os##g####### as explained on the website . All MSU rice gene IDs are of the form LOC_Os##g##### as explained on the website . All SYMBOL rice gene IDs are the unique name on the NCBI(National Center for Biotechnology Information,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

how to call a package in r?

A

library(biomaRT)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

the code for getting the r version?

A

R.version

or in the linux terminal R –version

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

how to update r?

A

the new version of r should be installed via r-project.org and then we should choose CRAN
so the next time you open r studio, it will be working with the updated version.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

the 4 arguments in the biomart search?

A

Attributes: the column headers that we want in our outputs.
Filters: filters are our input data
Values: are identifiers that are used along filters to limit our results
Mart: the argument for database selection. it’s the first thing we choose (we specify the database)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

how to search a gene in a certain organism in NCBI?

A

name of the gene AND human [orgn]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

how to get the annotation for a platform from GEO?

A

click on the platform in the table below u see the annotations and for downloading click on download full table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

how to get the code for installing certain packages in r

A

go to the bioconductor and search the name of the package and u will find the code for installing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

how to read raw cel files in r?

how to costumize the reading?

A

first set the working directory to the folder that r files are in then run ReadAffy() function.
in the function if we write widget=T, a new window will be opened and we can select

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

the image function shows what and what should we deciphere from it?

A

it’s the image from microarray chip and the white dots are the dots with expressed genes the black dots are not expressed genes and we should check the integrity of these dots across the chip

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

how to see each sample covers which range of genes? I mean which range of numbers?

A

we should draw a boxplot

boxplot()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

the code for drawing a histogram in r

A

hist(data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

how a histogram shows the quality of the samples?

A

مثلا اگر قله همه نمودارها روی یا حول و حوش یک عدد بود یعنی کیفیت دیتای گرفته شده از سمپلها خوبه
بعد از نرمال کردن هم باید ببینیم اگر برطرف شد اوکیه اگر نشد باید حذفش کنیم

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

how to check the quality of the RNAs used for microarray in R? how should it be looked like?

A

AffyRNAdeg(data)
دنسیتی سمت 3 پریم باید از 5 پریم کمتر باشه
چون تخریب از 3 پریم به 5 پریم صورت میگیره
بهترین حالت این است که یک نمودار نزولی داشته باشیم نه حالت زیگزاگ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

how to normalize the data?

A

normalized_data=rma(data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

2 codes for showing the normal data?

A
یکی بعد از باز کردن دیتای نرمال شده با 
rma
و بخش assay data  
رو که میزنیم توی کنسول یه کد مینویسه که میتونیم سیوش کنیم 
راه دیگه استفاده از فانکشن exprs  هست
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

what is justRMA () for?

A

برای وقتایی که رم دستگاه کمه و کامپیوتر نمیتونه آنالیز و انجام بده با روش آر ام ای معمولی
اینجا دیگه داده نرمال رو فقط نشون میده حالت لارج اکسپرشن ست نیست

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

how to see which methods exist for background correction and normalization of transcriptome data in r?

A

bgcorrect. methods(NO ARGUMENTS)

normalize. methods(data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

how to manually choose for normalization and bg correction methods in one line of code in r?

A

data=expresso(data=trans_data, widget=T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

how to know the range of colors being recognized by R?

A

color()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

how to write a table in r? how to specify to seperate the values by tab?

A

write.table(x,file=’data.txt’, quote=F, sep=”\t”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

how to get rid of browse[1] in R?

A

type c in the console, but the function will continue running and if you press q you will exist the browser and the function both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

how to paste a number in whole rows of a column in excel?

A

first write the num in another cell, then copy the cell then select the cells you want to have that number and then press the arrow in the paste> paste special> operation> add

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

the function for performing some conditions in r?

A

gene_up= subset(data, name of the col we wanna perform the condition >2 )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

the shortcut key for renaming the files?

A

click on the file and press f2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

how to check if a newer version for rstudio exist?

A

in rstudio open help> check for updates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

how to change the row names of a table?

A

row.names(data)=x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

how to bring help of a function in r?

A

we should click on the function and press f1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

how to delete NA data from our vriable in R?

A

data=na.omit(data)

it removes the row containing NA data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

why do we specify upper and lower bound in FBA?

A

These bounds enforce thermodynamic reversibility and mechanistic (max uptake and secretion rate) constraints for the rxn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
74
Q

how GEM models can further elucidate

how changes in one component affect other pathways and cell phenotypes?

A

since these models connect genes to measur-
able cell phenotypes (e.g., growth, cell energetics, pathway
fluxes, biosynthesis of cell components, byproduct secretion,
etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
75
Q

what are Model extraction methods (MEMs)?

A

Model extraction methods (MEMs) employ diverse algorithms

to extract cell-line- or tissue-specific models from a GEM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
76
Q

what column vector v specifies?

A

which contains

the unknown fluxes through each of the reactions of the S matrix.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
77
Q

which system is underdetermind and it means what?

A

system of linear equations is established by
multiplying the S matrix by a column vector, v. the product of this matrix multiplication must
equal zero, S · v = 0 (Gianchandani et al., 2009). Because the
resulting system is underdetermined (i.e., too few equations, too
many unknowns), linear programming (LP) is used to optimize
for a particular flux, Z, the objective function, subject to under-
lying constraints

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
78
Q

how the objective function is depicted?

A

he objective function typically takes on the
form of:
Z = c · v
where c is a row vector of weights for each of the fluxes in col-
umn vector v, indicating how much each reaction in v contributes
to the objective function, Z

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
79
Q

what is the task of FBA?

A

Thus, the task of FBA is to find a solution to v that
lies within the bounded solution space and that optimizes the
objective function at the same time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
80
Q

gimme algorithm guarantees what?

A

guarantees to both produce a functioning
metabolic model based on gene expression levels and quan-
tify the agreement between the model and the data is called
the Gene Inactivity Moderated by Metabolism and Expression
(GIMME) algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
81
Q

The application of metabolic models?

A

metabolic phenotype prediction, metabolic engineering, studies of network evolution, biomedical application

82
Q

why GEMs are important in studying the metabolism of higher organisms?

A

It is extremely difficult, or even impossible for higher level or-
ganisms, to study the entirety of their pathways either in vitro or in vivo.
However, mathematical tools such as genome-scale models can be used to
gain insight into how these biological systems function

83
Q

4 main step in GEM reconstruction?

A

First, a draft reconstruction of the
biological network of an organism is extracted using information about re-
actions, enzymes, and pathways from databases such as KEGG, BRENDA,
etc. The second step is the manual curation of the reconstructed draft model.
This involves checking and filling the gaps and correcting misplaced reactions.
Here, organism-specific databases and literature are used. Computational al-
gorithms such as GAUGE [20], FastGapFill [21], and FBA-Gap [22] can also
be applied. The third step is the conversion of the reconstructed model into
a mathematical representation that can be used for subsequent simulations.
The final step is the refining, validation, and application of the model to inform decisions.

84
Q

what are solvers in FBA used for?

A

to solve a set of linear equations.

85
Q

what are the 13C-MFA drawbacks?

A

however, is challenging not only because of the
extensive instrumentation required but also because of the limited
number of fluxes and conditions that can be experimentally measured.
Typically, 13C-MFA focuses on central carbon metabolism

86
Q

why the transcriptome data is a better high-throuput data?

A

Unlike the first two omics data that cover a small
share of all reactions in a genome-scale model, transcriptomics and proteomics
are the platforms where a quantitative snapshot of molecular
species at system-level is currently possible [23]. However, proteomics
is a relatively immature technology compared to transcriptomics. The
accuracy with which protein concentrations can be determined is
much lower than that with which mRNA concentrations can be determined.
On the other hand, RNA amount changes can be precisely measured
in a highly automated process at low cost in comparison with the
amount of data gathered

87
Q

what is the logic behind E-flux?

A

The rationale behind E-flux is that, given a limited translational
efficiency and a limited accumulation of enzyme over the time, the level
of mRNA can be used as an approximate upper bound on the maximum
amount of metabolic enzymes, and hence as a bound on reaction rates.

88
Q

why in prokaryotes there is more correlation between protein level and mRNA abundance?

A

because the ribosomes attach to nascent mRNA so that translation can be synchronous with
transcription; proteins levels thus depend more directly on mRNA
abundance

89
Q

what is a probeset?

how the probes are numbered in a probeset?

A

a gene is represented by a probeset in the microarray. each probeset consists of 11 probes.
For each probeseton an array, the individual probes are numbered sequentially from the 5’ end of the transcript to the 3’ end.

90
Q

how to exit from the python shell in command prompt?

A

exit()

91
Q

how to make a copy of our model in cobrapy?

A

new= model. copy( )

92
Q

the code for printing all the reactions in the model in cobrapy?

A

model.reactions

without parenthesis

93
Q

the shortcut for deleting a cell in jupyter notebook?

A

select a cell till it becomes green and press D twice

94
Q

what is a frozenset() function?

A

it gets an iterable as an argument and makes it unchangeable.
you can’t assign a new value to it

95
Q

how come maximization of biomass can be the objective of microorganisms

A

The premise is
that an organism that acquires and/or redistributes
resources to outgrow its competitors will be in the best
position to survive

96
Q

why maximization of the cell’s biomass is not a suitable choice for modeling multicellular organisms?

A

ch an assumption, however, is likely to be
invalid for individual cell types in multicellular organ-
isms, where cellular objectives may differ greatly both between and within tissues. The assumption of maximal
rates of biomass production involves an objective at the
cellular level, whereas in multicellular organisms a given
cell’s objective is likely to be realised via survival at the
organism level, which may not necessarily be dependent
upon the growth of the cell. Moreover, signals from the
extracellular environment may trigger different cellular
priorities and objectives

97
Q

how to get the gpr of all the reactions in the model?

A

gpr=[ ]
for i in model.reactions:
gpr_reaction= model.reactions.get_by_id(i.id).gene_reaction_rule
gpr.append(gpr_reaction)

98
Q

the code for storing the data as an excel sheet in R?

A

write.xlsx2(data, “data.xlsx”)

99
Q

how to read the excel file? what’s the parameter sheet index in R?

A

read.xlsx
read.xlsx2
sheet index is a numerical number specifiying which sheet to read
eg: 1 = sheet number 1 (first sheet) is going to be read

100
Q

what is the class of the output of reading an excel file via read.xlsx() in r?

A

data frame

101
Q

what is the keyword for restarting Rstudio?

A

shift+ctrl+f10

102
Q

what is %in% operator?

A

%in% returns logical vector (TRUE or FALSE but never NA) if there is a match or not for its left operand. Output logical vector has the same length as left operand.

103
Q

how to find the missing values in R?

A

is.na(data) returns true if there is and false if there wouldn’t be a missing value

104
Q

how to read an excel file in python? how to select a specific column?

A

file= pd.read_excel (file path, usecols=[‘name of the column’])

105
Q

how to get the type of a variable in python?

A

type (x)

106
Q

any and all functions?

A

The any function in R will tell if you if there are ANY of the given search terms in your vector. It returns either TRUE or FALSE. To demonstrate this function, let’s create a quick vector that goes from -3 to 5, incrementing by 1.

y

107
Q

how to install a package with pip in jupyter notebook?

A

we add a ! befpre pip,

!pip install parse

108
Q

2 ways of getting the values of special column of excel or dataframe in python?

A

df [‘name of the column]

df.name of the column

109
Q

how to replace something in only a specific column in excel?

A

select and copy that column and do the manipulation on a new spreadsheet. and then paste it on the original file.

110
Q

how to print out both bounds of model reactions?

A

for i in model.reactions:

print(model.reactions.get_by_id(i.id).bounds)

111
Q

what is the rationale behind the E-Flux method?

A

the biological rationale behind our method is that
expression data provide measurements on the level of mRNA for
each gene. If there were limited accumulation of enzyme over the
time course considered, and given a particular level of translational
efficiency, the level of mRNA can be used as an approximate
upper bound on the maximum available protein and hence as an
upper bound on reaction rates to some level of approximation.

112
Q

where FBA method’s prediction is poor?

A

Given their simplicity and minimal data requirement, FBA models often weakly predict metabolic fluxes at elabo-
rate branched-chain reactions and in a cyclic pathway, for example, pentose-phosphate pathway and non-cyclic TCA,
where fluxes change dynamically with prevailing e ­ nvironments 17–

113
Q

what is the shell?

A

the shell is a program that takes commands from the keyboard and gives them to the operating system to perform.

114
Q

the raw data of single microarray consists of what?

A

The raw data from a single microarray consist of a pair of
images representing the fluorescent intensities detected by a photomulti-
plier tube when the microarray is scanned with each of two lasers.

115
Q

what does model fitting mean?

A

Fitting a model to data means choosing the statistical model that predicts values as close as possible to the ones observed in your population.

116
Q

how microarray works?

A

n the Ž rst step of the technique, DNA clones with known
sequence content are spotted and immobilized onto a glass slide or other substrate, the microarray. Next,
pools of mRNA from the cell populations under study are puriŽ ed, reverse-transcribed into cDNA, and
labeled with one of two fluorescent dyes, which we will refer to as “red” and “green.” Two pools of differ-
entially labeled cDNA are combined and applied to a microarray. Labeled cDNA in the pool hybridizes to complementary sequences on the array and any unhybridized cDNA is washed off. Hybridization efŽ ciency
may vary from clone to clone, confounding comparisons between genes. However, if we assume that the
efŽ ciency of an individual clone is not altered by the type of the dye label, then the relative abundance of
a particular mRNA in the two samples can be measured.

117
Q

why we can’t figure out a microarray is single or two channel by searching the platform?

A

The one-colour vs two-colour thing is an experiment-specific choice. You could use an Agilent array for either one- or two-colour experiments, for example. So you actually have to look at the GSE, rather than the GPL information to determine whether a given expt was 2-colour.

118
Q

what is two-channel microarray?

A

The earliest microarrays used two channels, with two RNA samples separately labeled and competitively hybridized to the same array

119
Q

what is genechip?

A

it is the trademark of Affymetrix

it is a comercial microarray platform

120
Q

what is the general concept of microrray?

A

DNA microarrays are widely used to measure genome-wide changes in mRNA expression levels across conditions such as developmental stages, disease states, drug treatment and gene disruption

121
Q

how 2-dye microarray meaure gene expression?

A

Spotted microarrays are commonly hybridized with two samples labelled with two different fluorophores. For these arrays, the ratio of the signal intensities in the two channels is a relative measure of gene expression.

122
Q

what is probe in microarray?

A

In standard
terminology, the cDNAs spotted onto the arrays are called
probes, and those in the samples are called target genes.

123
Q

what are one-channel and two-channel microarry?

A

cDNA microarrays generate one- or two-channel data. In
two-channel use the arrays are hybridized to a mixture
of two samples, each labelled with a different dye (Cy3
and Cy5). In one-channel use, which is the focus of
this paper, each array is hybridized to a single sample, labelled with a single dye. The arrays are laser-scanned at
the wavelength(s) appropriate to the dye(s) used, and the
images are processed to extract data for analysis. In onechannel studies, these usually consist of a measure for the
spot intensity and its local background, for each spot on
the array. In two-channel studies this is available for both
dyes

124
Q

what background-corrected spot intensities reflect?

A

The background-corrected spot intensities reflect the
abundance of the corresponding target genes in the
samples.

125
Q

what are different types of bias in microarray data?

what is correction for bias called?

A

The background-corrected spot intensities reflect the
abundance of the corresponding target genes in the
samples. However, often the relation is not that of simple
proportionality: the signals may be distorted in various
ways. One of these is spatial bias: the presence of
regions with overall higher or lower intensity levels on
the slides (Fig. 1). Another form of distortion may appear
when data from replicate arrays are compared graphically
(Fig. 2) and various forms of systematic departure from
the identity line are observed. This phenomenon is here
termed relative intensity bias.
The process of correcting for bias prior to analysis
is called normalization. The purpose is to promote
uniformity within arrays and reproducibility between
arrays. Normalization has profound effects on subsequent
analysis, irrespective of the methodology used. Failure to
normalize appropriately will generally lead to misleading
conclusions.

126
Q

why it’s hard to find out the absolute gene expression in the 2-dye microarrays?

A

For two-color microarrays, however, it is more diffi-
cult to determine absolute gene expression because of effects such as spot size variation, and
relative expression between two conditions is typically reported

127
Q

the reason tha GEMs should be integrted with omics data?

A

the dvent of highthrouput techniques enabling scientists to simultaneously measure large numbers of molecular components (e.g.,
proteins, metabolites, and nucleic acids). there is a growing
concern that much of these data sit in databases without being used or fully analyzed.
Statistical
inference methods have been widely applied to gain insight into which genes may influence the
activities of others in a given omics data set, however, they do not provide information on the
underlying mechanisms or whether the interactions are direct or distal. genome-scale models derived from
knowledgebases may be used to extract additional biological understanding from omics data
sets and inspire novel applications of this technology to interpretation of complex data sets.

128
Q

two ways of interpreting omics data with GEMs?

A

1- comparing the omics data with GEM predictions.

2- using omics data as a surrogate for modeling regulation

129
Q

how omics data can be used as a surrogate for modeling regulation?

A

Metabolic network reconstructions aim to be comprehensive repositories of biochemical
data for an organism. Thus, models derived from these knowledgebases will include all
possible reactions catalyzed by an organism’s gene products regardless of whether they are
active in a given environment. The all-inclusive nature of these knowledgebases is partially
responsible for false negatives observed in gene essentiality or genetic interaction
simulations35. Biological networks have evolved a degree of robustness against
perturbations that result cascading failures36 – this robustness is due, in part, to the presence of alternative compensatory pathways. However, an alternative pathway that is present in
the global knowledgebase may not be accessible to the organism in the given growth
medium, thus mutation of the principle pathway will result in a phenotype in vivo but not in
silico. The regulatory apparatuses of successful organisms have evolved to express the network
components that are suited to their current environment. If we knew the complete regulatory
structure of an organism and how it worked then we could plausibly compute which cellular
components may be expressed in a given condition; unfortunately, this isn’t known even for
the arguably best-studied bacterium38. Due to stochastic effects arising from low copy
numbers of regulators and enzymes39,40, and intracellular heterogeneity, integrated models
of metabolism and regulation will still be an approximation of individual cells and
populations. In the absence of experimentally elucidated regulatory rules, we can still use omics surveys
in conjunction with functional models to serve as surrogates for a regulatory model, and
create condition- and tissue-specific models

130
Q

how to iterate through a range of numbers in python?

A

for i in range(1,26)===> it will iterate 25 times

for i in range(26)===> it will iterate 26 times, starts from 0, ends in 25

131
Q

what is split function in python?

and how to split from the white space?

A

split() method in Python split a string into a list of strings after breaking the given string by the specified separator.

if we don’t pass any argument to split, it will split from the white space

132
Q

what is the find method in python?

A
The find() method returns the index of first occurrence of the substring (if found). If not found, it returns -1.
if it's given by start and end index, it only searchs in that range
133
Q

the shortcut for getting the file path in linux?

A

ctrl+L

134
Q

how to find the index of an element in a list

A

The index() method returns the index of the specified element in the list.

135
Q

when we read an excel file with x=pd.read_excel code, what’s the type of the x?

A

pandas dataframe

136
Q

how to make a header and normal text in jupyter notebook?

A

in the box on the top which code is chosen, choose heading and then type what you want and the run the cell
for normal text, you should choose the markdown format

137
Q

how to devide numbers without getting the decimals as the answer in python?
how to get the remains?

A

use // instead of /

use this % to get the remain

138
Q

how to edit a text written in markdown format in jupyter notebook?

A

double click on that text

139
Q

how to make different sizes of headers in jupyter notebook?

A
# header 1
## header2
###  3
#### 4
there should be an enter between hashtags an the text
140
Q

how to make a list of bullet points

how to make a list of numbers

A

bullet: * this will make a bullet
number: 1. this will make a number

141
Q

how to make the text italic or bold in jupyter notebook?

A

add 1 astrids in between text without any space.

add 2 astrids for making it bold

142
Q

how to define several variables in a single line?

A

var1, var2, var3 = ‘red’, ‘blue’, ‘green’

143
Q

shortcut for inserting a new cell below in jupyter notebook?

A

esc+B

144
Q

when booleans automatically are converted to integers?

A

when they’re being used with operators with integers. false is converted to 0 and true to 1
eg: 3+True
4

145
Q

the function for converting other data types to boolean

what are the things that will be false?

A

bool()
Only the following values evaluate to False (they are often called falsy values):

The value False itself
The integer 0
The float 0.0
The empty value None
The empty text ""
The empty list []
The empty tuple ()
The empty dictionary {}
The empty set set()
The empty range range(0)
146
Q

what is None type and for what it’s used for?

A

The None type includes a single value None, used to indicate the absence of a value. None has the type NoneType. It is often used to declare a variable whose value may be assigned later.
or when we wanna indicate that the value is missing.

147
Q

how to have our strings in multi line?

A

have in in between ‘’’

148
Q

what is a method in python?

A

Methods are functions associated with data types and are accessed using the . notation e.g. variable_name.method() or “a string”.method(). Methods are a powerful technique for associating common operations with values of specific data types.

149
Q

what does replace method do in python?

A

The .replace method replaces a part of the string with another string. It takes the portion to be replaced and the replacement text as inputs or arguments.
another_day = today.replace(“Satur”, “Wednes”)
it returns the modified string it does not actually change the string in the variable

150
Q

strip and split methods in python?

A

The .split method splits a string into a list of strings at every occurrence of provided character(s).

The .strip method removes whitespace characters from the beginning and end of a string, but not from the middle.
a_long_line = “ This is a long line with some space before, after, and some space in the middle.. “

151
Q

whatv does format do in python?

A

The .format method combines values of other data types, e.g., integers, floats, booleans, lists, etc. with strings. You can use format to construct output messages for display.

cost_of_ice_bag = 1.25
profit_margin = .2
number_of_bags = 500
# Template for output message
output_template = """If a grocery store sells ice bags at $ {} per bag, with a profit margin of {} %, 
then the total profit it makes by selling {} ice bags is $ {}."""

print(output_template)

total_profit = cost_of_ice_bag * profit_margin * number_of_bags
output_message = output_template.format(cost_of_ice_bag, profit_margin*100, number_of_bags, total_profit)

print(output_message)

152
Q

how to add a new value to a specific index of a list in python?

A

A new value can also be inserted at a specific index using the insert method.

fruits.insert(1, ‘banana’)

153
Q

how to remove an item from a list?

A

You can remove a value from a list using the remove method.

fruits.remove(‘blueberry’)
or
fruit.remove(fruit[0]) to remove the first element

154
Q

how to remove and return an element from an index in python?

A

To remove an element from a specific index, use the pop method. The method also returns the removed element.
fruits.pop(1)

155
Q

how to get the dimension and the name of columns of a dataframe in python?

A

We used the shape and columns dataframe attributes to get the shape of our dataframe (number of rows, number of columns) and the column names, respectively.

pd. shape
pd. columns

156
Q

how to create an empty array using numpy?
how to make it 2-D?
how to make a numpy arra

A

Return a new array of given shape and type, without initializing entries.

Parameters
shapeint or tuple of int
Shape of the empty array, e.g., (2, 3) or 2.

dtypedata-type, optional
Desired output data-type for the array, e.g, numpy.int8. Default is numpy.float64.

order{‘C’, ‘F’}, optional, default: ‘C’
Whether to store multi-dimensional data in row-major (C-style) or column-major (Fortran-style) order in memory.

likearray_like
Reference object to allow the creation of arrays which are not NumPy arrays. If an array-like passed in as like supports the __array_function__ protocol, the result will be defined by it. In this case, it ensures the creation of an array object compatible with that passed in via this argument.
ex: m=np.empty(2) ==> returns an 1-D array with 2 items

ex: m=np.empty([2,3]) ==> returns a 2-D array with 2 rows and 3 cols

157
Q

how to remove a character from a string?

A

name=’fete’

m= name.replace(‘e’,’’)

158
Q

how to add several lists to a dataframe?

A

first make tuple of the lists
# Python 3 to get list of tuples from two lists
data_tuples = list(zip(Month,Days))
then make a dataframe of it
pd.DataFrame(data_tuples ,columns=[‘month’,’day’]

or do everything in the same line:
pd.DataFrame(list(zip(lst1, lst2, lst3)),
columns=[‘lst1_title’,’lst2_title’, ‘lst3_title’])

159
Q

how to append two items to a list in python?

A

lis= list()

lis.append ([a,b])

160
Q

when we loop over a dictionary, what will we get in python?

how to access the values in a dictionary?

A

it only returns the keys not the values

dict[key] returns the value

161
Q

how to iterate over the values in a dictionary in python?

how to iterate over boths keys and values?

A

for value in dict.values():
print(value)

for key,value in dict.items():
print(key,value)

162
Q

how to define the steps in the range function in python?

A

range(1,11,2)

163
Q

how to make an empty loop in python? what does the keyword do?

A

for i in x:
pass
pass means nothing happens when this condition is being reached

164
Q

one of the ways to make a dataframe which already has the name of the columns and we don’t have to specify the columns in python?

A

to make a dataframe with a dictionary (the data which is passed is a dictionary)
eg;

df = pd.DataFrame({
    'colA':[True, False, False], 
    'colB': [1, 2, 3],
})
print(df)
    colA  colB
0   True     1
1  False     2
2  False     3
165
Q

what is the difference between insert and assign?

A

pandas.DataFrame.assign() method can be used when you need to insert multiple new columns in a DataFrame, when you need to ignore the index of the column to be added or when you need to overwrite the values of an existing columns.

Alternatively, you can also use pandas.DataFrame.insert(). This method is usually useful when you need to insert a new column in a specific position or index.

df.insert(1, 'colC', s.values)
print(df)
    colA colC  colB
0   True    a     1
1  False    b     2
2  False    c     3
166
Q

what is scope in python?

A

Scope refers to the region within the code where a particular variable is visible. Every function (or class definition) defines a scope within Python. Variables defined in this scope are called local variables. Variables that are available everywhere are called global variables. Scope rules allow you to use the same variable names in different functions without sharing values from one to the other.

167
Q

how to make an optional argument while defining function in python?

A
We'll make this an optional argument with a default value of 0.
def loan_emi(amount, duration, down_payment=0):
    loan_amount = amount - down_payment
    emi = loan_amount / duration
    return emi

All the optinal arguments should come after the required argument

168
Q

what are named argument?

A

Invoking a function with many arguments can often get confusing and is prone to human errors. Python provides the option of invoking functions with named arguments for better clarity. You can also split function invocation into multiple lines.

emi1 = loan_emi(
    amount=1260000, 
    duration=8*12, 
    rate=0.1/12, 
    down_payment=3e5
)
169
Q

how to call help for a function that we don’t know wht it does in python?

A

help(name of the function)

170
Q

how to print a certain row in dataframe?

A

if you know the index

print(df.loc[[159220]])

171
Q

in CAM metabolism, there is a trade off b/w what?

A

However, there is a tradeoff between the
energetic investment in the CO 2 -concentrating mechanism
in the form of the CAM cycle and the benefit of sup-
pressing photorespiration.

172
Q

how to approximate enzyme machinery costs by metabolic networks?

A

the total network flux
can be used as a proxy for the overall enzyme machinery
cost by averaging out the variation between reactions

173
Q

why the cheung 2014 model was allowed to accumulate or store some compounds?

A

To maintain a metabolic output at night (export to the phloem), the model
must accumulate carbon and nitrogen stores during
the day. To explore the metabolic interaction between
the day and the night, the model included a set of sugars
and carboxylic acids to be used as carbon storage mol-
ecules.

174
Q

iMAT and GIMME and E-Flux very basic assumptions?

A

For instance, iMAT and GIMME assume that mRNA levels below a
certain threshold reveal that corresponding reactions are inactive [10, 11]. E-Flux and
PROM assume that transcript level indicates the degree to which the reactions are active
by constraining the upper bounds

175
Q

how to select the first column of a data frame as a data frame and as a serie?

A
# Select first column of the dataframe as a dataframe
first_column = df.iloc[: , :1]
# Select first column of the dataframe as a series
first_column = df.iloc[:, 0]
176
Q

how to get a portion of pandas data frame?

A

df.iloc[row_start:row_end , col_start, col_end]
Arguments:

row_start: The row index/position from where it should start selection. Default is 0.
row_end: The row index/position from where it should end the selection i.e. select till row_end-1. Default is till the last row of the dataframe.
col_start: The column index/position from where it should start selection. Default is 0.
col_end: The column index/position from where it should end the selection i.e. select till end-1. Default is till the last column of the dataframe.
It returns a portion of the dataframe that includes rows from row_start to row_end-1 and columns from col_start to col_end-1.

177
Q

how to compare if 2 list’s elements are identical?

A

sorted(x) == sorted(y)

x and y are our lists

178
Q

how to sort the list without modifying the original list?

A

x=[]
y= sorted (x)
sorted function sorts and return a new list

179
Q

what is numpy? what it is used for?

A

. The Numpy library provides specialized data structures, functions, and other tools for numerical computing in Python.

180
Q

how to construct tissue or organ specific metabolic models?

A

Tissue or cell specific models can be derived from the
metabolic reconstruction to represent tissue and cell specific functions by adding physical–chemical constraints and tissue biomass compositional data.

181
Q

how to specify the type of arguments and the returned object in python function?

A
def greeting(name: str) -> str:
    return 'Hello ' + name
182
Q

what is isinstance() in python?

A

The Python’s isinstance() function checks whether the object or variable (first argument) is an instance of the specified class type or data type.

183
Q

how to make a dict with key being the reaction id and the values be zero

A

dict= {i.id: 0 for i in model.reactions}

184
Q

how to make a dict from 2 cols of the excel sheet?

A

exp_data_2h = dict()
for i,j in zip(exp_data[‘MSU7’],exp_data[‘k354_2h_cold’]):
exp_data_2h[i]=j

185
Q

what are loop control statements?

A

break, continue and pass

186
Q

continue statement?

A

The continue statement is used to skip the rest of the code inside a loop for the current iteration only. Loop does not terminate but continues on with the next iteration.

187
Q

how to have a dataframe without index column being shown?

A

df=df.syle.hide_index()

188
Q

how to set a certain column in pandas data frame as the index column?

A

pd =pd.set_index(‘reaction_id’)

189
Q

how to get the max of a dict values in python?

A

max_val= max(dict.values())

190
Q

how to create a new dict with an already existed dict which was filtered for none values?

A

res = {k:v for k,v in kwargs.items() if v is not None}

191
Q

the function for printing out the exchange fluxes and the flux for OF in cobrapy?

A

model.summary( )

192
Q

how to fix the original OF as an additional constraint in cobrapy?

A

cobra.util.solver.fix_objective_as_constraint(model, fraction=1, bound=None, name=’fixed_objective_{}’)
Fix current objective as an additional constraint.
To avoid that, we can fix the current objective value as a constraint to ignore solutions that give a lower (or higher depending on the optimization direction) objective value than the original model.
Parameters:
model (cobra.Model) – The model to operate on
fraction (float) – The fraction of the optimum the objective is allowed to reach.
bound (float, None) – The bound to use instead of fraction of maximum optimal value. If not None, fraction is ignored.
name (str) – Name of the objective. May contain one {} placeholder which is filled with the name of the old objective.

193
Q

how to store all the metabolites of the model in a single variable?

A

mets= [i for i in model.metabolites]

194
Q

how to exit conda environment and go back to the main command line?

A

conda deactivate

195
Q

review addition of 9

A

search aout it

196
Q

how to calculate the time it takes to run a command?

A

put %%time at the beginning

197
Q

how to check the type of the elements in an np array

A

weights.dtype

198
Q

how to download a file from internet via numpy?

A

import urllib.request
urllib.request.urlretrieve(‘url of the web page the file is in’’)

you will see the file in the main page of jupyter notebook after running the code

199
Q

how to load data from a text file using numpy?

A

np.genfromtxt(file name, sep=’,’, skip_header=1)

200
Q

how to get the reactions associated with a special metabolite in cobrapy?

A

model.metabolites.get_by_id(‘met_id’).reactions

201
Q

how to access a group of columns and rows in a pandas dataframe?

A

df.loc[‘cobra’, ‘shield’]

gets the value of the cobra row and shield column