Genomics Vocab Flashcards
Halotype
a set of DNA variants along the single chromosome that tend to be inherited together
Heterozygosity
having different alleles for a particular trait. Two different alleles of a SNP by an individual.
Autosomal
gene located on one of the numbered or non-sex chromosomes
SNPs
these are single nucleotide polymorphisms which represent a difference in a single DNA building block(nucleotide). Variation at a single base position in the DNA.
Serial founders effect
This occurs when populations migrate over long distances, and involve relatively rapid movements followed by periods of settlement. The population in the migration carry only a subset of genetic diversity carried from previous migrations.
Microsatellite markers
they are small pieces of DNA that repeat. These provide data for phylogenetic studies that seek to explain concordant biogeographic and genetic histories of large scale regions
Principle of component analyses (PCA)
data sets with lots of variables, need to find variables that can be placed into grouping. Technique used to emphasize variation and bring out strong patterns in a dataset.
Fixation index(Fst)
measure of population differentiation due to a genetic structure/ measure genetic distance between populations
Molecular variance(AMOVA)
analysis of molecular variance to detect population differentiation utilizing molecular markers, it is a statistical method in population genetics and molecular ecology.
AlphaMissense
an adaptation of AlphaFold fine-tuned on human and primate variant population frequency data-bases to predict missense variant pathogenicity. It combined structural context and evolutionary conservation to find results across genetic benchmarks
Clustering
the goal is to divide the population or set of data points into a number of groups so that the data points within each group are more comparable to one another and different from data points within the other groups
Genetic admixture
occurs when previously diverged or isolated genetic lineages mix, results in introduction of new genetic lineages into populations
Maximum Likelihood(Ml)
a model used to describe the process that results in the data that are observed
Genome Mosaicism
the presence of cells in a person that have a different genome from the body’s other cells. “A population of cells with its own personal genome
Mosaicism
can arise due to errors that occur during chromosome segregation or DNA replication leading to chromosome aneuploidy, CNVs, genomic rearrangements, single nucleotide variation, or repeat expansions and microsatellite instabilities.
Copy number variants (CNV)
number of copies of a particular gene varies from one individual to the next.
-have been found in clonal isolates of embryonic stem cells
Pathological functions
somatic mosaicism of terminally differentiated cells has long been known to cause cancer
-somatic mosaicism of nervous system tissues underlies a host of neurodevelopmental and perhaps neuropsychiatric diseases
Mutational heterogeneity
mutations at two or more genetic loci that produce the same or similar phenotypes
Uniform model
all genes have same background mutation frequency (BMF)
MutSigCV
identifies genes that are significantly mutated in cancer genomes
-by applying to exome sequences from 3083 tumor-normal pairs and discover extraordinary variation in mutation frequency and spectrum within cancer types, which sheds light on mutational processes and disease aetiology and mutations across genome, which is correlated with DNA replication timing and transcriptional activity.
-this is able to eliminate most of the apparent artifactual findings and enable the identification of genes truly associated with cancer.
Whole-Exome sequencing
a type of genetic sequencing that requires all the exons in the sequence used to understand what may be causing symptoms or diseases
Olfactory receptors
protein capable of binding odor molecules that plays a central role in the sense of smell
Titin
a very large protein that provides structure, flexibility, and stability to the cell’s structures.
False positive
an incorrect presentation that reads as something being present
Deoxynucleotide:
Free 3’OH is needed for ester bond formation during template strand synthesis