Entity Population Sequence Statistics Detail

Description: An association between a population and an allele which can be used to identify the genomic statistical values that are linked to a set of subjects within a cohort.

Attributes
Alelle Frequency Rt
Cohort Heterozygosity Rt
Effective From Dt
Effective To Dt
Load Info Sk
Population Sequence Statistics Sk
Source Code Sk
Tenant Sk
Total Samples Num
Valid From Ts
Valid To Ts

Relationship
Population Sequence Statistics Detail_Population Sequence Statistics_FK

Primary Key
Population Sequence Statistics Detail PK

Dependencies
 

Reverse Dependencies
 

Attribute Details

 Alelle Frequency Rt
Description: The frequency of the allele in this cohort group.
The allele frequency represents the incidence of a gene variant in a population.
An allele frequency is calculated by dividing the number of times the allele of interest is observed in a population by the total number of copies of all the alleles at that particular genetic locus in the population.
Allele frequencies can be represented as a decimal, a percentage, or a fraction.
In a population, allele frequencies are a reflection of genetic diversity. Changes in allele frequencies over time can indicate that genetic drift is occurring or that new mutations have been introduced into the population.
Data Type: Standards - Data Domains.ddm/Data Domains/Rate [FLOAT(5)]
Is Part Of PrimaryKey: false
Is Required: false
Is Derived: false
Is Surrogate Key: false
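
The calculation described above can be sketched as follows. This is a minimal illustration, not the method used to populate Alelle Frequency Rt: the function name `allele_frequencies`, the tuple-per-sample genotype layout, and the sample cohort are all assumptions made for the example.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Return {allele: frequency} for a single locus.

    `genotypes` holds one tuple per sample, listing the allele copies
    observed at the locus (two per sample for a diploid organism).
    """
    # Count every observed copy of every allele at this locus.
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total_copies = sum(counts.values())
    if total_copies == 0:
        raise ValueError("no allele copies observed")
    # Frequency = copies of the allele / total copies of all alleles.
    return {allele: n / total_copies for allele, n in counts.items()}


# Hypothetical cohort of four diploid samples at one locus.
cohort = [("A", "A"), ("A", "G"), ("A", "A"), ("A", "G")]
freqs = allele_frequencies(cohort)
print(freqs)  # {'A': 0.75, 'G': 0.25}
```

Here allele A is observed 6 times out of 8 copies, giving 0.75 as a decimal, 75% as a percentage, or 3/4 as a fraction.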



 Cohort Heterozygosity Rt
Description: The heterozygosity or genetic diversity associated with the known Cohort samples. A measure of genetic variation in a population.

For example:
If an individual carries the gene for black hair and the gene for blond hair, we would say that individual is heterozygous for hair color.
Heterozygosity may also refer to the percentage of locations on a chromosome that are heterozygous in an individual. These locations are called loci (the singular form is locus) and may contain more than one gene. The concept of heterozygosity is frequently extended from an individual to a population in the study of population genetics.
Heterozygosity in a population is calculated as follows:
1) Let p_i be the frequency of the allele that has an index number of i for a given locus. The value of p_i may therefore range from 0 to 1, and the frequencies of all alleles at the locus sum to 1.
2) Calculate the predicted heterozygosity for a single locus. This is given by the equation H = 1 - Σ_i p_i^2. Since the sum of the terms p_i^2 is greater than 0 and at most 1, the heterozygosity is a value from 0 up to, but not including, 1. Heterozygosity may therefore be expressed as a percentage.
3) Interpret the significance of the predicted heterozygosity at a single locus. The equation H = 1 - Σ_i p_i^2 shows that the maximum heterozygosity occurs when the alleles for that locus are equally common. For example, for two equally common alleles, the heterozygosity is 1 - Σ_i p_i^2 = 1 - (1/2)^2 - (1/2)^2 = 1/2.
4) Calculate the predicted heterozygosity for multiple loci. In this case, we wish to find the average of the sum of the squares of the allele frequencies and subtract it from 1. Thus, the heterozygosity for m loci is 1 - (1/m) Σ_l Σ_i p_li^2, where l indexes the loci and p_li is the frequency of allele i at locus l.
5) Evaluate the observed heterozygosity of a population for a single locus. We have Ho = (Σ_i x_i) / n, where Ho is the observed heterozygosity, n is the number of individuals in the population, and x_i is 0 if the two alleles in the individual with index i are the same and 1 if they are different.
Data Type: Standards - Data Domains.ddm/Data Domains/Rate [FLOAT(5)]
Is Part Of PrimaryKey: false
Is Required: false
Is Derived: false
Is Surrogate Key: false
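
The numbered steps above can be sketched in code. This is an illustrative sketch only: the function names and input layouts (a list of allele frequencies per locus, a pair of alleles per individual) are assumptions for the example, and the model does not state which of these measures is stored in Cohort Heterozygosity Rt.

```python
def expected_heterozygosity(freqs):
    """Predicted heterozygosity at one locus: 1 - sum(p_i ** 2)."""
    return 1.0 - sum(p * p for p in freqs)


def mean_expected_heterozygosity(loci):
    """Predicted heterozygosity over m loci.

    `loci` is a list with one list of allele frequencies per locus;
    the result is 1 - (1/m) * sum over loci of sum(p_i ** 2).
    """
    m = len(loci)
    if m == 0:
        raise ValueError("at least one locus is required")
    return 1.0 - sum(sum(p * p for p in freqs) for freqs in loci) / m


def observed_heterozygosity(genotypes):
    """Observed heterozygosity at one locus: sum(x_i) / n.

    `genotypes` holds one (allele, allele) pair per individual; x_i is 1
    when the two alleles differ and 0 when they are the same.
    """
    n = len(genotypes)
    if n == 0:
        raise ValueError("at least one individual is required")
    return sum(1 for a, b in genotypes if a != b) / n


# Two equally common alleles, as in step 3.
print(expected_heterozygosity([0.5, 0.5]))  # 0.5
# Hypothetical cohort: two of four individuals are heterozygous.
print(observed_heterozygosity([("A", "A"), ("A", "G"), ("A", "A"), ("A", "G")]))  # 0.5
```

A locus with a single fixed allele (frequency 1.0) yields a predicted heterozygosity of 0, the lower bound of the range.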



 Effective From Dt
Description: Establishes a period where a set of attributes is true according to the business.
Data Type: Standards - Data Domains.ddm/Data Domains/Date [DATE]
Is Part Of PrimaryKey: false
Is Required: true
Is Derived: false
Is Surrogate Key: false



 Effective To Dt
Description: Ends a period of effectivity.
Data Type: Standards - Data Domains.ddm/Data Domains/Date [DATE]
Is Part Of PrimaryKey: false
Is Required: false
Is Derived: false
Is Surrogate Key: false



 Load Info Sk
Description: The surrogate key of the load information entry describing the details regarding the loading of the row.
Data Type: Standards - Data Domains.ddm/Data Domains/Surrogate Key Large [LONG]
Is Part Of PrimaryKey: false
Is Required: true
Is Derived: false
Is Surrogate Key: false



 Population Sequence Statistics Sk
Description: The surrogate key for anchor Cohort Sequence Statistics.
Data Type: Standards - Data Domains.ddm/Data Domains/Surrogate Key Large [LONG]
Is Part Of PrimaryKey: true
Is Required: true
Is Derived: false
Is Surrogate Key: false



 Source Code Sk
Description: The origin of the data identifying the actual load source, vendor, manual key entry, or context of the data in a specific row in the database.
Data Type: Standards - Data Domains.ddm/Data Domains/Surrogate Key [INTEGER]
Is Part Of PrimaryKey: false
Is Required: true
Is Derived: false
Is Surrogate Key: false



 Tenant Sk
Description: The surrogate key of the entry identifying the legal owner of the data.
Data Type: Standards - Data Domains.ddm/Data Domains/Surrogate Key [INTEGER]
Is Part Of PrimaryKey: false
Is Required: true
Is Derived: false
Is Surrogate Key: false



 Total Samples Num
Description: The number of samples analyzed to create these statistics.
Data Type: Standards - Data Domains.ddm/Data Domains/Number Integer [INTEGER]
Is Part Of PrimaryKey: false
Is Required: false
Is Derived: false
Is Surrogate Key: false



 Valid From Ts
Description: Establishes a period where a set of attributes is true in the source system. This is populated with the transaction timestamp and is used as the snapshot date.
Data Type: Standards - Data Domains.ddm/Data Domains/Timestamp [TIMESTAMP]
Is Part Of PrimaryKey: true
Is Required: true
Is Derived: false
Is Surrogate Key: false



 Valid To Ts
Description: Ends a period of validity.
Data Type: Standards - Data Domains.ddm/Data Domains/Timestamp [TIMESTAMP]
Is Part Of PrimaryKey: false
Is Required: false
Is Derived: false
Is Surrogate Key: false
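
As an illustration of how Valid From Ts and Valid To Ts might be used to pick a snapshot, the sketch below filters rows by a point in time. It rests on assumptions the model does not state: the validity period is treated as half-open (from inclusive, to exclusive), a missing Valid To Ts is read as "still current", and the dictionary keys and the `rows_valid_at` helper are hypothetical names.

```python
from datetime import datetime


def rows_valid_at(rows, as_of):
    """Return the rows whose validity period contains `as_of`.

    Assumed semantics: [valid_from_ts, valid_to_ts), with a valid_to_ts
    of None meaning the row has not been superseded.
    """
    return [
        row
        for row in rows
        if row["valid_from_ts"] <= as_of
        and (row["valid_to_ts"] is None or as_of < row["valid_to_ts"])
    ]


# Hypothetical history for a single Population Sequence Statistics Sk.
history = [
    {
        "valid_from_ts": datetime(2020, 1, 1),
        "valid_to_ts": datetime(2021, 1, 1),
        "total_samples_num": 100,
    },
    {
        "valid_from_ts": datetime(2021, 1, 1),
        "valid_to_ts": None,
        "total_samples_num": 250,
    },
]

snapshot = rows_valid_at(history, datetime(2020, 6, 15))
print([row["total_samples_num"] for row in snapshot])  # [100]
```

With the half-open convention, a timestamp that falls exactly on a boundary matches only the newer row, so consecutive periods never overlap.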

Relationship Details

 Population Sequence Statistics Detail_Population Sequence Statistics_FK
Is Identifying Relationship: true
Child Table: Population Sequence Statistics Detail
Child Multiplicity: ZERO_TO_MANY
Child Referential Integrity: On Delete: NONE
Child Referential Integrity: On Insert: NONE
Child Referential Integrity: On Update: NONE
Parent Table: Population Sequence Statistics
Parent Multiplicity: ONE
Parent Referential Integrity: On Delete: NONE
Parent Referential Integrity: On Insert: NONE
Parent Referential Integrity: On Update: NONE

Primary Key Details

 Population Sequence Statistics Detail PK
Key Attribute: Population Sequence Statistics Sk
Key Attribute: Valid From Ts
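
For readers who want to work with this entity in code, the sketch below mirrors the attributes and the composite primary key as a Python dataclass. The class name, the snake_case field names, the mapping of model data types to Python types, and the `duplicate_primary_keys` helper are all assumptions for illustration; the entity itself is defined only by the listing above.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date, datetime
from typing import Optional


@dataclass(frozen=True)
class PopulationSequenceStatisticsDetail:
    # Attributes marked Is Required: true.
    population_sequence_statistics_sk: int
    valid_from_ts: datetime
    effective_from_dt: date
    load_info_sk: int
    source_code_sk: int
    tenant_sk: int
    # Attributes marked Is Required: false.
    alelle_frequency_rt: Optional[float] = None
    cohort_heterozygosity_rt: Optional[float] = None
    effective_to_dt: Optional[date] = None
    total_samples_num: Optional[int] = None
    valid_to_ts: Optional[datetime] = None

    @property
    def primary_key(self):
        """Composite key: Population Sequence Statistics Sk + Valid From Ts."""
        return (self.population_sequence_statistics_sk, self.valid_from_ts)


def duplicate_primary_keys(records):
    """Return the primary keys that occur more than once in `records`."""
    counts = Counter(record.primary_key for record in records)
    return [key for key, n in counts.items() if n > 1]


# Hypothetical rows: the second repeats the key of the first.
first = PopulationSequenceStatisticsDetail(
    1, datetime(2021, 1, 1), date(2021, 1, 1), 10, 20, 30
)
second = PopulationSequenceStatisticsDetail(
    1, datetime(2021, 1, 1), date(2021, 1, 1), 11, 20, 30
)
print(duplicate_primary_keys([first, second]))
```

Because Valid From Ts is part of the key, several rows can share one Population Sequence Statistics Sk as long as each starts at a different timestamp.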