C20orf111

Chromosome 20 open reading frame 111, or C20orf111, is a hypothetical protein that in humans is encoded by the C20orf111 gene. C20orf111 is also known as Perit1 (peroxide inducible transcript 1), HSPC207, and dJ1183I21.1. It was originally located using genomic sequencing of chromosome 20. The National Center for Biotechnology Information (NCBI) places it at q13.11 on chromosome 20, whereas the genome browser at the University of California, Santa Cruz (UCSC) places it at q13.12, within a million base pairs of the adenosine deaminase locus. Its expression increases in cells undergoing hydrogen peroxide-induced apoptosis. Analysis of the amino acid content of C20orf111 shows that it is rich in serine residues.

Gene
C20orf111 is a valid, protein-coding gene found on the minus strand of chromosome 20. The UCSC Genome Browser places it at q13.12, whereas RefSeq at NCBI places it at q13.11.

Gene neighborhood
A few of the known genes near C20orf111 are given in the box below with their known function.

General properties

 * Genomic DNA length: 14,968 base pairs (bp)
 * Most common mRNA length: 2,260 bp with 4 exons. Has 10 splice isoforms.
 * 5' untranslated region: 252 bp long
 * 3' untranslated region: 1,129 bp long
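
The transcript figures above imply a coding-sequence length by simple subtraction. The following is a minimal sketch of that arithmetic, assuming the quoted mRNA length consists only of the two untranslated regions plus a coding sequence that ends in a stop codon; the function names are illustrative and do not come from any annotation tool.

```python
# Figures quoted in the list above for the most common C20orf111 mRNA.
MRNA_LENGTH_BP = 2260
UTR5_LENGTH_BP = 252
UTR3_LENGTH_BP = 1129


def coding_length(mrna_bp: int, utr5_bp: int, utr3_bp: int) -> int:
    """Return the coding-sequence length left after removing both UTRs."""
    cds_bp = mrna_bp - utr5_bp - utr3_bp
    if cds_bp <= 0:
        raise ValueError("UTRs are at least as long as the transcript")
    return cds_bp


def implied_residues(cds_bp: int) -> int:
    """Return the amino acid count implied by a CDS that includes a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("coding length is not a whole number of codons")
    return cds_bp // 3 - 1  # assumption: the final codon is the stop codon


cds = coding_length(MRNA_LENGTH_BP, UTR5_LENGTH_BP, UTR3_LENGTH_BP)
residues = implied_residues(cds)
print(cds, residues)  # 879 292
```

Under these assumptions the most common transcript would carry an 879 bp coding sequence, corresponding to a protein of roughly 292 residues.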

Transcript variants


There are 10 predicted splice isoforms that encode good proteins. Altogether they yield 8 different protein isoforms, 2 of which are complete. The image below shows the 10 predicted isoforms. Of these 10 splice isoforms, 8 have varying peptide lengths; however, all of these proteins are hypothetical, and no extensive research has been done on them.

Transcription regulation
The predicted promoter sequence contains no RNA polymerase II binding sites; however, it does contain a binding site for the core promoter element for TATA-less promoters. The same region of the promoter also contains a TATA-binding factor sequence, which helps position RNA polymerase II for transcription.

General properties

 * Contains a highly conserved domain of unknown function 776 (DUF776), which makes up 62% of the entire protein
 * Molecular weight: 31.8 kilodaltons
 * Isoelectric point: 8.57
 * Predicted to be a nuclear protein
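
As a rough plausibility check on the figures above, the molecular weight can be converted into an approximate chain length, and the DUF776 share into an approximate residue count. This is only a sketch: the average residue mass of 110 Da is a common rule of thumb and an assumption here, not a value taken from the source, so the results are estimates rather than annotated lengths.

```python
# Rule-of-thumb average mass of one amino acid residue (assumption, not from the source).
AVERAGE_RESIDUE_MASS_DA = 110.0

# Values quoted in the list above.
MOLECULAR_WEIGHT_KDA = 31.8
DUF776_FRACTION = 0.62


def approximate_length(mass_kda: float,
                       residue_mass_da: float = AVERAGE_RESIDUE_MASS_DA) -> int:
    """Estimate the number of residues from a molecular weight in kilodaltons."""
    return round(mass_kda * 1000 / residue_mass_da)


length = approximate_length(MOLECULAR_WEIGHT_KDA)
duf776_residues = round(DUF776_FRACTION * length)
print(length, duf776_residues)  # 289 179
```

The estimate of about 289 residues is in line with the length implied by the most common transcript (2,260 bp of mRNA less the 252 bp and 1,129 bp untranslated regions leaves 879 bp, or 293 codons), and DUF776 would then span on the order of 180 residues.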

Function
The function of C20orf111 is not well understood. It contains a domain of unknown function, DUF776, a large segment of which is well conserved through Xenopus tropicalis. Its expression also increases in rat cardiomyocytes undergoing hydrogen peroxide-induced apoptosis.

Expression
According to EST profiles in humans, normal (non-cancerous) tissue expresses C20orf111 at a level of 82 transcripts per million. C20orf111 expression increases in rat cardiac myocytes undergoing H2O2-induced apoptosis, suggesting a role in cell death. In bladder, cervical, head and neck, non-neoplasia, pancreatic, and prostate cancer cells, expression levels are lower than normal.



Homology
The C20orf111 gene has no clear paralogs in the human genome. However, it has many orthologs in other organisms. It is highly conserved in organisms such as Xenopus tropicalis and is semi-conserved at the C-terminus in the proto-animal Trichoplax adhaerens.

The following table presents a select number of the orthologs found.

Conservation
The image below is a multiple sequence alignment comparing the conservation of the C20orf111 protein among other organisms. The protein is highly conserved in the DUF776 region among vertebrates, and also at the C-terminus in eukaryotes.



Predicted post-translational modification


Using tools at ExPASy, the following post-translational modifications are predicted for C20orf111.
 * Predicted propeptide cleavage site between positions R81 and S82
 * 30 predicted serine phosphorylation sites
 * 5 predicted threonine phosphorylation sites
 * 3 predicted tyrosine phosphorylation sites

Predicted secondary structure
PELE (Protein Secondary Structure Prediction) was used to predict the secondary structure of C20orf111. There is little β-strand or α-helix secondary structure; a large part of the protein appears to exist as random coil. This is shown in the image of C20orf111 to the right.