C16orf58

Chromosome 16 open reading frame 58, or C16orf58, also known as FLJ13638 is a is_associated_with::protein which in humans is encoded by the C16orf58 is_associated_with::gene. The gene itself is 18892 bp long, with is_associated_with::mRNA of 2760 bp, and a protein sequence of 468 is_associated_with::amino acids. There is a conserved domain of unknown, DUF647. No function has been determined for this gene yet, but it is predicted that it resides in the is_associated_with::endoplasmic reticulum in the is_associated_with::cytoplasm.

Species distribution
C16orf58 has very interesting conservation in that it has is_associated_with::orthologs back through is_associated_with::plants and is_associated_with::fungi. However, it has not been found in is_associated_with::reptiles, is_associated_with::birds, or is_associated_with::amphibians. The below table shows some, but not all, orthologs which were found using BLAST.

Protein Interactions
Though the function is still unknown, C16orf58 has been shown to interact with three different proteins:


 * MVD MVD stands for disphosphomevalonate decarboxylase which is an enzyme which functions in is_associated_with::cholesterol biosynthesis.


 * is_associated_with::BSCL2 is_associated_with::BSCL2 is the Bernardinelli-Seip congenital lipodystrophy 2, or seipin. It located in the endoplasmic reticulum and is thought to be important in the is_associated_with::lipid droplet morphology.


 * is_associated_with::TSC22D4 The third interacting protein is TSC22D4, TSC22 domain family member 4, and functions as a is_associated_with::leucine zipper translational regulation.

Structure
Although there are several sites that will give predictions on protein structure, C16orf58 does not have a known structure yet. That being said there is at least one is_associated_with::transmembrane domain, if not more. Within the protein structure there are several extended areas with uncharged amino acids, these could be possible transmembrane domains, or is_associated_with::hydrophobic cores. The below shows the charge of each of the amino acids in the protein sequence, + for positive, - for negative and 0 for uncharged. Note the large segments of uncharged amino acids appear bolded. These stretches of uncharged amino acids are conserved back through distant orthologs. 1 00—000-00 000-00000- 0+00+000-0 0000-0000+ 00000+0000 +0-0+-00-0       61  0000000000 0000000000 000-0000-0 000000-000 0000000000 0000000000      121  0000+00000 0000000+-0 00000+0000 00+00+0-00 0+00+000-0 00-00000-0      181  0000000000 000000000+ 0000000000 +00000000+ +0000-000+ -000-00000      241  0000000000 0000000000 0000000000 000000+00+ 0000-000-0 +0+000+000      301  0+0-00-000 00+0-00000 0000000000 0000+00000 0-00000-00 0-000000-0      361  0000000000 0+000+000+ 0000000000 000-00000- 0—0+0+0+0 00++-00000      421  +-00-00-00 00+00+000- 000+0-+000 -00-0+0000 000-++00