FAM200A

C7orf38 (also designated FAM200A) is a gene located on chromosome 7 in the human genome. The gene is expressed in nearly all tissue types at very low levels. Evolutionarily, it can be found throughout the kingdom Animalia. While the function of the protein is not fully understood, bioinformatic tools have shown that the protein bears much similarity to zinc finger or transposase proteins. Many of its orthologs, paralogs, and neighboring genes have been shown to possess zinc finger domains. The protein contains a hAT dimerization domain near its C-terminus. This domain is highly conserved in transposase enzymes.

Gene
C7orf38 is located on chromosome 7 at q22.1. Its genomic sequence contains 5,612 bp. The predominant transcript contains two exons and is 2,507 bp in length. The translated protein contains 573 amino acids.
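The lengths above imply how much of the transcript is protein-coding. The following is a minimal arithmetic sketch, assuming the 573 residues are translated from one continuous open reading frame that ends in a single stop codon; the function name `coding_length_nt` is hypothetical and used only for illustration.

```python
# Figures taken from the text above.
PROTEIN_LENGTH_AA = 573
TRANSCRIPT_LENGTH_BP = 2507


def coding_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: three per residue,
    plus three for the stop codon (assumption: one continuous ORF)."""
    return protein_length_aa * 3 + (3 if include_stop else 0)


cds_nt = coding_length_nt(PROTEIN_LENGTH_AA)
untranslated_nt = TRANSCRIPT_LENGTH_BP - cds_nt  # 5' and 3' UTRs combined

print(f"coding sequence: {cds_nt} nt")
print(f"untranslated:    {untranslated_nt} nt")
print(f"coding fraction: {cds_nt / TRANSCRIPT_LENGTH_BP:.1%}")
```

Under these assumptions, roughly two thirds of the predominant transcript is coding sequence and the remainder is untranslated.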

Protein composition
The 573-amino-acid protein has a molecular weight of 66,280.05 Da. The isoelectric point occurs at a pH of 5.775, about 1.6 pH units below the average human pH. Two deviations from prototypical human proteins are evident: the protein contains fewer glycine residues than expected and is rich in leucine residues. There are no regions of strong hydrophobicity or hydrophilicity. Thus, it is not predicted to be a transmembrane protein.
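The kind of analysis described above can be sketched as follows. This is an illustrative example, not the method used to obtain the reported values: it computes amino acid composition and a sliding-window hydropathy profile with the Kyte-Doolittle scale, using the common rule of thumb that a 19-residue window averaging above 1.6 marks a candidate transmembrane segment. The two sequences are made-up toy examples, not the C7orf38 sequence, and all function names are hypothetical.

```python
from collections import Counter

# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def composition(seq: str) -> dict[str, float]:
    """Fraction of each residue type present in the sequence."""
    counts = Counter(seq)
    return {aa: counts[aa] / len(seq) for aa in sorted(counts)}


def hydropathy_profile(seq: str, window: int = 19) -> list[float]:
    """Mean Kyte-Doolittle score over every window of the given width."""
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def has_candidate_tm_segment(seq: str, window: int = 19,
                             threshold: float = 1.6) -> bool:
    """True if any window average exceeds the threshold (rule of thumb)."""
    return any(v > threshold for v in hydropathy_profile(seq, window))


# Toy sequences for illustration only (assumptions, not real proteins).
toy_soluble = "MSDEEKRKLQ" * 3
toy_membrane = "MSDEEK" + "LIVAL" * 4 + "KRDEES"

print(composition(toy_soluble))
print(has_candidate_tm_segment(toy_soluble))
print(has_candidate_tm_segment(toy_membrane))
```

A protein with no window exceeding the threshold, as the text reports for C7orf38, would not be flagged as a transmembrane candidate by this heuristic.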

Gene neighborhood
The four genes closest to C7orf38 on chromosome 7 have similar functions; many of them are transcription factors.

Paralogs
Eight paralogs are found in the human proteome. Like the neighboring genes, many of the paralogs function as zinc finger proteins or transcription factors.

Orthologs
Orthologs of C7orf38 can be traced back evolutionarily as far as plants. The following is not an exhaustive list of orthologs. It is intended to provide an evolutionary overview of the conservation of C7orf38.

Protein
CBLAST was used to identify a structurally related protein with an experimentally determined structure. The Hermes DNA transposase, of the Hermes DBD superfamily, was shown to be structurally similar (E-value: 1E-6).

The hAT dimerization domain is found at the C-terminus of transposase elements belonging to the Activator superfamily (hAT element superfamily). The isolated dimerization domain forms extremely stable dimers in vitro.

mRNA
The MFOLD program, available on the Rensselaer BioInformatics Server, was used to predict the secondary structure of the mature mRNA sequence. The primary sequence underlying the predicted secondary structures is highly conserved among orthologs, suggesting structural importance.

Tissue distribution
The gene appears to be expressed in most tissue types. EST profiles show very low levels of expression, and no differences were observed between health or developmental states.