C4orf21

C4orf21 (is_associated_with::Chromosome 4 open reading frame 21) is a is_associated_with::protein in humans that is encoded by the C4orf21 is_associated_with::gene that has uncharacterised function and a weight of 236.6 kDa. The encoded protein of this gene has been linked with is_associated_with::alcohol dependence. This gene shows relatively low expression in most human tissues, with increased expression in situations of chemical dependence. C4orf21 is orthologous to nearly all kingdoms of Eukarya. Functional domains of this protein link it to a series of is_associated_with::helicases, most notably the AAA_12 and AAA_11 domains.

Gene
The entire gene is 97,663 is_associated_with::base pairs long and has an unprocessed is_associated_with::mRNA that is 6,740 is_associated_with::nucleotides in length. It consists of 28 exons that encode for a 2104 is_associated_with::amino acid protein.



Locus
C4orf21 is located on the fourth chromosome on the 4q25 position near the LARP7 gene. It is encoded for on the minus strand.

Homologous domains
C4orf21 contains a DUF2439 domain (domain of unknown function), zf-GRF domain, and AAA_11 and an AAA_12 domain (ATPases associated with diverse cellular activities). DUF domains are involved in telomere maintenance and meiotic segregation. AAA_11 and AAA_12 contain a P-loop motif which are involved in conjugative transfer proteins. Other helicase domains are also present in c4orf21 orthologs.

Paralogs
There are 9 moderately-related proteins in humans that are paralogous to the ATP-dependent helicase containing domains in the C-terminus of c4orf21 after the 1612th amino acid. A majority of these proteins are in the RNA helicase family. There are no known is_associated_with::paralogs to the large N-terminal portion of the protein.

Orthologs
Complete is_associated_with::orthologs of the c4orf21 gene are found in mammalia. The helicase domain containing C-terminus portion of the gene is conserved across Eukarya.

Primary sequence
C4orf21 is 236.6 kDa.

Post-translational modifications
C4orf21 has experimentally determined is_associated_with::phosphorylation sites at the Y38, S137, S140, S325, and S864 positions.

Secondary structure
A weak is_associated_with::transmembrane domain is predicted in the TMHMM server with one loop in the C-terminus of the protein prior to the helicase core. This domain contains both ends outside of a membrane.

Tertiary domains and quaternary structure
C4orf21 has related structures to is_associated_with::Upf1, a paralog. These structures have the capability to bind zinc ions and mRNA.

Function and biochemistry
The function of c4orf21 is unknown. Given this, the paralogs to the helicase core of the gene are associated with translation, transcription, is_associated_with::nonsense-mediated mRNA decay, is_associated_with::RNA decay, is_associated_with::miRNA processing, RISC assembly, and is_associated_with::pre-mRNA splicing. These paralogs operate under a SPF1 RNA helicase motif.

is_associated_with::Mov10, a paralog, and probable RNA helicase is required for RNA-mediated gene silencing by the is_associated_with::RNA-induced silencing complex (RISC). It is also required for both miRNA-mediated translational repression and miRNA-mediated cleavage of complementary mRNAs by RISC, and for RNA-directed transcription and replication of the human hepatitis delta virus (HDV). Mov10 nteracts with small capped HDV RNAs derived from genomic hairpin structures that mark the initiation sites of RNA-dependent is_associated_with::HDV RNA transcription.

Expression
Expression is relatively low for c4orf21 compared to other proteins. Expression of c4orf21 is slightly elevated compared to its average expression in tissue in the is_associated_with::hematopoietic and is_associated_with::lymphatic systems, and is above average in the is_associated_with::brain also. Lower averages exist in is_associated_with::liver, is_associated_with::pharynx, and is_associated_with::skin tissue.

Transcription factor interactions
The transcriptional start site for c4orf21 aligns best with ATF, is_associated_with::CREB, is_associated_with::deltaCREB, is_associated_with::E2F, and is_associated_with::E2F-1 is_associated_with::transcription factor binding sites.

Interacting proteins
C4orf21 shows predicted protein interaction with its AQR, is_associated_with::DNA2, is_associated_with::IGHMBP2, is_associated_with::LOC91431, and is_associated_with::SETX paralogs.

Clinical significance
C4orf21 has been previously linked to is_associated_with::alcohol dependence (where genes linked to this disorder are also linked to alcoholism and other psychological and is_associated_with::personality disorder). Given this, expression of the gene in the liver and brain are particularly interesting. Upon examination of variable GEO profiles, there were many related to is_associated_with::Hepatitis and other disorders of the liver. The best correlative studies were those in relation to liver is_associated_with::transplant failure. The link to alcohol dependence provides a strong connection to dependence to other chemical substances such as is_associated_with::nicotine through analysis of lymphoblast cells. C4orf21 showed significantly increased expression in those who were nicotine dependent versus a control group of non-smokers. Upregulation of c4orf21 was also present in certain cancer expression data sets.

A paralog of c4orf21 was found to inhibit is_associated_with::HIV-1 Replication at multiple stages. is_associated_with::Mov10 is involved in the biological processes of RNA-mediated gene silencing, transcription, transcription regulation and has is_associated_with::hydrolase and is_associated_with::helicase activity through ATP and RNA binding.