GLIS1

Glis1 (Glis Family Zinc Finger 1) is gene encoding a Krüppel-like protein of the same name whose locus is found on Chromosome 1p32.3. The gene is enriched in unfertilised eggs and embryos at the one cell stage and it can be used to promote direct reprogramming of is_associated_with::somatic cells to is_associated_with::induced pluripotent stem cells, also known as iPS cells. Glis1 is a highly promiscuous is_associated_with::transcription factor, regulating the expression of numerous genes, either positively or negatively. In organisms, Glis1 does not appear to have any directly important functions. Mice whose Glis1 gene has been removed have no noticeable change to their is_associated_with::phenotype.

Structure


Glis1 is an 84.3 kDa is_associated_with::proline rich protein composed of 789 amino acids. No crystal structure has yet been determined for Glis1, however it is homologous to other proteins in many parts of its amino acid sequence whose structures have been solved.

Zinc finger domain
Glis1 uses a Zinc finger domain comprising five tandem Cys2His2 zinc finger motifs (meaning the zinc atom is coordinated by two is_associated_with::cysteine and two is_associated_with::histidine residues) to interact with target is_associated_with::DNA sequences to regulate gene transcription. The domain interacts sequence specifically with the DNA, following the major groove along the double helix. It has the is_associated_with::consensus sequence GACCACCCAC. The individual zinc finger motifs are separated from one another by the is_associated_with::amino acid sequence(T/S)GEKP(Y/F)X, where X can be any amino acid and (A/B) can be either A or B. This domain is homologous to the zinc finger domain found in Gli1 and so is thought to interact with DNA in the same way. The alpha helices of the fourth and fifth zinc fingers are inserted in to the major groove and make the most extensive contact of all the zinc fingers with the DNA. Very few contact are made by the second and third fingers and the first finger does not contact the DNA at all. The first finger does make numerous protein-protein interactions with the second zinc finger, however.

Termini
Glis1 has an activation domain at its is_associated_with::C-terminus and a repressive domain at its is_associated_with::N-terminus. The repressive domain is much stronger than the activation domain meaning transcription is weak. The activation domain of Glis1 is four times stronger in the presence of CaM kinase IV. This may be due to a coactivator. A proline-rich region of the protein is also found towards the N-terminal. The protein's termini are fairly unique, and have no strong sequence similarity other proteins.

Use in cell reprogramming
Glis1 can be used as one of the four factors used in reprogramming somatic cells to induced pluripotent stem cells. The three transcription factors Oct3/4, Sox2 and Klf4 are essential for reprogramming but are extremely inefficient on their own, fully reprogramming roughly only 0.005% of the number of cells treated with the factors. When Glis1 is introduced with these three factors, the efficiency of reprogramming is massively increased, producing many more fully reprogrammed cells. The transcription factor c-Myc can also be used as the fourth factor and was the original fourth factor used by is_associated_with::Shinya Yamanaka who received the 2012 Nobel Prize in Physiology or Medicine for his work in the conversion of somatic cells to iPS cells. Yamanaka's work allows a way of bypassing the controversy surrounding stem cells.

Mechanism
Somatic cells are most often fully differentiated in order to perform a specific function, and therefore only express the genes required to perform their function. This means the genes that are required for differentiation to other types of cell are packaged within is_associated_with::chromatin structures, so that they are not expressed.

Glis1 reprograms cells by promoting multiple pro-reprogramming pathways. These pathways are activated due to the up regulation of the transcription factors is_associated_with::N-Myc, Mycl1, c-Myc, is_associated_with::Nanog, ESRRB, is_associated_with::FOXA2, is_associated_with::GATA4, is_associated_with::NKX2-5, as well as the other three factors used for reprogramming. Glis1 also up-regulates expression of the protein is_associated_with::LIN28 which binds the let-7 is_associated_with::microRNA precursor, preventing production of active let-7. Let-7 microRNAs reduce the expression of pro-reprogramming genes via is_associated_with::RNA interference. Glis1 is also able to directly associate with the other three reprogramming factors which may help their function.

The result of the various changes in gene expression is the conversion of is_associated_with::heterochromatin, which is very difficult to access, to is_associated_with::euchromatin, which can be easily accessed by transcriptional proteins and enzymes such as is_associated_with::RNA polymerase. During reprogramming, is_associated_with::histones, which make up is_associated_with::nucleosomes, the complexes used to package DNA, are generally demethylated and acetylated 'unpacking' the DNA by neutralising the positive charge of the is_associated_with::lysine residues on the N-termini of histones.

Advantages over c-myc
Glis1 has a number of extremely important advantages over c-myc in cell reprogramming.


 * No risk of cancer: Although c-myc enhances the efficiency of reprogramming, its major disadvantage is that it is a is_associated_with::proto-oncogene meaning the iPS cells produced using c-myc are much more likely to become cancerous. This is an enormous obstacle between iPS cells and their use in medicine. When Glis1 is used in cell reprogramming, there is no increased risk of cancer development.


 * Production of fewer 'bad' colonies: While c-myc promotes the proliferation of reprogrammed cells, it also promotes the proliferation of 'bad' cells which have not reprogrammed properly and make up the vast majority of cells in a dish of treated cells. Glis1 actively suppresses the proliferation of cells that have not fully reprogrammed, making the selection and harvesting of the properly reprogrammed cells less laborious. This is likely to be due to many of these 'bad' cells expressing Glis1 but not all four of the reprogramming factors. When expressed on its own, Glis1 inhibits proliferation.


 * More efficient reprogramming: The use of Glis1 reportedly produces more fully reprogrammed iPS cells than c-myc. This is an important quality given the inefficiency of reprogramming.

Disadvantages

 * Inhibition of Proliferation: Failure to stop Glis1 expression after reprogramming inhibits cell proliferation and ultimately leads to the death of the reprogrammed cell. Therefore careful regulation of Glis1 expression is required. This explains why Glis1 expression is switched off in is_associated_with::embryos after they have started to divide.

Roles in disease
Glis1 has been implicated to play a part in a number of diseases and disorders.

Psoriasis
Glis1 has been shown to be heavily up regulated in is_associated_with::psoriasis, a disease which causes chronic inflammation of the skin. Normally, Glis1 is not expressed in the skin at all. However during inflammation, it is expressed in the spinous layer of the skin, the second layer from the bottom of four layers as a response to the inflammation. This is the last layer where the cells have nuclei and thus the last layer where gene expression occurs. It is believed that the role of Glis1 in this disease is to promote cell differentiation in the skin by changing the increasing the expression of multiple pro-differentation genes such as is_associated_with::IGFBP2 which inhibits proliferation and can also promote is_associated_with::apoptosis It also decreases the expression of Jagged1, a ligand of notch in the is_associated_with::notch signaling pathway and Frizzled10, a receptor in the is_associated_with::wnt signaling pathway.

Late onset Parkinson's Disease
A certain allele of Glis1 which exists due to a is_associated_with::single nucleotide polymorphism, a change in a single nucleotide of the DNA sequence of the gene, has been implicated as a risk factor in the neurodegenerative disorder is_associated_with::Parkinson's disease. The allele is linked to the late onset variety of Parkinson's, which is acquired in old age. The reason behind this link is not yet known.