Human Gene TOX4 (uc001waz.3) Description and Page Index
  Description: Homo sapiens TOX high mobility group box family member 4 (TOX4), mRNA.
Transcript (Including UTRs)
   Position: hg19 chr14:21,945,335-21,967,319
   Size: 21,985   Total Exon Count: 9   Strand: +
Coding Region
   Position: hg19 chr14:21,945,438-21,964,764
   Size: 19,327   Coding Exon Count: 9
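
The Size values above follow from the Position ranges when both ends are counted, that is, when the displayed coordinates are read as 1-based and inclusive. A minimal sketch of that arithmetic; the helper name `span_size` is hypothetical and not part of any UCSC tool.

```python
import re

def span_size(position: str) -> int:
    """Length of a 'chrom:start-end' range, counting both end positions."""
    match = re.fullmatch(r"(\w+):([\d,]+)-([\d,]+)", position.strip())
    if match is None:
        raise ValueError(f"unrecognized position: {position!r}")
    start = int(match.group(2).replace(",", ""))
    end = int(match.group(3).replace(",", ""))
    return end - start + 1  # inclusive range, hence the +1

transcript_size = span_size("chr14:21,945,335-21,967,319")
coding_size = span_size("chr14:21,945,438-21,964,764")
print(transcript_size, coding_size)  # 21985 19327
```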

Data last updated: 2013-06-14

-  Sequence and Links to Tools and Databases
Genomic Sequence (chr14:21,945,335-21,967,319), mRNA (may differ from genome), Protein (621 aa)
Gene Sorter, Genome Browser, Other Species FASTA, Gene interactions, Table Schema, BioGPS
CGAP, Ensembl, Entrez Gene, ExonPrimer, GeneCards, GeneNetwork
OMIM, PubMed, Stanford SOURCE, UniProtKB, Wikipedia

-  Comments and Description Text from UniProtKB
DESCRIPTION: RecName: Full=TOX high mobility group box family member 4; AltName: Full=Epidermal Langerhans cell protein LCP1;
FUNCTION: Component of the PTW/PP1 phosphatase complex, which plays a role in the control of chromatin structure and cell cycle progression during the transition from mitosis into interphase.
SUBUNIT: Component of the PTW/PP1 phosphatase complex, composed of PPP1R10/PNUTS, TOX4, WDR82 and PPP1CA or PPP1CB or PPP1CC. Interacts with PPP1R10/PNUTS.
SUBCELLULAR LOCATION: Nucleus (Probable). Note=Associated with chromatin.
SIMILARITY: Contains 1 HMG box DNA-binding domain.
SEQUENCE CAUTION: Sequence=BAA34457.2; Type=Erroneous initiation; Note=Translation N-terminally shortened;

-  Comparative Toxicogenomics Database (CTD)
  The chemicals that interact with this gene are listed in CTD.

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 29.82 RPKM in Testis
Total median expression: 955.26 RPKM

-  mRNA Secondary Structure of 3' and 5' UTRs
Region    Fold Energy (kcal/mol)    Bases    Energy/Base
5' UTR    -29.80                    103      -0.289
3' UTR    -753.87                   2555     -0.295

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.
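
The Energy/Base figure is the fold energy divided by the number of bases in the region. A small sketch that recomputes it from the values listed for the two UTRs; the function name `energy_per_base` is illustrative only, and the code does not run RNAfold itself.

```python
# (fold energy in kcal/mol, number of bases) as listed for each UTR
utr_folds = {
    "5' UTR": (-29.80, 103),
    "3' UTR": (-753.87, 2555),
}

def energy_per_base(fold_energy: float, bases: int) -> float:
    """Normalize a folding energy by region length."""
    return fold_energy / bases

for region, (energy, bases) in utr_folds.items():
    print(f"{region}: {energy_per_base(energy, bases):.3f} kcal/mol per base")
```

Normalizing by length makes the two regions comparable: the 3' UTR has a far more negative total energy mainly because it is much longer, while the per-base values are nearly the same.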

-  Protein Domain and Structure Information
  InterPro Domains:
IPR009071 - HMG_superfamily

Pfam Domains:
PF00505 - HMG (high mobility group) box
PF09011 - HMG-box domain

SCOP Domains:
47095 - HMG-box

ModBase Predicted Comparative 3D Structure on O94842

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit and filtering out non-syntenic hits. For more distant species, reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotation in the other species rather than a true absence of the orthologous gene.
Species            Ortholog
Mouse              No ortholog
Rat                Ortholog present (linked in Genome Browser)
Zebrafish          Ortholog present (linked in Genome Browser)
D. melanogaster    No ortholog
C. elegans         No ortholog
S. cerevisiae      No ortholog
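
The reciprocal-best-hit rule used for the more distant species can be sketched as follows. This is a toy illustration under stated assumptions: the protein names and scores are invented, the function names are hypothetical, and the synteny filter applied to human, mouse, and rat is not modeled.

```python
def best_hits(scores):
    """Map each query to its highest-scoring subject.

    `scores` maps (query, subject) pairs to a similarity score (higher is better).
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}

def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Keep only pairs where each protein is the other's best hit."""
    forward = best_hits(a_vs_b)
    backward = best_hits(b_vs_a)
    return {a: b for a, b in forward.items() if backward.get(b) == a}

# Hypothetical scores, for illustration only.
human_vs_fish = {
    ("humanA", "fishX"): 410.0,
    ("humanA", "fishY"): 120.0,
    ("humanB", "fishX"): 300.0,
}
fish_vs_human = {
    ("fishX", "humanA"): 405.0,
    ("fishX", "humanB"): 290.0,
    ("fishY", "humanA"): 118.0,
}

pairs = reciprocal_best_hits(human_vs_fish, fish_vs_human)
print(pairs)  # {'humanA': 'fishX'}
```

In this toy data `humanB` also has `fishX` as its best hit, but `fishX` prefers `humanA`, so only the `humanA`/`fishX` pair is reported. A protein left unpaired in this way would appear as "No ortholog" under such a rule.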

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0000981 RNA polymerase II transcription factor activity, sequence-specific DNA binding
GO:0003677 DNA binding
GO:0005515 protein binding

Biological Process:
GO:0006357 regulation of transcription from RNA polymerase II promoter

Cellular Component:
GO:0000785 chromatin
GO:0005634 nucleus
GO:0072357 PTW/PP1 phosphatase complex
GO:0000784 nuclear chromosome, telomeric region

-  Descriptions from all associated GenBank mRNAs
  BC035050 - Homo sapiens TOX high mobility group box family member 4, mRNA (cDNA clone IMAGE:5266241), with apparent retained intron.
AK295913 - Homo sapiens cDNA FLJ54357 complete cds, highly similar to Epidermal Langerhans cell protein LCP1.
AK299807 - Homo sapiens cDNA FLJ54382 complete cds, highly similar to Epidermal Langerhans cell protein LCP1.
BC020727 - Homo sapiens TOX high mobility group box family member 4, mRNA (cDNA clone IMAGE:4721305).
AK298555 - Homo sapiens cDNA FLJ54372 complete cds, highly similar to Epidermal Langerhans cell protein LCP1.
BC013689 - Homo sapiens TOX high mobility group box family member 4, mRNA (cDNA clone MGC:9406 IMAGE:3880134), complete cds.
BC020292 - Homo sapiens TOX high mobility group box family member 4, mRNA (cDNA clone IMAGE:3464161).
AB018280 - Homo sapiens KIAA0737 mRNA for KIAA0737 protein.
CU680652 - Synthetic construct Homo sapiens gateway clone IMAGE:100020788 5' read TOX4 mRNA.
AB385378 - Synthetic construct DNA, clone: pF1KA0737, Homo sapiens TOX4 gene for TOX high mobility group box family member 4, complete cds, without stop codon, in Flexi system.
EU446865 - Synthetic construct Homo sapiens clone IMAGE:100070261; IMAGE:100012074; FLH258197.01L TOX high mobility group box family member 4 (TOX4) gene, encodes complete protein.
EU831791 - Synthetic construct Homo sapiens clone HAIB:100066820; DKFZo008D1121 TOX high mobility group box family member 4 protein (TOX4) gene, encodes complete protein.
EU831869 - Synthetic construct Homo sapiens clone HAIB:100066898; DKFZo004D1122 TOX high mobility group box family member 4 protein (TOX4) gene, encodes complete protein.
KJ898107 - Synthetic construct Homo sapiens clone ccsbBroadEn_07501 TOX4 gene, encodes complete protein.
AK022060 - Homo sapiens cDNA FLJ11998 fis, clone HEMBB1001521.
JD156131 - Sequence 137155 from Patent EP1572962.
JD070367 - Sequence 51391 from Patent EP1572962.
JD118863 - Sequence 99887 from Patent EP1572962.
JD347431 - Sequence 328455 from Patent EP1572962.
JD546919 - Sequence 527943 from Patent EP1572962.
JD511253 - Sequence 492277 from Patent EP1572962.
AY305872 - Homo sapiens migration-inducing protein 7 (MIG7) mRNA, complete cds.
JD176294 - Sequence 157318 from Patent EP1572962.
JD305750 - Sequence 286774 from Patent EP1572962.
JD532426 - Sequence 513450 from Patent EP1572962.
JD202641 - Sequence 183665 from Patent EP1572962.
JD137403 - Sequence 118427 from Patent EP1572962.
JD181347 - Sequence 162371 from Patent EP1572962.
JD321561 - Sequence 302585 from Patent EP1572962.
JD131750 - Sequence 112774 from Patent EP1572962.
JD414452 - Sequence 395476 from Patent EP1572962.
JD371234 - Sequence 352258 from Patent EP1572962.
JD337512 - Sequence 318536 from Patent EP1572962.
JD207629 - Sequence 188653 from Patent EP1572962.
JD356263 - Sequence 337287 from Patent EP1572962.
JD101222 - Sequence 82246 from Patent EP1572962.
JD457716 - Sequence 438740 from Patent EP1572962.
JD122292 - Sequence 103316 from Patent EP1572962.
JD080102 - Sequence 61126 from Patent EP1572962.
JD228093 - Sequence 209117 from Patent EP1572962.
JD557596 - Sequence 538620 from Patent EP1572962.
JD308269 - Sequence 289293 from Patent EP1572962.
JD425841 - Sequence 406865 from Patent EP1572962.
JD213647 - Sequence 194671 from Patent EP1572962.
JD554452 - Sequence 535476 from Patent EP1572962.
JD379622 - Sequence 360646 from Patent EP1572962.
JD379623 - Sequence 360647 from Patent EP1572962.
JD238932 - Sequence 219956 from Patent EP1572962.
JD374436 - Sequence 355460 from Patent EP1572962.
JD442214 - Sequence 423238 from Patent EP1572962.
JD040777 - Sequence 21801 from Patent EP1572962.
JD206851 - Sequence 187875 from Patent EP1572962.
JD102394 - Sequence 83418 from Patent EP1572962.
JD301427 - Sequence 282451 from Patent EP1572962.
JD098116 - Sequence 79140 from Patent EP1572962.
JD559123 - Sequence 540147 from Patent EP1572962.

-  Other Names for This Gene
  Alternate Gene Symbols: C14orf92, KIAA0737, NM_014828, NP_055643, O94842, TOX4_HUMAN
UCSC ID: uc001waz.3
RefSeq Accession: NM_014828
Protein: O94842 (aka TOX4_HUMAN)
CCDS: CCDS32043.1

-  Gene Model Information
category: coding               nonsense-mediated-decay: no    RNA accession: NM_014828.2
exon count: 9                  CDS single in 3' UTR: no       RNA size: 4538
ORF size: 1866                 CDS single in intron: no       Alignment % ID: 100.00
txCdsPredict score: 3920.00    frame shift in genome: no      % Coverage: 99.69
has start codon: yes           stop codon in genome: no       # of Alignments: 1
has end codon: yes             retained intron: no            # AT/AC introns: 0
selenocysteine: no             end bleed into intron: 0       # strange splices: 0
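
Several numbers on this page can be cross-checked against each other. The sketch below rests on two assumptions that the page does not state: that the ORF size counts the stop codon (the model reports "has end codon: yes"), and that % Coverage is the fraction of the RNA accounted for by the 5' UTR, ORF, and 3' UTR lengths. The function names are hypothetical.

```python
def protein_length(orf_size: int, includes_stop: bool = True) -> int:
    """Amino-acid count implied by an ORF length given in bases."""
    if orf_size % 3 != 0:
        raise ValueError("ORF size is not a multiple of 3")
    codons = orf_size // 3
    # Assumption: the final codon is a stop codon and is not translated.
    return codons - 1 if includes_stop else codons

def percent_coverage(utr5: int, orf: int, utr3: int, rna_size: int) -> float:
    """Assumed definition: bases placed in the model over total RNA bases."""
    return 100.0 * (utr5 + orf + utr3) / rna_size

aa = protein_length(1866)                         # ORF size from the table
coverage = percent_coverage(103, 1866, 2555, 4538)  # UTR sizes from the RNA fold section
print(aa, round(coverage, 2))  # 621 99.69
```

Under these assumptions the results agree with the 621 aa protein length and the 99.69 % Coverage reported elsewhere on the page, which suggests the readings are reasonable but does not confirm how the values were actually derived.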
