Human Gene GPA33 (uc001gea.1) Description and Page Index
  Description: Homo sapiens glycoprotein A33 (transmembrane) (GPA33), mRNA.
RefSeq Summary (NM_005814): The glycoprotein encoded by this gene is a cell surface antigen that is expressed in greater than 95% of human colon cancers. The open reading frame encodes a 319-amino acid polypeptide having a putative secretory signal sequence and 3 potential glycosylation sites. The predicted mature protein has a 213-amino acid extracellular region, a single transmembrane domain, and a 62-amino acid intracellular tail. The sequence of the extracellular region contains 2 domains characteristic of the CD2 subgroup of the immunoglobulin (Ig) superfamily. [provided by RefSeq, Jul 2008].
Publication Note: This RefSeq record includes a subset of the publications that are available for this gene. Please see the Gene record to access additional publications.
##Evidence-Data-START##
Transcript exon combination :: U79725.1, BC069705.1 [ECO:0000332]
RNAseq introns :: single sample supports all introns SAMEA2142348 [ECO:0000348]
##Evidence-Data-END##
##RefSeq-Attributes-START##
MANE Ensembl match :: ENST00000367868.4/ENSP00000356842.3
RefSeq Select criteria :: based on conservation, expression, longest protein
##RefSeq-Attributes-END##
Transcript (Including UTRs)
   Position: hg19 chr1:167,022,082-167,059,868 Size: 37,787 Total Exon Count: 7 Strand: -
Coding Region
   Position: hg19 chr1:167,023,571-167,059,524 Size: 35,954 Coding Exon Count: 7 
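
The sizes listed above follow from the displayed coordinates if the positions are read as 1-based, fully closed intervals. A minimal sketch of that arithmetic, assuming this convention (the helper name `span_size` is illustrative and not part of any UCSC tool):

```python
def span_size(start: int, end: int) -> int:
    """Length of a 1-based, fully closed interval [start, end] (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Transcript and Coding Region lines (hg19, chr1).
transcript = (167_022_082, 167_059_868)
coding = (167_023_571, 167_059_524)

transcript_size = span_size(*transcript)
coding_size = span_size(*coding)

print(transcript_size)  # 37787
print(coding_size)      # 35954
```

Both values agree with the Size fields reported for the transcript and the coding region.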

Data last updated: 2013-06-14

-  Sequence and Links to Tools and Databases
Genomic Sequence (chr1:167,022,082-167,059,868) | mRNA (may differ from genome) | Protein (319 aa)
Gene Sorter | Genome Browser | Other Species FASTA | VisiGene | Gene interactions | Table Schema
BioGPS | CGAP | Ensembl | Entrez Gene | ExonPrimer | GeneCards
OMIM | PubMed | Stanford SOURCE | Treefam | UniProtKB | Wikipedia

-  Comments and Description Text from UniProtKB
DESCRIPTION: RecName: Full=Cell surface A33 antigen; AltName: Full=Glycoprotein A33; Flags: Precursor;
FUNCTION: May play a role in cell-cell recognition and signaling.
SUBCELLULAR LOCATION: Membrane; Single-pass type I membrane protein.
TISSUE SPECIFICITY: Expressed in normal gastrointestinal epithelium and in 95% of colon cancers.
PTM: N-glycosylated, contains approximately 8 kDa of N-linked carbohydrate.
PTM: Palmitoylated.
SIMILARITY: Contains 1 Ig-like C2-type (immunoglobulin-like) domain.
SIMILARITY: Contains 1 Ig-like V-type (immunoglobulin-like) domain.
WEB RESOURCE: Name=Atlas of Genetics and Cytogenetics in Oncology and Haematology; URL="";

-  Genetic Association Studies of Complex Diseases and Disorders
  Genetic Association Database (archive): GPA33
CDC HuGE Published Literature: GPA33
Positive Disease Associations: Insulin, Insulin Resistance
Related Studies:
  1. Insulin
  2. Insulin Resistance

-  MalaCards Disease Associations
  MalaCards Gene Search: GPA33
Diseases sorted by gene-association score: infant botulism (2)

-  Comparative Toxicogenomics Database (CTD)
  The following chemicals interact with this gene

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 107.87 RPKM in Colon - Transverse
Total median expression: 178.58 RPKM


-  mRNA Secondary Structure of 3' and 5' UTRs
Region    Fold Energy (kcal/mol)    Bases    Energy/Base
5' UTR    -132.70                   344      -0.386
3' UTR    -550.12                   1489     -0.369

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.
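
The Energy/Base column appears to be the fold energy divided by the number of bases in the region. A short sketch that recomputes it from the reported values; it does not run RNAfold, and the function name `energy_per_base` is an illustrative assumption:

```python
def energy_per_base(fold_energy_kcal_mol: float, bases: int) -> float:
    """Normalize a folding energy by region length (kcal/mol per base)."""
    if bases <= 0:
        raise ValueError("bases must be positive")
    return fold_energy_kcal_mol / bases


# Values copied from the UTR table: (fold energy in kcal/mol, number of bases).
utr_folds = {
    "5' UTR": (-132.70, 344),
    "3' UTR": (-550.12, 1489),
}

for region, (energy, bases) in utr_folds.items():
    print(f"{region}: {energy_per_base(energy, bases):.3f} kcal/mol per base")
```

Normalizing by length makes the two UTRs comparable even though the 3' UTR is more than four times longer.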

-  Protein Domain and Structure Information
  InterPro Domains: Graphical view of domain structure
IPR007110 - Ig-like
IPR013783 - Ig-like_fold
IPR003598 - Ig_sub2
IPR013106 - Ig_V-set
IPR003596 - Ig_V-set_subgr

Pfam Domains:
PF00047 - Immunoglobulin domain
PF07686 - Immunoglobulin V-set domain
PF13895 - Immunoglobulin domain
PF13927 - Immunoglobulin domain

SCOP Domains:
48726 - Immunoglobulin

ModBase Predicted Comparative 3D Structure on Q99795

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
Mouse          Rat            Zebrafish         D. melanogaster    C. elegans     S. cerevisiae
No ortholog    No ortholog    Genome Browser    No ortholog        No ortholog    No ortholog
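
The reciprocal-best-hit rule described for the more distant species can be sketched as follows. This is a simplified illustration, not the pipeline used to build the table: it assumes BLASTP results have already been parsed into dictionaries of (subject, bit score) pairs, it omits the synteny filter applied to mouse and rat, and all protein identifiers are hypothetical.

```python
from typing import Dict, List, Tuple

# query protein -> list of (subject protein, bit score); structure is an assumption
Hits = Dict[str, List[Tuple[str, float]]]


def best_hit(hits: Hits) -> Dict[str, str]:
    """Keep only the highest-scoring subject for each query."""
    return {
        query: max(subjects, key=lambda pair: pair[1])[0]
        for query, subjects in hits.items()
        if subjects
    }


def reciprocal_best_hits(a_vs_b: Hits, b_vs_a: Hits) -> Dict[str, str]:
    """Pairs (a, b) where b is a's best hit and a is b's best hit."""
    best_ab = best_hit(a_vs_b)
    best_ba = best_hit(b_vs_a)
    return {a: b for a, b in best_ab.items() if best_ba.get(b) == a}


# Hypothetical identifiers and scores, for illustration only.
human_vs_fish: Hits = {
    "humanP1": [("fishP1", 250.0), ("fishP2", 90.0)],
    "humanP2": [("fishP2", 120.0)],
}
fish_vs_human: Hits = {
    "fishP1": [("humanP1", 245.0)],
    "fishP2": [("humanP1", 95.0), ("humanP2", 80.0)],
}

print(reciprocal_best_hits(human_vs_fish, fish_vs_human))  # {'humanP1': 'fishP1'}
```

In this toy input, humanP2 is not reported because its best hit, fishP2, points back to humanP1 instead, which is the kind of case the reciprocal requirement is meant to exclude.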

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0005515 protein binding
GO:0038023 signaling receptor activity

Biological Process:
GO:0007165 signal transduction

Cellular Component:
GO:0005887 integral component of plasma membrane
GO:0016020 membrane
GO:0016021 integral component of membrane
GO:0070062 extracellular exosome

-  Descriptions from all associated GenBank mRNAs
  U79725 - Human A33 antigen precursor mRNA, complete cds.
JD552224 - Sequence 533248 from Patent EP1572962.
BC069705 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:97280 IMAGE:7262529), complete cds.
BC069723 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:97292 IMAGE:7262541), complete cds.
BC069745 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:97304 IMAGE:7262553), complete cds.
BC069761 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:97316 IMAGE:7262565), complete cds.
BC069789 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:97233 IMAGE:7262482), complete cds.
JD138525 - Sequence 119549 from Patent EP1572962.
JD270424 - Sequence 251448 from Patent EP1572962.
BC107164 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:129986 IMAGE:40034324), complete cds.
BC107165 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:129987 IMAGE:40034331), complete cds.
JD320666 - Sequence 301690 from Patent EP1572962.
JD273653 - Sequence 254677 from Patent EP1572962.
JD202211 - Sequence 183235 from Patent EP1572962.
JD108718 - Sequence 89742 from Patent EP1572962.
JD484482 - Sequence 465506 from Patent EP1572962.
JD544016 - Sequence 525040 from Patent EP1572962.
JD160627 - Sequence 141651 from Patent EP1572962.
JD121266 - Sequence 102290 from Patent EP1572962.
JD278787 - Sequence 259811 from Patent EP1572962.
JD189517 - Sequence 170541 from Patent EP1572962.
JD269979 - Sequence 251003 from Patent EP1572962.
JD541649 - Sequence 522673 from Patent EP1572962.
JD156679 - Sequence 137703 from Patent EP1572962.
JD370040 - Sequence 351064 from Patent EP1572962.
JD158196 - Sequence 139220 from Patent EP1572962.
JD220625 - Sequence 201649 from Patent EP1572962.
JD415417 - Sequence 396441 from Patent EP1572962.
JD151101 - Sequence 132125 from Patent EP1572962.
JD142935 - Sequence 123959 from Patent EP1572962.
JD319913 - Sequence 300937 from Patent EP1572962.
JD412355 - Sequence 393379 from Patent EP1572962.
JD477450 - Sequence 458474 from Patent EP1572962.
JD393793 - Sequence 374817 from Patent EP1572962.
BC074830 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:104092 IMAGE:30915543), complete cds.
BC074876 - Homo sapiens glycoprotein A33 (transmembrane), mRNA (cDNA clone MGC:103854 IMAGE:30915236), complete cds.
JD209358 - Sequence 190382 from Patent EP1572962.
JD453002 - Sequence 434026 from Patent EP1572962.
JD137692 - Sequence 118716 from Patent EP1572962.
AK312833 - Homo sapiens cDNA, FLJ93270, Homo sapiens glycoprotein A33 (transmembrane) (GPA33), mRNA.
HQ447448 - Synthetic construct Homo sapiens clone IMAGE:100070776; CCSB013999_03 glycoprotein A33 (transmembrane) (GPA33) gene, encodes complete protein.
KJ892966 - Synthetic construct Homo sapiens clone ccsbBroadEn_02360 GPA33 gene, encodes complete protein.
CU687006 - Synthetic construct Homo sapiens gateway clone IMAGE:100022555 5' read GPA33 mRNA.
JD435993 - Sequence 417017 from Patent EP1572962.
JD215173 - Sequence 196197 from Patent EP1572962.
JD544918 - Sequence 525942 from Patent EP1572962.
JD066004 - Sequence 47028 from Patent EP1572962.
JD421733 - Sequence 402757 from Patent EP1572962.
JD199591 - Sequence 180615 from Patent EP1572962.
JD088213 - Sequence 69237 from Patent EP1572962.
JD437641 - Sequence 418665 from Patent EP1572962.
JD480591 - Sequence 461615 from Patent EP1572962.
JD074987 - Sequence 56011 from Patent EP1572962.
JD498939 - Sequence 479963 from Patent EP1572962.

-  Other Names for This Gene
  Alternate Gene Symbols: GPA33_HUMAN, NM_005814, NP_005805, Q5VZP6, Q99795
UCSC ID: uc001gea.1
RefSeq Accession: NM_005814
Protein: Q99795 (aka GPA33_HUMAN or A33_HUMAN)
CCDS: CCDS1258.1

-  Gene Model Information
category: coding               nonsense-mediated-decay: no    RNA accession: NM_005814.1
exon count: 7                  CDS single in 3' UTR: no       RNA size: 2793
ORF size: 960                  CDS single in intron: no       Alignment % ID: 99.96
txCdsPredict score: 2024.00    frame shift in genome: no      % Coverage: 100.00
has start codon: yes           stop codon in genome: no       # of Alignments: 1
has end codon: yes             retained intron: no            # AT/AC introns: 0
selenocysteine: no             end bleed into intron: 0       # strange splices: 0
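
The figures reported on this page can be cross-checked against one another. A small sketch, assuming that the ORF size counts the stop codon and that the RNA size is the sum of the 5' UTR, the ORF, and the 3' UTR (both assumptions are consistent with the numbers but are not stated on the page):

```python
# Values copied from this page.
protein_length_aa = 319   # Protein (319 aa)
orf_size = 960            # Gene Model Information: ORF size
rna_size = 2793           # Gene Model Information: RNA size
utr5, utr3 = 344, 1489    # Bases in the 5' and 3' UTRs (RNA structure table)

# Assumption: the ORF includes one stop codon in addition to the coding codons.
expected_orf = (protein_length_aa + 1) * 3

# Assumption: the mRNA is exactly 5' UTR + ORF + 3' UTR.
expected_rna = utr5 + orf_size + utr3

print(expected_orf == orf_size)  # True
print(expected_rna == rna_size)  # True
```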
