Schema for kgTxInfo
  Database: mm10    Primary Table: kgTxInfo    Row Count: 63,814   Data last updated: 2018-04-13
Format description: Various bits of information about a transcript from the txGraph/txCds system (aka KG3)
On download server: MariaDB table dump directory
fieldexampleSQL type info description
name uc007afh.1varchar(255) values Name of transcript
category codingvarchar(255) values coding/nearCoding/noncoding for now
sourceAcc NM_008866.2varchar(255) values Accession of genbank transcript patterned on (may be refSeq)
isRefSeq 1tinyint(3) unsigned range Is a refSeq
sourceSize 2447int(11) range Number of bases in source, excluding poly-A tail.
aliCoverage 0.994279double range Fraction of bases in source aligning.
aliIdRatio 1double range matching/total bases in alignment
genoMapCount 1int(11) range Number of times source aligns in genome.
exonCount 9int(11) range Number of exons (excludes gaps from frame shift/stops)
orfSize 693int(11) range Size of ORF
cdsScore 1494.5double range Score of best CDS according to txCdsPredict
startComplete 1tinyint(3) unsigned range Starts with ATG
endComplete 1tinyint(3) unsigned range Ends with stop codon
nonsenseMediatedDecay 0tinyint(3) unsigned range If true, is a nonsense mediated decay candidate.
retainedIntron 0tinyint(3) unsigned range True if has a retained intron compared to overlapping transcripts
bleedIntoIntron 0int(11) range If nonzero number of bases start or end of tx bleeds into intron
strangeSplice 0int(11) range Count of splice sites not gt/ag, gc/ag, or at/ac
atacIntrons 0int(11) range Count of number of at/ac introns
cdsSingleInIntron 0tinyint(3) unsigned range True if CDS is single exon and in intron of other transcript.
cdsSingleInUtr3 0tinyint(3) unsigned range True if CDS is single exon and in 3' UTR of other transcript.
selenocysteine 0tinyint(3) unsigned range If true TGA codes for selenocysteine
genomicFrameShift 0tinyint(3) unsigned range True if genomic version has frame shift we cut out
genomicStop 0tinyint(3) unsigned range True if genomic version has stop codon we cut out

Connected Tables and Joining Fields
        knownGeneV39.bioCycPathway.kgID (via kgTxInfo.name)
      knownGeneV39.ceBlastTab.query (via kgTxInfo.name)
      knownGeneV39.dmBlastTab.query (via kgTxInfo.name)
      knownGeneV39.drBlastTab.query (via kgTxInfo.name)
      knownGeneV39.foldUtr3.name (via kgTxInfo.name)
      knownGeneV39.foldUtr5.name (via kgTxInfo.name)
      knownGeneV39.gnfAtlas2Distance.query (via kgTxInfo.name)
      knownGeneV39.gnfAtlas2Distance.target (via kgTxInfo.name)
      knownGeneV39.kgAlias.kgID (via kgTxInfo.name)
      knownGeneV39.kgColor.kgID (via kgTxInfo.name)
      knownGeneV39.kgProtAlias.kgID (via kgTxInfo.name)
      knownGeneV39.kgSpAlias.kgID (via kgTxInfo.name)
      knownGeneV39.kgTargetAli.qName (via kgTxInfo.name)
      knownGeneV39.kgXref.kgID (via kgTxInfo.name)
      knownGeneV39.knownAttrs.kgID (via kgTxInfo.name)
      knownGeneV39.knownBlastTab.query (via kgTxInfo.name)
      knownGeneV39.knownBlastTab.target (via kgTxInfo.name)
      knownGeneV39.knownCanonical.protein (via kgTxInfo.name)
      knownGeneV39.knownCanonical.transcript (via kgTxInfo.name)
      knownGeneV39.knownCds.name (via kgTxInfo.name)
      knownGeneV39.knownGene.name (via kgTxInfo.name)
      knownGeneV39.knownGeneMrna.name (via kgTxInfo.name)
      knownGeneV39.knownGenePep.name (via kgTxInfo.name)
      knownGeneV39.knownIsoforms.transcript (via kgTxInfo.name)
      knownGeneV39.knownToEnsembl.name (via kgTxInfo.name)
      knownGeneV39.knownToGnfAtlas2.name (via kgTxInfo.name)
      knownGeneV39.knownToHprd.name (via kgTxInfo.name)
      knownGeneV39.knownToLocusLink.name (via kgTxInfo.name)
      knownGeneV39.knownToMrna.name (via kgTxInfo.name)
      knownGeneV39.knownToMrnaSingle.name (via kgTxInfo.name)
      knownGeneV39.knownToMupit.name (via kgTxInfo.name)
      knownGeneV39.knownToPfam.name (via kgTxInfo.name)
      knownGeneV39.knownToRefSeq.name (via kgTxInfo.name)
      knownGeneV39.knownToSuper.gene (via kgTxInfo.name)
      knownGeneV39.knownToTag.name (via kgTxInfo.name)
      knownGeneV39.knownToU133.name (via kgTxInfo.name)
      knownGeneV39.knownToU95.name (via kgTxInfo.name)
      knownGeneV39.knownToVisiGene.name (via kgTxInfo.name)
      knownGeneV39.knownToWikipedia.name (via kgTxInfo.name)
      knownGeneV39.mmBlastTab.query (via kgTxInfo.name)
      knownGeneV39.rnBlastTab.query (via kgTxInfo.name)
      knownGeneV39.scBlastTab.query (via kgTxInfo.name)
      knownGeneV39.ucscScop.ucscId (via kgTxInfo.name)
      mm10.bioCycPathway.kgID (via kgTxInfo.name)
      mm10.ccdsKgMap.geneId (via kgTxInfo.name)
      mm10.ceBlastTab.query (via kgTxInfo.name)
      mm10.dmBlastTab.query (via kgTxInfo.name)
      mm10.drBlastTab.query (via kgTxInfo.name)
      mm10.foldUtr3.name (via kgTxInfo.name)
      mm10.foldUtr5.name (via kgTxInfo.name)
      mm10.hgBlastTab.query (via kgTxInfo.name)
      mm10.keggPathway.kgID (via kgTxInfo.name)
      mm10.kgAlias.kgID (via kgTxInfo.name)
      mm10.kgColor.kgID (via kgTxInfo.name)
      mm10.kgProtAlias.kgID (via kgTxInfo.name)
      mm10.kgProtMap2.qName (via kgTxInfo.name)
      mm10.kgSpAlias.kgID (via kgTxInfo.name)
      mm10.kgTargetAli.qName (via kgTxInfo.name)
      mm10.kgXref.kgID (via kgTxInfo.name)
      mm10.knownAttrs.kgID (via kgTxInfo.name)
      mm10.knownBlastTab.query (via kgTxInfo.name)
      mm10.knownBlastTab.target (via kgTxInfo.name)
      mm10.knownCanonical.transcript (via kgTxInfo.name)
      mm10.knownCds.name (via kgTxInfo.name)
      mm10.knownGene.name (via kgTxInfo.name)
      mm10.knownGeneMrna.name (via kgTxInfo.name)
      mm10.knownGenePep.name (via kgTxInfo.name)
      mm10.knownIsoforms.transcript (via kgTxInfo.name)
      mm10.knownToEnsembl.name (via kgTxInfo.name)
      mm10.knownToKeggEntrez.name (via kgTxInfo.name)
      mm10.knownToLocusLink.name (via kgTxInfo.name)
      mm10.knownToLynx.name (via kgTxInfo.name)
      mm10.knownToMrna.name (via kgTxInfo.name)
      mm10.knownToMrnaSingle.name (via kgTxInfo.name)
      mm10.knownToPfam.name (via kgTxInfo.name)
      mm10.knownToRefSeq.name (via kgTxInfo.name)
      mm10.knownToSuper.gene (via kgTxInfo.name)
      mm10.knownToTag.name (via kgTxInfo.name)
      mm10.knownToVisiGene.name (via kgTxInfo.name)
      mm10.knownToWikipedia.name (via kgTxInfo.name)
      mm10.rnBlastTab.query (via kgTxInfo.name)
      mm10.scBlastTab.query (via kgTxInfo.name)
      mm10.ucscScop.ucscId (via kgTxInfo.name)

Sample Rows
 
namecategorysourceAccisRefSeqsourceSizealiCoveragealiIdRatiogenoMapCountexonCountorfSizecdsScorestartCompleteendCompletenonsenseMediatedDecayretainedIntronbleedIntoIntronstrangeSpliceatacIntronscdsSingleInIntroncdsSingleInUtr3selenocysteinegenomicFrameShiftgenomicStop
uc007afh.1codingNM_008866.2124470.9942791196931494.5110000000000
uc007afg.1codingAK050549.107650.9973861186511438.5110016950000000
uc007afi.2codingNM_011541.4127270.97946511109062012110000100000
uc011wht.1codingNM_001159750.1127240.97944211109032006110000100000
uc011whu.1codingNM_001159751.1126200.97862611109391843.5110000100000
uc057aty.1nearCodingAK048564.1040570.67020.999632110201.5000000000000
uc007afn.2codingNM_133826.5120631111414523081.5110000000000
uc057atz.1codingNM_001310442.1120091111313982973.5110000000000
uc007afm.2codingAK081492.1023190.9995690.999569165911329.5110016440000000
uc007afo.2codingNM_011011.214677111411432484110000000000

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.