Schema for Interrupted Rpts - Fragments of Interrupted Repeats Joined by RepeatMasker ID
  Database: mm10    Primary Table: nestedRepeats    Row Count: 425,483   Data last updated: 2012-02-07
Format description: BED12+ describing joined (by ID) fragments of repeats from RepeatMasker
On download server: MariaDB table dump directory
fieldexampleSQL type info description
bin 607smallint(6) range Indexing field to speed chromosome range queries.
chrom chr1varchar(255) values Chromosome (or contig, scaffold, etc.)
chromStart 3004040int(10) unsigned range Start position in chromosome
chromEnd 3005439int(10) unsigned range End position in chromosome
name L1_Rodvarchar(255) values Name of item
score 471int(10) unsigned range Average of fragment identity scores, transformed into 0..1000 range for shading
strand +char(1) values +, -, or . for mixed (some fragments +, some -)
thickStart 3004040int(10) unsigned range for BED compatibility -- same as chromStart
thickEnd 3005439int(10) unsigned range for BED compatibility -- same as chromEnd
reserved 0int(10) unsigned range for BED compatibility
blockCount 3int(11) range Number of blocks
blockSizes 166,731,438,longblob   Comma separated list of block (fragment) sizes
chromStarts 0,230,961,longblob   Start positions relative to chromStart
blockStrands +,+,+,longblob   Strand of each fragment
id 4int(10) unsigned range RepeatMasker-assigned ID used to join fragments
repClass LINEvarchar(255) values Class of repeat
repFamily L1varchar(255) values Family of repeat

Sample Rows
 
binchromchromStartchromEndnamescorestrandthickStartthickEndreservedblockCountblockSizeschromStartsblockStrandsidrepClassrepFamily
607chr130040403005439L1_Rod471+3004040300543903166,731,438,0,230,961,+,+,+,4LINEL1
607chr130054603006764Lx9384+300546030067640288,1194,0,110,+,+,6LINEL1
607chr130081163010711L1_Mur2312-3008116301071104367,181,1249,26,0,1053,1287,2569,-,-,-,-,10LINEL1
607chr130116413012534RLTR25A174-3011641301253402283,564,0,329,-,-,15LTRERVK
607chr130131313013734L1M20-301313130137340249,370,0,233,-,-,17LINEL1
75chr130144713025821L1VL1696+3014471302582102278,4778,0,6572,+,+,21LINEL1
608chr130303453037508Lx2598-3030345303750804206,807,3842,1539,0,206,1365,5624,-,-,-,-,30LINEL1
608chr130377263044040L1Md_A892+30377263044040025321,993,0,5321,+,+,34LINEL1
608chr130498663050560BGLII_Mur454-3049866305056002310,31,0,663,-,-,47LTRERVK
608chr130523263060120Lx9503+305232630601200387,106,83,0,87,7711,+,+,+,59LINEL1

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Interrupted Rpts (nestedRepeats) Track Description
 

Description

This track shows joined fragments of interrupted repeats extracted from the output of the RepeatMasker program which screens DNA sequences for interspersed repeats and low complexity DNA sequences using the Repbase Update library of repeats from the Genetic Information Research Institute (GIRI). Repbase Update is described in Jurka (2000) in the References section below.

The detailed annotations from RepeatMasker are in the RepeatMasker track. This track shows fragments of original repeat insertions which have been interrupted by insertions of younger repeats or through local rearrangements. The fragments are joined using the ID column of RepeatMasker output.

Display Conventions and Configuration

In pack or full mode, each interrupted repeat is displayed as boxes (fragments) joined by horizontal lines, labeled with the repeat name. If all fragments are on the same strand, arrows are added to the horizontal line to indicate the strand. In dense or squish mode, labels and arrows are omitted and in dense mode, all items are collapsed to fit on a single row.

Items are shaded according to the average identity score of their fragments. Usually, the shade of an item is similar to the shades of its fragments unless some fragments are much more diverged than others. The score displayed above is the average identity score, clipped to a range of 50% - 100% and then mapped to the range 0 - 1000 for shading in the browser.

Methods

UCSC has used the most current versions of the RepeatMasker software and repeat libraries available to generate these data. Note that these versions may be newer than those that are publicly available on the Internet.

Data are generated using the RepeatMasker -s flag. Additional flags may be used for certain organisms. See the FAQ for more information.

Credits

Thanks to Arian Smit, Robert Hubley and GIRI for providing the tools and repeat libraries used to generate this track.

References

Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. http://www.repeatmasker.org. 1996-2010.

Repbase Update is described in:

Jurka J. Repbase Update: a database and an electronic journal of repetitive elements. Trends Genet. 2000 Sep;16(9):418-420. PMID: 10973072

For a discussion of repeats in mammalian genomes, see:

Smit AF. Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr Opin Genet Dev. 1999 Dec;9(6):657-63. PMID: 10607616

Smit AF. The origin of interspersed repeats in the human genome. Curr Opin Genet Dev. 1996 Dec;6(6):743-8. PMID: 8994846