nuID: A Novel Identifier for Oligos, Ideal for Oligonucleotide-based Microarrays

Authors Pan Du, Warren A. Kibbe, and Simon M. Lin
Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL 60611


Abstract

Motivation: Oligonucleotide probes that are sequence identical may have different identifiers between manufacturers and even between different versions of the same company's microarray; and some-times the same identifier is reused and represents a completely different oligonucleotide, resulting in ambiguity and potentially mis-identification of the genes hybridizing to that probe.

Results: We have devised a unique, non-degenerate encoding scheme that can be used as a universal representation to identify an oligonucleotide across manufacturers. We have named the encoded representation as nuID, for nucleotide universal identifier. Inspired by the fact that the raw sequence of the oligonucleotide is the true definition of identity for a probe, the encoding algorithm uniquely and non-degenerately transforms the sequence itself into a compact identifier. In addition, we added a redundancy check (checksum) to validate the integrity of the identifier. These two steps, encoding plus checksum, result in a nuID, which is a unique, non-generate, robust and efficient representation of the probe sequence. We demonstrate the utility of nuID for Illumina microarrays, and we believe it has universal utility for the naming of oligomers.

 
Correspondence to Simon Lin
Tel: (+1) 312 695 1331
Fax: (+1) 312 695 1347
Supplementary material supplementary.pdf
Publication URL Link to the journal's website. TBA
PubMed URL TBA
Publication Citation Du P, Kibbe WA, and Lin SM, nuID: A Novel Identifier for Oligos, Ideal for Oligonucleotide-based Microarrays, 2007 (submitted)
 


nuID Website
Conversion between nuID and nucleotide sequence
Retrieve nuID Illumina annotation online
nuID Illumina Annotations
 


R Code Release: (included in "lumi" Bioconductor package)
Files
Description File Name
Encoding and decoding functions in R
coding.R
Illumina Human annotation package Human.lumi_1.0.1.tar.gz (29.1 M)
Illumina Mouse annotation package Mouse.lumi_1.0.1.tar.gz (41.9 M)
Readme Readme.txt
 


About this webpage
Created 10-12-2006. Last updated 10-12-2006.
http://basic.northwestern.edu/publications/nuID