O-GLYCBASE is a revised database of O- and C-glycosylated proteins.
Version 6.00 has 242 glycoprotein entries. The criteria for inclusion are at least one experimentally verified O- or C-glycosylation site. The terminal sugar linked to serine or threonine is cited when known. The database is non-redundant in the sense that it contains no identical sequences, unless there is conflicting glycosylation data. Mucins have tandem repeat sequences, which are O-glycosylated. This result in some redundancy of the O-glycosylation sites. For prediction purposes we have also included a version of the database which contains no identical O-glycosylation sites (window=9) called O-Unique.seq. This data set has been used as the training set of the netOglyc prediction server (Hansen et al. 1995).
Nucleic acids research 1999;
O-GLYCBASE version 4.0: a revised database of O-glycosylated proteins.
Gupta R , Birch H , Rapacki K , Brunak S , Hansen JE.