US 20050144556A1
(19) United States
(12) Patent Application Publication (10) Pub. No.: US 2005/0144556 A1
Petersen et al. (43) Pub. Date: Jun. 30, 2005
(54) XML SCHEMA TOKEN EXTENSION FOR XML DOCUMENT COMPRESSION
(76) Inventors: Peter H. Petersen, Trenton, NJ (US);
David D'Orto, Cherry Hill, NJ (US);
Gregory Pavlik, Shamong, NJ (US);
Neil Kenig, Mount Laurel, NJ (US)
Correspondence Address:
HEWLETT PACKARD COMPANY
P O BOX 272400, 3404 E. HARMONY ROAD
INTELLECTUAL PROPERTY
ADMINISTRATION
FORT COLLINS, CO 80527-2400 (US)
(21) Appl. No.: 10/750,136
(22) Filed: Dec. 31, 2003
Publication Classification
(51) Int. Cl.7 G06F 17/24
A method for markup language document compression comprises defining a schema that specifies the structure of a conforming markup language document and defining in the schema the types of elements and attributes that make up the conforming document. The method further comprises assigning names in the schema for each of the elements and attributes of the document, and defining relationships between the elements and between the attributes and the elements. The method further comprises assigning a token in the schema representing each element name and each attribute name of the document, and replacing each element name and each attribute name in the document with the assigned token.
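
The final replacement step can be illustrated with a minimal sketch. It is not the applicants' implementation: the token table `TOKENS`, the sample element and attribute names, and the functions `tokenize` and `detokenize` are all hypothetical. In the method described above the token assignments would be carried in the schema itself; here a plain Python dictionary stands in for them.

```python
import xml.etree.ElementTree as ET

# Hypothetical token table. In the described method these assignments
# would be declared in the schema; a dictionary stands in for it here.
TOKENS = {"purchaseOrder": "a", "item": "b", "quantity": "c", "sku": "d"}


def tokenize(xml_text, tokens):
    """Replace each element name and attribute name with its assigned token.

    Names with no entry in the table are left unchanged.
    """
    root = ET.fromstring(xml_text)
    for element in root.iter():
        element.tag = tokens.get(element.tag, element.tag)
        # Copy the attributes first, then re-add them under their token names.
        attributes = list(element.attrib.items())
        element.attrib.clear()
        for name, value in attributes:
            element.set(tokens.get(name, name), value)
    return ET.tostring(root, encoding="unicode")


def detokenize(xml_text, tokens):
    """Restore the original names by applying the inverted token table."""
    inverse = {token: name for name, token in tokens.items()}
    return tokenize(xml_text, inverse)


document = (
    '<purchaseOrder><item sku="X1"><quantity>2</quantity></item>'
    "</purchaseOrder>"
)
compact = tokenize(document, TOKENS)
restored = detokenize(compact, TOKENS)
```

Because both sides are assumed to share the same table, the receiver can recover the original names by applying the mapping in reverse. The sketch assumes every token is a valid XML name and that no token collides with a name that is left untokenized.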