Binary page implementation of a canonical native storage for XML

dc.contributor.author Patanroi, Daniel
dc.contributor.department Department of Computer Science
dc.date 2020-06-17T02:41:42.000
dc.date.accessioned 2020-06-30T08:13:56Z
dc.date.available 2020-06-30T08:13:56Z
dc.date.copyright Sat Jan 01 00:00:00 UTC 2005
dc.date.issued 2005-01-01
dc.description.abstract <p>XML is a simple and very flexible text format, originally designed to meet the challenges of large-scale electronic publishing. Because XML is well suited to representing data, many XML-based query processors and storage managers have been proposed. However, DOM parsers map an entire XML document onto an internal tree structure, which causes the classical memory problem, so many implementations can handle only rather small documents. CanStoreX, in its textual page implementation, approaches the problem by breaking an XML document into smaller pieces that are stored in pages. It preserves the structure of the original XML document and does not require the whole document to be loaded into main memory at once. Its binary page implementation removes major memory problems. This allows CanStoreX to parse XML documents of 100 gigabytes or more without any conspicuous problems, which shows that CanStoreX is scalable in terms of storage requirements, memory management, and query processing. The only two bottlenecks, the encoding and decoding processes, can be reduced by embedding them in a computer chip, which will further bring CanStoreX to its primal state.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/rtd/19208/
dc.identifier.articleid 20207
dc.identifier.contextkey 18125284
dc.identifier.doi https://doi.org/10.31274/rtd-20200616-101
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath rtd/19208
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/73193
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/rtd/19208/Patanroi_ISU_2005_P276.pdf|||Fri Jan 14 21:53:38 UTC 2022
dc.subject.keywords Computer science
dc.title Binary page implementation of a canonical native storage for XML
dc.type thesis en_US
dc.type.genre thesis en_US
dspace.entity.type Publication
relation.isOrgUnitOfPublication f7be4eb9-d1d0-4081-859b-b15cee251456
thesis.degree.discipline Computer Science
thesis.degree.level thesis
thesis.degree.name Master of Science
File
Original bundle
Name: Patanroi_ISU_2005_P276.pdf
Size: 2.49 MB
Format: Adobe Portable Document Format