In the field of Scientific Computing, two trends are clear: the size of data sets in use is growing rapidly and microprocessor performance is improving through increases in parallelism, rather than through clock rate increases. Further, Extensible Markup Language (XML) is increasingly being used to encode large data sets, and SOAP is being used to provide Grid services – uses XML and SOAP were never designed for, and naïve implementations of these standards can lead to performance penalties. As these trends continue, past assumptions about the value of seeking out parallel algorithms should be revisited.
Lexical analysis has traditionally been seen as an inherently serial process. This work seeks to challenge that viewpoint. We start by tracking the performance of state of the art in XML parsers and SOAP toolkits through benchmarks for scientific computing applications. We continue to study the space through an examination of the effects of current workstation- and server-class computer systems' caching mechanisms on parser performance. Finally, we propose Piximal, an NFA-based parser which uses spare processors to reduce XML parse time. The limits of the Piximal approach to parallel XML parsing are examined.
|Commitee:||Chiu, Kenneth, Guzman, Fernando, Lander, Leslie, Lewis, Michael J.|
|School:||State University of New York at Binghamton|
|School Location:||United States -- New York|
|Source:||DAI-B 71/01, Dissertation Abstracts International|
|Keywords:||Automata, Benchmarks, Multicore, Parallel algorithms, Parsing, XML datasets|
Copyright in each Dissertation and Thesis is retained by the author. All Rights Reserved
The supplemental file or files you are about to download were provided to ProQuest by the author as part of a
dissertation or thesis. The supplemental files are provided "AS IS" without warranty. ProQuest is not responsible for the
content, format or impact on the supplemental file(s) on our system. in some cases, the file type may be unknown or
may be a .exe file. We recommend caution as you open such files.
Copyright of the original materials contained in the supplemental file is retained by the author and your access to the
supplemental files is subject to the ProQuest Terms and Conditions of use.
Depending on the size of the file(s) you are downloading, the system may take some time to download them. Please be