Staged parser combinators for efficient data processing

Jonnalagedda, Manohar; Coppey, Thierry; Stucki, Sandro; Rompf, Tiark; Odersky, Martin

doi:10.1145/2660193.2660241

conference paper

2014

Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications - OOPSLA '14

Object Oriented Programming Systems Languages & Applications (OOPSLA)

Parsers are ubiquitous in computing, and many applications depend on their performance for decoding data efficiently. Parser combinators are an intuitive tool for writing parsers: tight integration with the host language enables grammar specifications to be interleaved with processing of parse results. Unfortunately, parser combinators are typically slow due to the high overhead of the host language abstraction mechanisms that enable composition. We present a technique for eliminating such overhead. We use staging, a form of runtime code generation, to dissociate input parsing from parser composition, and eliminate intermediate data structures and computations associated with parser composition at staging time. A key challenge is to maintain support for input-dependent grammars, which have no clear stage distinction. Our approach applies to top-down recursive-descent parsers as well as bottom-up nondeterministic parsers with key applications in dynamic programming on sequences, where we auto-generate code for parallel hardware. We achieve performance comparable to specialized, hand-written parsers.
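The contrast the abstract draws between composing parsers and running them can be illustrated with a small sketch. It does not reproduce the authors' implementation: Python's `exec` stands in for the staging framework, every name (`number`, `lit`, `seq`, `s_number`, `s_lit`, `s_seq`, `Gen`, `stage`) is hypothetical, and the toy grammar (an integer, a comma, an integer) has no alternation and no input-dependent parts. The first half builds a parser out of closures that are invoked on every parse. The second half runs the same combinator structure once, at staging time, to emit straight-line source code with no closures or intermediate parser objects left in it.

```python
import itertools

# --- Unstaged: a parser is a function (text, pos) -> (value, new_pos) or None.
def number(s, i):
    j = i
    while j < len(s) and "0" <= s[j] <= "9":
        j += 1
    return (int(s[i:j]), j) if j > i else None

def lit(c):
    def parse(s, i):
        return (c, i + 1) if i < len(s) and s[i] == c else None
    return parse

def seq(p, q):
    # Composition overhead: two closure calls and a tuple per use, on every parse.
    def parse(s, i):
        r1 = p(s, i)
        if r1 is None:
            return None
        r2 = q(s, r1[1])
        if r2 is None:
            return None
        return ((r1[0], r2[0]), r2[1])
    return parse

unstaged = seq(seq(number, lit(",")), number)

# --- Staged: a parser is a code generator (gen, pos_expr) -> (value_expr, pos_expr).
class Gen:
    def __init__(self):
        self.lines = []
        self.counter = itertools.count()

    def fresh(self, base):
        return f"{base}{next(self.counter)}"

    def emit(self, line):
        self.lines.append("    " + line)

def s_number():
    def gen(g, pos):
        end, val = g.fresh("p"), g.fresh("v")
        g.emit(f"{end} = {pos}")
        g.emit(f"while {end} < n and '0' <= s[{end}] <= '9': {end} += 1")
        g.emit(f"if {end} == {pos}: return None")
        g.emit(f"{val} = int(s[{pos}:{end}])")
        return val, end
    return gen

def s_lit(c):
    def gen(g, pos):
        g.emit(f"if {pos} >= n or s[{pos}] != {c!r}: return None")
        new = g.fresh("p")
        g.emit(f"{new} = {pos} + 1")
        return repr(c), new
    return gen

def s_seq(p, q):
    # Sequencing happens while generating code; nothing of it survives at parse time.
    def gen(g, pos):
        v1, p1 = p(g, pos)
        v2, p2 = q(g, p1)
        return f"({v1}, {v2})", p2
    return gen

def stage(parser):
    g = Gen()
    value, end = parser(g, "0")
    body = "\n".join(g.lines)
    src = f"def parse(s):\n    n = len(s)\n{body}\n    return ({value}, {end})\n"
    env = {}
    exec(src, env)  # runtime code generation: compile the specialized parser
    return env["parse"], src

staged_parse, generated_src = stage(s_seq(s_seq(s_number(), s_lit(",")), s_number()))
```

Both `unstaged("12,345", 0)` and `staged_parse("12,345")` yield the same nested result and end position, and printing `generated_src` shows a single flat function with one loop per number and one comparison for the comma. Failure is handled here by returning `None` directly from the generated function, which is only adequate because the toy grammar never backtracks.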

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/108537

Name: p637-jonnalagedda.pdf
Type: Publisher's version
Access type: openaccess
Size: 655.77 KB
Format: Adobe PDF
Checksum (MD5): 0f764ee418bb680ec78ed4bb18c0954c