@article{876509,
  author = {Jakubíček, Miloš and Horák, Aleš},
  article_location = {Mexico},
  article_number = {March 2010},
  keywords = {punctuation; grammar checking; parsing; syntactic analysis},
  language = {eng},
  issn = {1870-4069},
  journal = {Research in Computing Science, Special issue: Natural Language Processing and its Applications},
  title = {Punctuation Detection with Full Syntactic Parsing},
  url = {http://www.cicling.org/2010/Vol46.pdf},
  volume = {46},
  year = {2010}
}
TY  - JOUR
ID  - 876509
AU  - Jakubíček, Miloš
AU  - Horák, Aleš
PY  - 2010
TI  - Punctuation Detection with Full Syntactic Parsing
JF  - Research in Computing Science, Special issue: Natural Language Processing and its Applications
VL  - 46
IS  - March 2010
SP  - 335
EP  - 343
PB  - Instituto Politécnico Nacional
SN  - 18704069
KW  - punctuation
KW  - grammar checking
KW  - parsing
KW  - syntactic analysis
UR  - http://www.cicling.org/2010/Vol46.pdf
N2  - The correct placement of punctuation characters is in many languages, including Czech, driven by complex guidelines. Although those guidelines use information of morphology, syntax and semantics, state-of-the-art systems for punctuation detection and correction are limited to simple rule-based backbones. In this paper we present a syntax-based approach by utilizing the Czech parser synt. This parser uses an adapted chart parsing technique for building the chart structure for the sentence. synt can then process the chart and provide several kinds of output information. The implemented punctuation detection technique utilizes the synt output in the form of automatic and unambiguous extraction of optimal syntactic structures from the sentence (noun phrases, verb phrases, clauses, relative clauses or inserted clauses). Using this feature it is possible to obtain information about syntactic structures related to expected punctuation placement. We also present experiments proving that this method makes it possible to cover most syntactic phenomena needed for punctuation detection or correction.
ER  - 
JAKUBÍČEK, Miloš and Aleš HORÁK. Punctuation Detection with Full Syntactic Parsing. \textit{Research in Computing Science, Special issue: Natural Language Processing and its Applications}. Mexico: Instituto Politécnico Nacional, 2010, vol.~46, March 2010, p.~335--343. ISSN~1870-4069.