MATERNA, Jiří. LDA-Frames: An Unsupervised Approach to Generating Semantic Frames. In Alexander Gelbukh. Computational Linguistics and Intelligent Text Processing, 13th International Conference, CICLing 2012, Part I. Berlin: Springer-Verlag, 2012, p. 376-387. ISBN 978-3-642-28603-2. Available from: https://dx.doi.org/10.1007/978-3-642-28604-9_31.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name LDA-Frames: An Unsupervised Approach to Generating Semantic Frames
Authors MATERNA, Jiří (203 Czech Republic, guarantor, belonging to the institution).
Edition Berlin, Computational Linguistics and Intelligent Text Processing, 13th International Conference, CICLing 2012, Part I, p. 376-387, 12 pp. 2012.
Publisher Springer-Verlag
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher India
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14330/12:00059516
Organization unit Faculty of Informatics
ISBN 978-3-642-28603-2
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-642-28604-9_31
Keywords in English LDA-frames; semantic frame; Latent Dirichlet Allocation
Tags best1
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 23/4/2013 07:26.
Abstract
In this paper we introduce a novel approach to identifying semantic frames from semantically unlabelled text corpora. There are many frame formalisms but most of them suffer from the problem that all frames must be created manually and the set of semantic roles must be predefined. The LDA-Frames approach, based on the Latent Dirichlet Allocation, avoids both these problems by employing statistics on a syntactically tagged corpus. The only information that must be given is a number of semantic frames and a number of semantic roles to be identified. The power of LDA-Frames is first shown on a small sample corpus and then on the British National Corpus.
Links
LC536, research and development projectName: Centrum komputační lingvistiky
Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky
PrintDisplayed: 30/4/2024 22:09