dc.contributor.author | MĂRĂNDUC, Cătălina | |
dc.contributor.author | BOBICEV, Victoria | |
dc.contributor.author | PEREZ, Cenel Augusto | |
dc.date.accessioned | 2021-04-29T14:16:00Z | |
dc.date.available | 2021-04-29T14:16:00Z | |
dc.date.issued | 2020 | |
dc.identifier.citation | MĂRĂNDUC, Cătălina, BOBICEV, Victoria, PEREZ, Cenel Augusto. Parsing temporal and spatial information. In: Linguistic resources and tools for natural language processing: proc. of the 15th International Conference, 14-15 Dec. 2020, online, 2020, pp. 17–28. | en_US |
dc.identifier.uri | http://81.180.74.21:8080/xmlui/handle/123456789/14639 | |
dc.description | Sursa: Conferinţa „Linguistic resources and tools for natural language processing”, proc. of the 15th International Conference, 14-15 Dec. 2020 https://profs.info.uaic.ro/~consilr/wp-content/uploads/2021/03/volum-ConsILR-v-4-final-revizuit.pdf#page=25 | en_US |
dc.description.abstract | In this paper we present a dependency treebank morphologically and syntactically annotated in a specific scheme. We managed to increase the accuracy of the POS-tagger and the syntactic parser used, which led to the increase in the volume of annotated texts. First, we analysed the accuracy with which the syntactic parser recognizes the 14 types of circumstantial complements, especially the temporal and spatial ones. These are the most numerous circumstantial complements, and they are very important for the configuration of a textual world describing reality or proposing a fictitious world, providing information about the type of text. In December 2020 our treebank comprised 42,542 sentences (919,608 words and punctuation). We studied our documents containing fictional and non-fictional narrative. Using a Malt parser optimizer, we extracted dependency chains of time and spatial complements. The number of complements and the degree to which they are precise is related to the type of text, fictional or nonfictional. In order to construct a classifier of texts, one can count the spatial and temporal complements and one can observe if they represent determinations of exact landmarks (with geographical proper names and numbers) - in which case the text is a real narrative, or if they represent imprecise determinations, in which case the narrative is fictional. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Romanian Academy, Bucharest | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.subject | annotated texts | en_US |
dc.subject | texts | en_US |
dc.subject | syntactic parser | en_US |
dc.subject | treebanks | en_US |
dc.title | Parsing temporal and spatial information | en_US |
dc.type | Article | en_US |
The following license files are associated with this item: