Inproceedings: an article in a conference proceedings.
Segmentation and Clustering of Textual Sequences: a Typological Approach
Title of the conference
Recent Advances in Natural Language Processing. International Conference (RANLP 8 : Hissar : 2011). Proceedings
Angelova G., Bontcheva K., Mitkov R., Nikolov N.
The long term goal of this research is to develop a program able to produce an automatic segmentation and categorization of textual sequences into discourse types. In this preliminary contribution, we present the construction of an algorithm which takes a segmented text as input and attempts to produce a categorization of sequences, such as narrative, argumentative, descriptive and so on. Also, this work aims at investigating a possible convergence between the typological approach developed in particular in the field of text and discourse analysis in French by Adam (2008) and Bronckart (1997) and unsupervised statistical learning.
fuzzy clustering, discourse types, part-of-speech distributions
Last modification date