Poster Paper
© 2013 ACEEE
DOI: 03.LSCS.2013.3.
Proc. of Int. Conf. on Advances in Computer Science and Application 2013
taken 600 sentences as our training data set and the rest 400
sentences as our testing data set.
With the training data set, we have manually categorized
sentences into Contrast, Sequence and Parallel depending
on the definition of these RST relations, then we have split
the composite sentences into 2 simple sentences manually
and then create the semantic structure of each of the
sentences, after which we manually identify the Discourse
marker from each of the sentences. Then we categorized the
sentences into Contrast, Sequence and Parallel. we have
developed some rules for each of the RST relation from their
semantic representation. These rules were then formulated
into an algorithm for each of the RST relations. Then we have
created a system with the help of our algorithm.
The testing data set i.e. the remaining 400 sentences are
first manually categorized into Contrast, Sequence and Parallel
depending on the definition of these RST relations and then
to check the accuracy of our system, we put the semantic
representation of these 400 sentences into our system.
53
Figure.3:- Line Chart to show the accuracy of our system
The figure 3 shows the accuracy of the system. The
accuracy is measured on the testing data set. The horizontal
axis represents the Testing Data set, i.e. the number of
sentences tested and the vertical axis represents the number
of correct identification of RST relation by our system. We
can see that for the first 100 sentences, 91 sentences were
identified correctly by our system and so on. When all the
400 sentences are tested, 383 RST Relations are correctly
identified by our system.
CONCLUSION
In this paper we have discussed about the identification
of the Rhetorical Structure Theory Relation from Discourse
Marker in Bengali Language Understanding. We have de-
rived algorithms for CONTRAST, SEQUENCE and PARAL-
LEL RST relation and explained them with few examples. We
have given user based evaluation to validate our approach.
We have built a small system and a small corpus to check our
result.
FUTURE WORKS
There are many more RST relations whose algorithms can
be derived and then a system can be built using the functions
of all the RST relations.
REFERENCES
[1] Maite Taboada : “Discourse markers as signals (or not) of
rhetorical relations” in Journal of Pragmatics 38 (2006) 567–
592
[2] Sumit Das, Anupam Basu, Sudeshna Sarkar: “Discourse Marker
Generation and Syntactic Aggregation in Bengali Text
Generation” in Proceedings of the 2010 IEEE Students’
Technology Symposium, 3-4 April 2010, IIT Kharagpur
[3] William C. Mann and Sandra A. Thompson: “Rhetorical
structure theory: Toward a functional theory of text
organization”. In Text, 8(3):243- 281, 1988.
[4] S amit Bhattacharya, Sanyog: “An iconic system for multilingual
communication for people with speech and motor
impairments”. M.S. Thesis, IIT, Kharagpur, Supervisor -
Anupam Basu and Sudeshna Sarkar, 2004.
[5] Maite Taboada and William C. Mann: “Rhetorical Structure
Theory: Looking Back and Moving Ahead” in Discourse
Studies 8 (3) by Sage Publicat, January 24, 2006.
[6] Caroline Sporleder, Alex Lascarides:”Using Automatically
Labelled Examples to Classify Rhetorical Relations: An
Assessment”, Printed in the United Kingdom Cambridge
University Press, Received 26 August 2005; revised 16 March
2006.
[7] Farhi Marir and Kame1 Haouam: “Rhetorical Structure Theory
for content-based indexing and retrieval of Web documents”
in Proc. of 2nd International Conference on Information
Technology: Research and Education, 2004, ITRE 2004.
Pages:160 - 164
[8] Maite Taboada and Manfred Stede: “Introduction to RST
Rhetorical Structure Theory” created as part of a project on
Natural Language Generation at the Information Sciences, May
2009.
[9] Daniel Marcu and Abdessamad Echihabi: “An Unsupervised
Approach to Recognizing Discourse Relations” in Proceedings
of the 40th Annual Meeting of the Association for
Computational Linguistics (ACL), Philadelphia, July 2002,
pp. 368-375.
[10] Abanindranath Thakur: “Khirer Putul” Published by Ananda
Publications, 1896. (For Corpus use)
[11] Bibhutibhushon Bondopoddhay: “Chander Pahar” Published
by Bornayon, 1937 (For Corpus use)
45