Abstract
In this paper we explore the possibility of using cross lingual projections that help to automatically induce role-semantic annotations in the PropBank paradigm for Urdu, a resource poor language. This technique provides annotation projections based on word alignments. It is relatively inexpensive and has the potential to reduce human effort involved in creating semantic role resources. The projection model exploits lexical as well as syntactic information on an English-Urdu parallel corpus. We show that our method generates reasonably good annotations with an accuracy of 92% on short structured sentences. Using the automatically generated annotated corpus, we conduct preliminary experiments to create a semantic role labeler for Urdu. The results of the labeler though modest, are promising and indicate the potential of our technique to generate large scale annotations for Urdu.
| Original language | English |
|---|---|
| Pages | 797-805 |
| Number of pages | 9 |
| State | Published - 2010 |
| Event | 23rd International Conference on Computational Linguistics, Coling 2010 - Beijing, China Duration: Aug 23 2010 → Aug 27 2010 |
Conference
| Conference | 23rd International Conference on Computational Linguistics, Coling 2010 |
|---|---|
| Country/Territory | China |
| City | Beijing |
| Period | 08/23/10 → 08/27/10 |
Fingerprint
Dive into the research topics of 'Using cross-lingual projections to generate semantic role labeled corpus for urdu - A resource poor language'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver