Abstract
The University of Maryland participated in the English and Czech tasks. For English, one monolingual run using only fields based on fully automatic transcription (the required condition) and one (otherwise identical) cross-language run using French queries were officially scored. Three contrastive runs in which manually generated metadata fields in the English collection were indexed were also officially scored to explore the applicability of recently developed "meaning matching" approaches to cross-language retrieval of manually indexed interviews. Statistical translation models trained on European Parliament proceedings were found to be poorly matched to this task, yielding 38% and 44% of monolingual mean average precision for indexing based on automatic transcription and manually generated metadata, respectively. Weighted use of alternative translations yielded an apparent (but not statistically significant) 7% improvement over one-best translation when bi-directional meaning matching techniques were employed. Results for Czech were not informative in this first year of that task, perhaps because no accommodations were made for the unique characteristics of Czech morphology.
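The figures of 38% and 44% above express cross-language mean average precision (MAP) as a fraction of the monolingual MAP. The following is a minimal sketch of that arithmetic, assuming uninterpolated average precision per topic. The function names, topic IDs, document IDs, and rankings are all hypothetical toy values chosen for illustration. They are not data or results from the paper, and this is not the evaluation tool the authors used.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Uninterpolated average precision for a single topic."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at the rank of each relevant document retrieved.
            precision_sum += hits / rank
    # Relevant documents never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(run: Dict[str, List[str]],
                           qrels: Dict[str, Set[str]]) -> float:
    """Mean of per-topic average precision over every topic in qrels."""
    total = sum(average_precision(run.get(topic, []), relevant)
                for topic, relevant in qrels.items())
    return total / len(qrels)


# Toy relevance judgments and rankings (invented for illustration only).
qrels = {"t1": {"a", "b"}, "t2": {"c"}}
monolingual_run = {"t1": ["a", "b", "x"], "t2": ["c", "y"]}
cross_language_run = {"t1": ["x", "a", "y", "b"], "t2": ["y", "c"]}

mono_map = mean_average_precision(monolingual_run, qrels)
cross_map = mean_average_precision(cross_language_run, qrels)

# Cross-language effectiveness as a fraction of the monolingual baseline.
relative = cross_map / mono_map
print(f"{relative:.0%}")  # prints "50%" for this toy data
```

Reporting the ratio rather than the raw cross-language MAP makes the cost of translation visible independently of how hard the collection is for the monolingual system.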
| Field | Value |
|---|---|
| Original language | English |
| Journal | CEUR Workshop Proceedings |
| Volume | 1172 |
| State | Published - 2006 |
| Event | 2006 Cross Language Evaluation Forum Workshop, CLEF 2006, co-located with the 10th European Conference on Digital Libraries, ECDL 2006 - Alicante, Spain |
| Duration | Sep 20 2006 → Sep 22 2006 |
Keywords
- Cross-language information retrieval
- Speech retrieval
- Statistical translation
Title
CLEF-2006 CL-SR at Maryland: English and Czech