Abstract
A sizable percentage of online user generated content is susceptible to code switching and code mixing owing to a variety of reasons. Thus, an expected consequence is that adhoc user queries on such data are also inherently code mixed. This paper thus presents our solution for a similar scenario: information retrieval on code mixed Hindi-English tweets. We explore techniques in information extraction, clustering and query expansion as part of this work and present our results on the test dataset. Our system achieved a MAP of 0.0217 on the test set and placed third on the rankings.
| Original language | English |
|---|---|
| Pages (from-to) | 105-108 |
| Number of pages | 4 |
| Journal | CEUR Workshop Proceedings |
| Volume | 1737 |
| State | Published - 2016 |
| Event | 2016 Forum for Information Retrieval Evaluation, FIRE 2016 - Kolkata, India Duration: Dec 7 2016 → Dec 10 2016 |
Fingerprint
Dive into the research topics of 'Exploiting named entity mentions towards code mixed IR: Working notes for the UB system submission for MSIR@FIRE'16'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver