Skip to main navigation Skip to search Skip to main content

Exploiting named entity mentions towards code mixed IR: Working notes for the UB system submission for MSIR@FIRE'16

  • SUNY Buffalo

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

A sizable percentage of online user generated content is susceptible to code switching and code mixing owing to a variety of reasons. Thus, an expected consequence is that adhoc user queries on such data are also inherently code mixed. This paper thus presents our solution for a similar scenario: information retrieval on code mixed Hindi-English tweets. We explore techniques in information extraction, clustering and query expansion as part of this work and present our results on the test dataset. Our system achieved a MAP of 0.0217 on the test set and placed third on the rankings.

Original languageEnglish
Pages (from-to)105-108
Number of pages4
JournalCEUR Workshop Proceedings
Volume1737
StatePublished - 2016
Event2016 Forum for Information Retrieval Evaluation, FIRE 2016 - Kolkata, India
Duration: Dec 7 2016Dec 10 2016

Fingerprint

Dive into the research topics of 'Exploiting named entity mentions towards code mixed IR: Working notes for the UB system submission for MSIR@FIRE'16'. Together they form a unique fingerprint.

Cite this