Skip to main navigation Skip to search Skip to main content

Unsupervised Russian POS tagging with appropriate context

  • Janya Inc.
  • SUNY Buffalo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

While adopting the contextualized hidden Markov model (CHMM) framework for unsupervised Russian POS tagging, we investigate the possibility of utilizing the left, right, and unambiguous context in the CHMM framework. We propose a backoff smoothing method that incorporates all three types of context into the transition probability estimation during the expectation-maximization process. The resulting model with this new method achieves overall and disambiguation accuracies comparable to a CHMM using the classic backoff smoothing method for HMM-based POS tagging from [17].

Original languageEnglish
Title of host publicationText, Speech and Dialogue - 14th International Conference, TSD 2011, Proceedings
Pages427-433
Number of pages7
DOIs
StatePublished - 2011
Event14th International Conference on Text, Speech and Dialogue, TSD 2011 - Pilsen, Czech Republic
Duration: Sep 1 2011Sep 5 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6836 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Text, Speech and Dialogue, TSD 2011
Country/TerritoryCzech Republic
CityPilsen
Period09/1/1109/5/11

Keywords

  • CHMM
  • expectation-maximization (EM)
  • left
  • right
  • transition probability
  • unambiguous context
  • unsupervised Russian part-of-speech tagging

Fingerprint

Dive into the research topics of 'Unsupervised Russian POS tagging with appropriate context'. Together they form a unique fingerprint.

Cite this