TY - GEN
T1 - Unsupervised Russian POS tagging with appropriate context
AU - Yang, Li
AU - Peterson, Erik
AU - Chen, John
AU - Petrova, Yana
AU - Srihari, Rohini
PY - 2011
Y1 - 2011
AB - Adopting the contextualized hidden Markov model (CHMM) framework for unsupervised Russian POS tagging, we investigate whether the left, right, and unambiguous context can be utilized within that framework. We propose a backoff smoothing method that incorporates all three types of context into the transition probability estimation during the expectation-maximization process. The resulting model achieves overall and disambiguation accuracies comparable to those of a CHMM using the classic backoff smoothing method for HMM-based POS tagging from [17].
KW - CHMM
KW - expectation-maximization (EM)
KW - left context
KW - right context
KW - transition probability
KW - unambiguous context
KW - unsupervised Russian part-of-speech tagging
UR - https://www.scopus.com/pages/publications/80052772895
U2 - 10.1007/978-3-642-23538-2_54
DO - 10.1007/978-3-642-23538-2_54
M3 - Conference contribution
AN - SCOPUS:80052772895
SN - 9783642235375
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 427
EP - 433
BT - Text, Speech and Dialogue - 14th International Conference, TSD 2011, Proceedings
T2 - 14th International Conference on Text, Speech and Dialogue, TSD 2011
Y2 - 1 September 2011 through 5 September 2011
ER -