Sunday 25 October 2009

Notable papers on Vietnamese Word Segmentation

By Doan NGUYEN (Hewlett-Packard Company)

1) "Query Preprocessing: Improving Web Search Through a Vietnamese Word Tokenization Approach". SIGIR'08 (short paper)

2) "Using Search Engine to Construct a Scalable Corpus for Vietnamese Lexical Development for Word Segmentation". Proceedings of the 7th Workshop on Asian Language Resources, ACL-IJCNLP 2009.

--
Cheers,
Vu
