Thursday, 29 October 2009

Wednesday, 28 October 2009

The OntoNotes Project

http://www.bbn.com/ontonotes/

--
Cheers,
Vu

The book "Natural Language Processing with Python"

http://www.nltk.org/book

I will study it soon!

--
Cheers,
Vu

Project Gutenberg

Project Gutenberg (http://www.gutenberg.org/wiki/Main_Page) is a site offering more than 100,000 free online books in various languages ^_^. In particular, it gives access to the raw text of its books for further processing (e.g. book summarization, a very interesting research direction that has been underestimated so far).
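As a rough illustration of that kind of further processing, here is a minimal Python sketch that strips the licence boilerplate from a raw text file and splits the body into sentences. It assumes the file wraps the book between "*** START OF ..." and "*** END OF ..." marker lines, as many Project Gutenberg plain-text files do. The function names `extract_body` and `split_sentences` are my own, and the sentence splitter is deliberately naive.

```python
import re

# Assumption: the book body sits between "*** START OF ..." and
# "*** END OF ..." marker lines; files without them are returned unchanged.
START_RE = re.compile(r"^\*\*\*\s*START OF.*$", re.MULTILINE)
END_RE = re.compile(r"^\*\*\*\s*END OF.*$", re.MULTILINE)


def extract_body(raw_text):
    """Return the text between the START and END marker lines.

    Falls back to the start or end of the text when a marker is missing.
    """
    start = START_RE.search(raw_text)
    begin = start.end() if start else 0
    # Look for the END marker only after the START marker.
    end = END_RE.search(raw_text, begin)
    finish = end.start() if end else len(raw_text)
    return raw_text[begin:finish].strip()


def split_sentences(body):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    normalized = " ".join(body.split())  # collapse line breaks and extra spaces
    return [s for s in re.split(r"(?<=[.!?])\s+", normalized) if s]
```

To try it, read a downloaded text file (for example with `open(path, encoding="utf-8").read()`, where `path` is wherever you saved the book) and pass the result to `extract_body`, then to `split_sentences`. A real summarizer would need a proper sentence tokenizer, but this is enough to get clean input text.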

--
Cheers,
Vu

Tuesday, 27 October 2009

Reference Management Tools (part 2)

I just want to collect more information about reference management tools.

Part 1: here

1) BibDesk (http://bibdesk.sf.net) for Mac

2) Mendeley (http://www.mendeley.com/)

If you know of any others, please suggest them to me! Thanks in advance!

--
Cheers,
Vu

Sunday, 25 October 2009

Notable papers on Vietnamese Word Segmentation

By Doan NGUYEN (Hewlett-Packard Company)

1) "Query Preprocessing: Improving Web Search Through a Vietnamese Word Tokenization Approach". SIGIR'08 (short paper)

2) "Using Search Engine to Construct a Scalable Corpus for Vietnamese Lexical Development for Word Segmentation". Proceedings of the 7th Workshop on Asian Language Resources, ACL-IJCNLP 2009.

--
Cheers,
Vu