Wednesday, 3 March 2010

Wikipedia issues

All Wikipedia-related issues will be posted here:

Dumps of Wikipedia: http://download.wikimedia.org
Extraction of plain text corpus from Wikipedia: http://blog.afterthedeadline.com/2009/12/04/generating-a-plain-text-corpus-from-wikipedia/
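
For anyone who wants to peek inside a dump before running a full extraction, here is a minimal sketch of streaming pages out of a MediaWiki XML export with Python's standard library. It is not the method from the linked post, just an illustration. The names `iter_pages`, `local_name` and `SAMPLE` are made up for this example, and the namespace URL in the sample is only a placeholder, since the export schema version differs between dumps.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, wikitext) for each <page> element in a MediaWiki XML export.

    `source` is a file name or a binary file-like object. The file is parsed
    incrementally, so the whole dump is never held in memory at once.
    """
    title, text = None, None
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, None
            elem.clear()  # release the finished page's children


# Tiny hand-written sample (an assumption, not real dump content).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.4/">
  <page>
    <title>Example</title>
    <revision>
      <text>Some '''wiki''' text.</text>
    </revision>
  </page>
</mediawiki>"""

if __name__ == "__main__":
    for page_title, wikitext in iter_pages(io.BytesIO(SAMPLE)):
        print(page_title, len(wikitext))
```

The text that comes out is still raw wiki markup; turning it into plain text is the part the linked post covers. For a real compressed dump, a file object from `bz2.open(path, "rb")` should work as `source`, where `path` is whatever file you downloaded.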

--
Cheers,
Vu
