Thursday, 18 November 2010

MultiTree - A Digital Library of Language Relationships

(look quite beautiful! :D)




Abstract: PaperMaker is a novel IT solution that receives a scientific manuscript via a Web interface, automatically analyses the publication, evaluates consistency parameters and interactively delivers feedback to the author. It analyses the proper use of acronyms and their definitions, and the use of specialized terminology. It provides Gene Ontology (GO) and Medline Subject Headings (MeSH) categorization of text passages, the retrieval of relevant publications from public scientific literature repositories, and the identification of missing or unused references.

Monday, 15 November 2010

Stanford CoreNLP

All-in-one included:
- sentence splitter
- tokenizer
- part of speech tagger
- lemmatizer
- named entity recognizers (probabilistic and rule-based)
- numeric entity canonicalizer
- parser
- coreference system

Gensim – Python Framework for Vector Space Modelling