Thursday, 23 February 2012
SMT without Parallel Corpora
Toward Statistical Machine Translation without Parallel Corpora (link). EMNLP 2011.
Labels:
links,
papers,
parallel corpora,
research,
SMT,
statistical machine translation
Wednesday, 22 February 2012
Monday, 20 February 2012
Topic Hierarchy Generation
This post continues the problem of topic summarization posted earlier. Here I try to collect research articles related to the problem of topic hierarchy generation which is an important step for topic summarization.
1) Non-Parametric Estimation of Topic Hierarchies from Texts with Hierarchical Dirichlet Processes (link). Journal of Machine Learning Research 2011.
2) A Practical Web-based Approach to Generating Topic Hierarchy for Text Segments (link). CIKM 2004.
3) Finding Topic Words for Hierarchical Summarization (link). SIGIR 2001.
4) The Nested Chinese Restaurant Process and Bayesian Non-parametric Inference of Topic Hierarchies (link). Journal of the ACM 2010.
5) Mining bilingual topic hierarchies from unaligned text (link). IJCNLP 2011.
6) Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews (link). ACL 2011.
7) (TBA)
4) The Nested Chinese Restaurant Process and Bayesian Non-parametric Inference of Topic Hierarchies (link). Journal of the ACM 2010.
5) Mining bilingual topic hierarchies from unaligned text (link). IJCNLP 2011.
6) Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews (link). ACL 2011.
7) (TBA)
(to be updated).
Labels:
links,
NLP,
research,
summarization,
topic summarization
Sunday, 19 February 2012
Albatross Toolkit
http://www.object-craft.com.au/projects/albatross/
http://www.nactem.ac.uk/brat-annotation/
Albatross is a small and flexible Python toolkit for developing highly stateful web applications. The toolkit has been designed to take a lot of the pain out of constructing intranet applications although you can also use Albatross for deploying publicly accessed web applications.
Subscribe to:
Posts (Atom)