Research: Difference between revisions
Computational Linguistics and Information Processing
Line 89: | Line 89: | ||
| | | | ||
|- | |- | ||
| style="border-bottom: 3px solid grey; border-right: 1px solid grey; background:#ffefef" valign="top" | <b> | | style="border-bottom: 3px solid grey; border-right: 1px solid grey; background:#ffefef" valign="top" | <b> Students </b> | ||
| style="border-bottom: 3px solid grey;" | | | style="border-bottom: 3px solid grey;" | Olivia Buzek, [http://www.ling.umd.edu/~yakov/ Yakov Kronrod] | ||
|- | |- | ||
<b>Paraphrase</b>, the ability to express the same meaning in multiple ways, is an active area of research within the NLP community and here in the CLIP Laboratory. Our work in paraphrase includes the use of paraphrase in MT evaluation and parameter estimation, lattice and forest translation, and collaborative translation, as well as research on lexical and phrasal semantic similarity measures, meaning preservation in machine translation and summarization, and large-scale document similarity computation via cloud computing methods. | <b>Paraphrase</b>, the ability to express the same meaning in multiple ways, is an active area of research within the NLP community and here in the CLIP Laboratory. Our work in paraphrase includes the use of paraphrase in MT evaluation and parameter estimation, lattice and forest translation, and collaborative translation, as well as research on lexical and phrasal semantic similarity measures, meaning preservation in machine translation and summarization, and large-scale document similarity computation via cloud computing methods. | ||
|} | |} | ||
<b>Some Project Pages</b> | |||
* [http://www.cs.umd.edu/hcil/monotrans/ Monotrans: Crowdsourcing Translation without Bilinguals] | |||
<b>Some Representative Publications</b> | |||
* Generating Phrasal & Sentential Paraphrases: A Survey of Data-Driven Methods. 2010. Computational Linguistics, 36(3). Nitin Madnani and Bonnie Dorr. | |||
* Philip Resnik, Olivia Buzek, Chang Hu, Yakov Kronrod, Alex Quinn, Benjamin B. Bederson. [http://ling.umd.edu/~yakov/CrowdConf2010/final.pdf Improving Translation via Targeted Paraphrasing], 2010 Conference on Empirical Methods in Natural Language Processing, October 2010. | |||
* Yuval Marton, Saif Mohammad, and Philip Resnik. [http://www.aclweb.org/anthology/D/D09/D09-1081.pdf Estimating Semantic Distance Using Soft Semantic Constraints in Knowledge-Source / Corpus Hybrid Models']. Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, August 6-7, 2009. | |||
* Nitin Madnani, Necip Fazil Ayan, Philip Resnik, Bonnie Dorr. [http://www.desilinguist.org/pdf/paraphrase-wmt07.pdf Using Paraphrases for Parameter Tuning in Statistical Machine Translation]. 2007. Proceedings of the Second ACL Workshop on Statistical Machine Translation (WMT-07). | |||
==Text Summarization == | ==Text Summarization == |
Revision as of 00:27, 28 October 2010
Bayesian Modeling
Faculty |
| ||||||||
Postdocs | |||||||||
Graduate Students | Vladimir Eidelman, Eric Hardisty, Yuening Hu, Ke Zhai | ||||||||
Bayesian modeling is a rigorous mathematical formalism that allows us to build systems that reflect our uncertainty about the world. Applied to language, they allow us to build models that reflect the "latent" aspects of communication such as topic, part of speech, syntax, or sentiment. Using posterior inference, we can use the models to discover the latent features that best explain observed language. In the CLIP lab, we are interested in
Representative Publications and Project Pages: |
Machine Translation
Faculty |
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Postdocs | Kristy Hollingshead | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Graduate Students | Vladimir Eidelman | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The CLIP Laboratory's current work in machine translation continues the lab's long tradition of research in this area. Like most of the field, we work within the framework of statistical MT, but with an emphasis on taking appropriate advantage of knowledge driven or linguistically informed model structures, features, and priors. Some current areas of research include syntactically informed language models, linguistically informed translation model features, the use of unsupervised methods in translation modeling, exploitation of large scale "cloud computing" methods, and human-machine collaborative translation via crowdsourcing. Some Representative Publications:
Some Project Pages ParaphrasingParaphrase, the ability to express the same meaning in multiple ways, is an active area of research within the NLP community and here in the CLIP Laboratory. Our work in paraphrase includes the use of paraphrase in MT evaluation and parameter estimation, lattice and forest translation, and collaborative translation, as well as research on lexical and phrasal semantic similarity measures, meaning preservation in machine translation and summarization, and large-scale document similarity computation via cloud computing methods.
Some Project Pages Some Representative Publications
Text Summarization
Parsing and Tagging
Computational Social Science
Information Retrieval
Disambiguation
Annotation and Sense-making
|