A principled approach to paraphrasing analyzes input text and paraphrases at the atomic linguistic level, rather than analyzing the input text and its paraphrases as a whole set at one time. The approach extracts atomic linguistic elements from the input text and identifies matching atomic paraphrasing elements to form candidate atomic paraphrasing pairs. A variety of atomic transformation types are identified for forming these pairs. The candidate atomic paraphrasing pairs are evaluated using feature functions and a probability model. A combination of multiple candidate atomic paraphrasing pairs is scored using a score function whose value is derived from the feature functions of the pairs in that combination. A combination with a high score may be used to construct a paraphrasing text.
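The scoring steps described above can be illustrated with a minimal sketch. The abstract does not specify the feature functions, the form of the probability model, or the score function, so everything named here is an assumption made for illustration: the `AtomicPair` type, the two feature functions, the weights, the softmax-style (log-linear) probability model, and the additive combination score.

```python
import math
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class AtomicPair:
    """A candidate atomic paraphrasing pair (hypothetical structure)."""
    source: str       # atomic linguistic element extracted from the input text
    paraphrase: str   # matching atomic paraphrasing element
    transform: str    # atomic transformation type; labels are made up here


# Hypothetical feature functions; the source does not name any.
def length_ratio(pair):
    shortest = min(len(pair.source), len(pair.paraphrase))
    longest = max(len(pair.source), len(pair.paraphrase), 1)  # avoid dividing by zero
    return shortest / longest


def is_lexical(pair):
    return 1.0 if pair.transform == "lexical" else 0.0


FEATURES = [length_ratio, is_lexical]
WEIGHTS = [1.0, 0.5]  # assumed weights, not taken from the source


def pair_score(pair):
    """Weighted sum of the feature function values for one pair."""
    return sum(w * f(pair) for w, f in zip(WEIGHTS, FEATURES))


def pair_probabilities(candidates):
    """Assumed log-linear model: softmax over the candidates of one element."""
    scores = [pair_score(p) for p in candidates]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]  # shift for numerical stability
    total = sum(exps)
    return [e / total for e in exps]


def combination_score(combo):
    """Assumed score function: sum of the per-pair feature-based scores."""
    return sum(pair_score(p) for p in combo)


def best_combination(candidates_by_element):
    """Pick one candidate per atomic element, maximizing the combination score."""
    return max(product(*candidates_by_element), key=combination_score)
```

Exhaustive enumeration with `itertools.product` is only practical for a handful of atomic elements; a longer input would likely need a beam search or dynamic programming, which the abstract does not discuss.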