A method of determining which concepts in a set of medical concepts pertain to an input text, comprising: a) creating a set of queries for each concept, each query being a string of two of the words in the concept; b) for each query, determining whether or not the input text includes all the words of that query, and calculating a sub-score indicating a degree of matching between the query and the input text; c) for each concept for which enough of the queries have their words in the input text sufficiently close together, calculating a score depending on the sub-scores; and d) determining which of the concepts, for which a score was calculated, pertain to the input text and which do not, depending on the score of the concept.
展开▼