"Sensitive words" are terms, including certain words and other offensive words, whose use is restricted by the state or by institutions. We built a tracking system for Tibetan and Uygur sensitive words. We first constructed a sensitive-word vocabulary and classified its entries. To track these words effectively, we then searched the Web for each entry in the vocabulary. The search results identified the sensitive words that receive the most attention on the Web, and these became the targets for tracking. The tracking system uses a new link analysis algorithm to follow Tibetan and Uygur sensitive words with high usage frequency. Experiments indicate that the approach performs effectively.
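The vocabulary-driven selection step described above can be illustrated with a minimal sketch. It is not the authors' system and does not reproduce their link analysis algorithm, which the abstract does not specify. The function name `rank_sensitive_words`, the placeholder terms, the category labels, and the sample pages are all hypothetical. Plain substring matching is assumed for simplicity; a real system for Tibetan or Uygur text would likely need language-specific segmentation.

```python
from collections import Counter


def rank_sensitive_words(vocabulary, pages, top_k=2):
    """Rank vocabulary terms by the number of pages that mention them.

    vocabulary: dict mapping each term to its category (hypothetical labels).
    pages: list of page texts, e.g. collected from web search results.
    Returns up to top_k tuples of (term, category, page_count).
    """
    counts = Counter()
    for text in pages:
        for term in vocabulary:
            # Assumption: simple substring matching stands in for real
            # word segmentation and matching.
            if term in text:
                counts[term] += 1
    return [(term, vocabulary[term], n) for term, n in counts.most_common(top_k)]


# Placeholder data; real entries would be Tibetan or Uygur terms.
vocabulary = {
    "term_a": "category_1",
    "term_b": "category_1",
    "term_c": "category_2",
}
pages = [
    "... term_a ... term_b ...",
    "... term_a ...",
    "... term_c term_a ...",
]

top_terms = rank_sensitive_words(vocabulary, pages)
print(top_terms)
```

In this sketch the terms with the highest page counts would be the ones passed on to the tracking stage; the actual selection criterion used by the authors may differ.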