首页> 外国专利> METHODS AND APPARATUSES FOR VIDEO SEGMENTATION, CLASSIFICATION, AND RETRIEVAL USING IMAGE CLASS STATISTICAL MODELS

METHODS AND APPARATUSES FOR VIDEO SEGMENTATION, CLASSIFICATION, AND RETRIEVAL USING IMAGE CLASS STATISTICAL MODELS

机译：使用图像分类统计模型对视频进行分类，分类和检索的方法和装置

页面导航

摘要
著录项
相似文献

摘要

Techniques for classifying video frames using statistical models of transform coefficients are disclosed. After optionally being decimated in time and space, image frames are transformed using a discrete cosine transform or Hadamard transform. The methods disclosed model image composition and operate on grayscale images. The resulting transform matrices are reduced using truncation, principal component analysis, or linear discriminant analysis to produce feature vectors. Feature vectors of training images for image classes are used to compute image class statistical models. Once image class statistical models are derived, individual frames are classified by the maximum likelihood resulting from the image class statistical models. Thus, the probabilities that a feature vector derived from a frame would be produced from each of the image class statistical models are computed. The frame is classified into the image class corresponding to the image class statistical model which produced the highest probability for the feature vector derived from the frame. Optionally, frame sequence information is taken into account by applying a hidden Markov model to represent image class transitions from the previous frame to the current frame. After computing all class probabilities for all frames in the video or sequence of frames using the image class statistical models and the image class transition probabilities, the final class is selected as having the maximum likelihood. Previous frames are selected in reverse order based upon their likelihood given determined current states.

机译：公开了使用变换系数的统计模型对视频帧进行分类的技术。在可选地在时间和空间上进行抽取之后，使用离散余弦变换或Hadamard变换对图像帧进行变换。该方法公开了对图像合成进行建模并在灰度图像上进行操作。使用截断，主成分分析或线性判别分析来减少所得的变换矩阵，以生成特征向量。用于图像类别的训练图像的特征向量用于计算图像类别统计模型。一旦得出图像类别统计模型，就根据图像类别统计模型产生的最大似然对各个帧进行分类。因此，计算了从每个图像类别统计模型产生从帧导出的特征向量的概率。将该帧分类为与图像类统计模型相对应的图像类，该图像类统计模型对于从该帧得出的特征向量产生最高概率。可选地，通过应用隐藏的马尔可夫模型来表示帧序列信息，以表示从前一帧到当前帧的图像类转换。在使用图像类别统计模型和图像类别转换概率为视频或帧序列计算所有帧的所有类别概率之后，选择最终类别为具有最大似然性。根据给定确定的当前状态的可能性，以相反的顺序选择先前的帧。

著录项

公开/公告号US2002028021A1

专利类型
公开/公告日2002-03-07

原文格式PDF
申请/专利权人 FOOTE JONATHAN T.;WILCOX LYNN;GIRGENSOHN ANDREAS;
展开▼

申请/专利号US19990266637
发明设计人 JONATHAN T. FOOTE;LYNN WILCOX;ANDREAS GIRGENSOHN;
展开▼

申请日1999-03-11
分类号G06K9/62;
国家 US
入库时间 2022-08-22 00:48:39

相似文献

专利
外文文献
中文文献