Algorithm for video shot clustering based on spectral division
-
摘要: 针对现有镜头聚类算法中选择最优化分类个数复杂度较高、分类结果准确性较低的问题,提出了一种基于谱分割理论的镜头聚类算法.通过对每个镜头集构造球状高斯模型SGM(Spherical Gaussian Model),最优化地拟合镜头数据集,提高镜头分割的准确性;在镜头迭代分类过程中采用谱分割算法以提高最终分类结果的准确性;在迭代分类过程中,采用贝叶斯信息准则BIC(Bayesian Information Criterion)作为分类停止的评判标准;最后根据BIC准则计算每两类融合前后的匹配值,判断比较后对分类结果进行融合,矫正在分类过程中同一类被割裂的错误.通过3类体育视频样本对算法的有效性进行了验证、比较和分析.Abstract: A spectral-division unsupervised shot-clustering algorithm (SDUSC) was proposed. First, the keyframes were picked out to represent the shots, and the color features of keyframes were extracted to describe the video shots. In order to characterize shots optimally, a spherical gaussian model (SGM) was constructed for every shot category. Then the spectral division (SD) method was employed to divide a category into two categories, and the method was iteratively used to further divide the results of previous divisions. At the end of each iterative shot-division, bayesian information criterion (BIC) was utilized to automatically judge whether to stop further division. During the processes of shot-divisions, one category might be dissevered by mistake. In order to correct these mistakes, similar categories would be merged by calculating the similarities of all the possible category pairs. This approach was applied to three kinds of sports videos, and the experimental results shows that the proposed approach is reliable and effective.
-
Key words:
- video /
- shot clustering /
- spectral division /
- Bayesian information criterion
-
[1] Wang Peng, Liu Zhiqiang, Yang Shiqiang. Investigation on unsupervised clustering algorithms for video shot categorization [J]. Soft Computing, 2007, 11(4):355-360 [2] Andrew Y Ng, Michael Jordan, Yair W. On spectral clustering: Analysis and an algorithm Advances in NIPS 14.Canbridge. MA: MIT Press, 2001:849-856 [3] Wu Z, Leahy R. An optimal graph theoretic approach to data clustering: theory and its application to image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1993, 11:1101-1113 [4] Shi J, Malik J. Normalized cuts and image segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8):888-905 [5] Fiedler M. A property of eigenvectors of nonnegative symmetric matrices and its application to graph theory [J]. Czechoslovak Math J, 1975, 25:619-633 [6] Duan Lingyu, Xu Min, Chua Taiseng, et al. A mid-level representation framework for semantic sports video analysis Proc of the 11th ACM International Conference on Multimedia. Berkeley: ACM Press, 2003:33-44 [7] Zhong D, Chang S F. Structure analysis of sports video using domain models Proceedings of IEEE International Conference on Multimedia & Expo. Tokyo: Elsevier Science Inc, 2001:713-716 [8] Ishioka T. An expansion of x-means for automatically determining the optimal number of clusters-progressive iterations of k-means and merging of the clusters IASTED International Conference on Computational Intelligence. : WSEAS Press, 2005:91-96
点击查看大图
计量
- 文章访问数: 2776
- HTML全文浏览量: 62
- PDF下载量: 1519
- 被引次数: 0