We present a shot segmentation method based on the representation of visual contents in video using a coupled Markov chains approach. This representation allows us to combine different image features and to keep information about all the images since the beginning of the shot, instead of simply comparing adjacent frames. We also define an adaptative detection threshold that depends on the distance measures that are obtained, instead of trying to find a fixed threshold. Results show that the combination of color and motion image features in the same representation provides a more robust detection of shot boundaries than using each feature separately. © Springer-Verlag Berlin Heidelberg 2003.
|Journal||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|Publication status||Published - 1 Dec 2003|