Professional Documents
Culture Documents
Abstract
Detecting dominant motion flows in crowd scenes is one of the major problems in video surveillance. This is particularly difficult in unstructured crowd scenes, where the participants move randomly in various directions. This paper presents a novel method which utilizes SIFT features flow vectors to calculate the dominant motion flows in both unstructured and structured crowd scenes. SIFT features can represent the characteristic parts of objects, allowing robust tracking under non-rigid motion. First, flow vectors of SIFT features are calculated at certain intervals to form a motion flow map of the video. ext, this map is divided into equally sized square regions and in each region dominant motion flows are estimated by clustering the flow vectors. Then, local dominant motion flows are combined to obtain the global dominant motion flows. Experimental results demonstrate the successful application of the proposed method to challenging real-world scenes.
1. Introduction
Dominant motion patterns in videos provide very significant information which has a wide range of applications. Since motion patterns are formed by individual motions or interacted motions of crowds, it helps to analyze the social behavior in a given environment in the video. Furthermore it is useful during public place design and activity analysis for security reasons. Over the years, there have been many researches which try to find the motion patterns by using individual object tracking and trajectory classification methods. However, in real world situations, high density crowds form the most cases, and it is not always possible to track individual
1051-4651/10 $26.00 2010 IEEE DOI 10.1109/ICPR.2010.862 3521 3537 3533
Figure 1. Unstructured/structured Crowds. objects. Crowd scenes can be divided into two groups, unstructured and structured scenes, as in Figure 1. Structured crowds are the ones where main motion tracks are defined by environmental conditions, such as elevators, crosswalks, etc. Unstructured crowds are those where objects can move freely in any direction, following any path. So far, very few researchers have attempted to solve the complexity of the crowd scenes that are structured. Detecting dominant motion flows in unstructured crowds still remains as a challenging task. To solve the problem of calculating the dominant motion flows both in unstructured and structured crowds, we propose a new approach which has two distinctive contributions. First, our approach utilizes motion flows of the SIFT features in a scene. Unlike corner-based features which have been used commonly in other researches, SIFT features can represent characteristics parts of the objects. Therefore, their tracking consistency and accuracy are higher during complex motions. Second, we propose a hierarchical clustering framework to deal with the complexity of unstructured motion flows.
Entire scene is divided into equally sized local regions. In each local region, flow vectors are classified into groups based on their orientation. Then, location-based classification is applied to find the spatial accumulation of the vectors. Finally, local dominant motion flows are connected to obtain global dominant motion flows.
Figure 2. SIFT motion flow vectors.
1.1. Related Work Tracking individual objects and constructing the trajectories is a common approach to find the global motion flows as in [1, 6]. However, for crowd videos, continuous tracking of individual objects is not possible because of occlusion or failures. Another approach is to employ instantaneous flow vectors of image features in the entire image [3-5, 11]. They use corner-based features. But, these features are not reliable under non-rigid motion, affine transformation or noise. Hence, these researches consider only structured motions and do not work for unstructured crowds. In [4], they use neighborhood information, but it fails when a region contains flows with multiple directions eliminating each other. In [7], they propose floor fields, which are applicable for structured crowds. Only, the work in [2] considers unstructured crowd scenes where they try to track individual targets.
(a)
(b)
Figure 3. (a) Unstructured Crowd Scene. (b) SIFT flows in the marked region for 400 frames.
Figure 3(a) demonstrates an unstructured crowd scene. Motion flow map of the region in white square is depicted in 3(b). Motion flows are calculated for 400 frames with interval length 3. Accumulation of flow vectors can be seen in certain orientations. However, if the variety of orientations in the region increases, the flow map becomes very complicated. When entire scene is considered, data amount and complexity will be higher. In this case, common clustering methods [3] in the literature will not work effectively. We introduce a hierarchical clustering method to detect the dominant motion flows in the region, which is explained in the next section.
(a)
(a)
Finally, local dominant motion flows are connected to compute the global dominant motion flows.
Figure 5. Hierarchical clustering. are stated as horizontal flows, whereas groups I and IV are vertical flows. The algorithm is as follows: While scanning, for each local motion flow, 1. Determine the neighbor cells, Ns. 2. In each N, search for the motion flows that are in the same orientation group 3. Choose the closest one in the neighborhood and connect with the current flow. 4. If, there are not motion flows with the same orientation group in the neighbor cells and next neighbor cells, choose the motion flow that is the closest
(a)
Neighbor cells are defined as the two regions that are in the direction of the current flow. For example, in Figure 6(a), for the horizontal vector, the neighborhood cells are c, e and next neighbor cells are c, e. In Figure 6, the vectors shown with A are in orientation group II. A1 is connected to A3 and A2, A3 are connected to A4. Hence, they form the global flow shown with bold gray line. If there are not any vectors in the neighbor and next neighbor cells, then it is connected to the closest vector to keep the continuity. In which case, it means there is a dominant abrupt motion orientation change in that region. For example, if there wasnt A4 , A3 would be connected to B1.
(a)
(b)
(c)
(d)
5. Conclusions
In this work, we have presented a new approach to solve the problem of calculating dominant motion flows in various crowd scenes. By using SIFT feature flows and hierarchical clustering approach, it becomes possible to analyze the motion flows even for unstructured and structured crowds. The proposed approach can detect global motion flows, at the same time it can give information about local characteristics of the motion flows.
References
[1] F. M. Porikli, Trajectory Pattern Detection by HMM Parameter Space Features and Eigenvector Clustering, ECCV, 2004. [2] M. Rodriguez, S. Ali and T. Kanade, Tracking In Unstructured Crowded Scenes, ICCV, 2009. [3] G. Eibl, N. Brandle, Evaluation of Clustering Methods for Finding Dominant Optical Flow Fields in Crowded Scenes, ICPR, 2008. [4] M. Hu, S. Ali and M. Shah, Detecting Global Motion Patterns in Complex Videos, ICPR, 2008. [5] G. Brostow, R. Cipolla, Unsupervised Bayesian Detection of Independent Motion in Crowds, CVPR, 2006. [6] X. Wang et al., Learning Semantic Scene Models by Trajectory Analysis, ECCV, 2006. [7] S. Ali, M. Shah, Floor Fields for Tracking in High Density Crowd Scenes, ECCV, 2008. [8] B. D. Lucas and T. Kanade, An Iterative Image Registration Technique with an Application to Stereo Vision, IJCAI, 1981. [9] D. Lowe. Distinctive image features from scaleinvariant key points. Intl. J. of Computer Vision, 60(2):91110, 2004. [10] Y. Tsuduki, H. Fujiyoshi, A Method for Visualizing Pedestrian Traffic Flow using SIFT, PSIVT, 2009. [11] N. Ihaddadene, C. Djeraba, Real-time Crowd Motion Analysis, ICPR, 2008. [12] L. Zelnik-Manor, P. Perona, Self-Tuning Spectral Clustering, In Adv. Neur. Inf. Proc. Sys.: 16011608, 2004.