TY - GEN
T1 - Automatic counting of interacting people by using a single uncalibrated camera
AU - Velipasalar, Senem
AU - Tian, Ying Li
AU - Hampapur, Arun
PY - 2006
Y1 - 2006
N2 - Automatic counting of people, entering or exiting a region of interest, is very important for both business and security applications. This paper introduces an automatic and robust people counting system which can count multiple people who interact in the region of interest, by using only one camera. Two-level hierarchical tracking is employed. For cases not involving merges or splits, a fast blob tracking method is used. In order to deal with interactions among people in a more thorough and reliable way, the system uses the mean shift tracking algorithm. Using the first-level blob tracker in general, and employing the mean shift tracking only in the case of merges and splits saves power and makes the system computationally efficient. The system setup parameter can be automatically learned in a new environment from a 3 to 5 minute-video with people going in or out of the target region one at a time. With a 2GHz Pentium machine, the system runs at about 33fps on 320×240 images without code optimization. Average accuracy rates of 98.5% and 95% are achieved on videos with normal traffic flow and videos with many cases of merges and splits, respectively.
AB - Automatic counting of people, entering or exiting a region of interest, is very important for both business and security applications. This paper introduces an automatic and robust people counting system which can count multiple people who interact in the region of interest, by using only one camera. Two-level hierarchical tracking is employed. For cases not involving merges or splits, a fast blob tracking method is used. In order to deal with interactions among people in a more thorough and reliable way, the system uses the mean shift tracking algorithm. Using the first-level blob tracker in general, and employing the mean shift tracking only in the case of merges and splits saves power and makes the system computationally efficient. The system setup parameter can be automatically learned in a new environment from a 3 to 5 minute-video with people going in or out of the target region one at a time. With a 2GHz Pentium machine, the system runs at about 33fps on 320×240 images without code optimization. Average accuracy rates of 98.5% and 95% are achieved on videos with normal traffic flow and videos with many cases of merges and splits, respectively.
UR - http://www.scopus.com/inward/record.url?scp=34247640278&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247640278&partnerID=8YFLogxK
U2 - 10.1109/ICME.2006.262768
DO - 10.1109/ICME.2006.262768
M3 - Conference contribution
AN - SCOPUS:34247640278
SN - 1424403677
SN - 9781424403677
T3 - 2006 IEEE International Conference on Multimedia and Expo, ICME 2006 - Proceedings
SP - 1265
EP - 1268
BT - 2006 IEEE International Conference on Multimedia and Expo, ICME 2006 - Proceedings
T2 - 2006 IEEE International Conference on Multimedia and Expo, ICME 2006
Y2 - 9 July 2006 through 12 July 2006
ER -