Total views : 232

Multicue Optimized Object Detection for Automatic Video Event Extraction


  • Computer Engineering Department, Fr. Conceicao Rodrigues College of Engineering, Mumbai − 400050, India
  • Dr Babasaheb Ambedkar Technological University, Lonere − 402103, India


Objective: To demonstrate the video event extraction system, combining the multicue optimized object detection with video semantic modelling. Methods/ Statistical Analysis: The literature established video semantic model has been recreated with a limited model. The multicue optimized object detector has been employed substituting the primary object detector in the system. The multicue object detector provides the inter object spatio-temporal cue information for event detection and aids to the semantic model. This similitude system establishes the proof of concept for the idea of incubating the multicue object identifier within a video extraction system for performance augmentation. The system had been tested for two different event extractions from a combination of six videos. Findings: The testing results had been presented for the two target events demonstrating the multicue object detector performing in frame by frame manner, for the test videos. The system performance measured and presented in terms of precision and recall indicates encouraging enhancements in the baseline system accuracy. Application/Improvement: The novelty of the work presented lies in clubbing the most advanced multicue optimized object detection technology with a semantic model based video extraction system, and thus bringing together the benefits of spatio-temporal cue method with a semantic model system.


Event Detection, Event Extraction, Multicue Object Identifier, Spatio-Temporal Cue, Video Content Modelling.

Full Text:

 |  (PDF views: 168)


  • Gelin P, Wellekens CJ. Keyword Spotting Enhancement for Video Soundtrack Indexing, ICSLP, IEEE Fourth International Conference on in Spoken Language. 1996; 2:586−89.
  • Saraceno C, Leonardi R. Audio as a Support to Scene Change Detection and Characterization of Video Sequences, IEEE International Conference on in Acoustics, Speech, and Signal Processing. 1997; 4:2597−600.
  • Tseng VS, Su J, Huang JH, Chen CJ. Integrated Mining of Visual Features, Speech Features, and Frequent Patterns for Semantic Video Annotation, IEEE Transactions on Multimedia. 2008; 10(2):260−67.
  • Yildirim Y, Yazici A, Yilmaz T. Automatic Semantic Content Extraction in Video using a Fuzzy Ontology and Rule based Model, IEEE Transactions on Knowledge of Data Engineering. 2013; 25(1).
  • Weber M, Welling M, Perona P. Towards Automatic Discovery of Object Categories, Proceedings of IEEE Conference Comput. Vis. Pattern Recognition; 2000. 2, p. 101-08.
  • Viola P, Jones M. Rapid Object Detection using a Boosted Cascade of Simple Features, Proceedings of IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognition (CVPR); 2001. 1, p. 511-18.
  • Zhang J, Marszalek M, Lazebnik S, Schmid C. Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study. Proceedings of Conf. Comput. Vis. Pattern Recognition Workshop (CVPRW); 2006. p. 13.
  • Leibe B, Leonardis A, Schiele B. Robust Object Detection with Interleaved Categorization And Segmentation, International Journal of Computer Vision. 2007; 77(13):259−89.
  • Qi B, Ghazal M, Amer A. Robust Global Motion Estimation Oriented to Video Object Segmentation, IEEE Trans. Image Process. 2008; 17(6):958−67.
  • Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D. Object Detection with Discriminatively Trained Part-Based Models, IEEE Trans. Pattern Anal. Mach. Intell. 2010; 32(9):1627−45.
  • Kim J, Ye G, Kim D. Moving Object Detection Under Free-Moving Camera. Proceedings of 17th IEEE Int. Conf. Image Process (ICIP); 2010. p. 4669−72.
  • Liu D, Shyu ML, Zhu Q, Chen SC. Moving Object Detection Under Object Occlusion Situations in Video Sequences. Proceedings of IEEE Int. Symp. Multimedia (ISM); 2011. p. 271−78.
  • Ayvaci A, Soatto S. Detachable Object Detection: Segmentation and Depth Ordering from Short-Baseline Video, IEEE Trans. Pattern Anal. Mach. Intell. 2012; 34(10):1942− 51.
  • Natarajan R, Chandrakala S. Audio-Based Event Detection in Videos – A Comprehensive Survey, International Journal of Engineering and Technology. 2014; 6(4):1-12.
  • Yan R, Zhang J, Yang J, Hauptmann AG. A Discriminative Learning Framework with Pair-Wise Constraints for Video Object Classification, IEEE Trans. Pattern Anal. Mach. Intell. 2006; 28(4):578−93.
  • Sheikh Y, Shah M. Bayesian Modeling of Dynamic Scenes for Object Detection, IEEE Trans. Pattern Anal. Mach. Intell. 2005l; 27(11):1778−92.
  • Yang M, Wu Y, Hua G. Context-Aware Visual Tracking, IEEE Trans. Pattern Anal. Mach. Intell. 2009; 31(7):1195209.
  • Fleitis F, Wang H, Chen S. Enhancing Product Detection with Multicue Optimization for TV Shopping Applications, IEEE Transaction on Emerging Topics in Computing. 2015; 3(2):161−71.
  • Davis LS, Fejes S, Harwood D, Yacoob Y, Haratoglu I, Black MJ. Visual Surveillance of Human Activity. Proceedings of Third Asian Conf. Computer Vision (ACCV); 1998. 2, p. 267−74.
  • Medioni GG, Cohen I, Bre´mond F, Hongeng S, Nevatia R. Event Detection and Analysis from Video Streams, IEEE Trans. Pattern Analysis Machine Intelligence. 2001; 23(8):873−89.
  • Daga BS, Bhute AN, Ghatol AA. Implementation of Parallel Image Processing using NVIDIA GPU Framework, Springer-Verlag Berlin Heidelberg, Communications in Computer and Information Science. 2011; 125:457−64.
  • Daga BS, Content based Video Retrieval using Color Feature: An Integration Approach, Springer-Verlag Berlin Heidelberg, Communications in Computer and Information Science. 2013; 361:609-19.
  • Do¨nderler ME, Saykol E, Arslan U, Ulusoy O, Gu¨du¨kbay G. Video: Design and Implementation of a Video Database Management System, Multimedia Tools Applications. 2005; 27(1):79-104.
  • Fan J, Aref W, Elmagarmid A, Hacid M, Marzouk M, Zhu X. Multiview: Multilevel Video Content Representation and Retrieval, J. Electronic Imaging. 2001; 10(4):895−908.
  • Fan J, Elmagarmid AK, Zhu X, Aref WG, Wu L. Classview: Hierarchical Video Shot Classification, Indexing, and Accessing, IEEE Trans. Multimedia. 2004; 6(1):70−86.
  • Nevatia R, Natarajan P. EDF: A Framework for Semantic Annotation of Video. Proceedings of 10th IEEE Int’l Conf. Computer Vision Workshops (ICCVW ’05); 2005. p. 1876.
  • Sangeetha M, Anandakumar K. Annotation based Image Retrieval System by Mining of Semantically Related User Queries with Improved Markovian Model, Indian Journal of Science and Technology. 2015; 8(35):1−8.
  • Bhatia MPS, Kumar A, Beniwal R. Ontologies for Software Engineering: Past, Present and Future. Indian Journal of Science and Technology. 2016; 9(9):1-16.
  • Nikam P, Nandwalkar BR. Video Semantic Content Extraction using K-Means and Fuzzy Ontology, International Journal of Modern Trends in Engineering and Research. 2015; 2(7):129-33.
  • Shoba G, Kalaitchelvi S. Extraction of Video Semantic Content using Fuzzy Rule based Model, International Journal of Engineering Research and Technology. 2013; 2(12).


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.