Open this publication in new window or tab >>2025 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]
This thesis shows our investigation on scene understanding and object detection for surveillance applications, emphasizing context-aware computer vision models that enhance detection accuracy in complex environments while respecting privacy considerations. The research advances object detection by addressing key aspects such as variability across environments, contextual information, and multimodal data fusion. Through a comprehensive literature review, we examines the role of contextual information, such as spatial, scale, and temporal context, in improving detection performance. Furthermore, we introduce specialized object detection models designed for indoor and outdoor environments, demonstrating howscene-specific training enhances detection accuracy. We also explore hierarchical scene classification, analyzing how different levels contribute to scene recognition. Lastly, a multimodal fall detection method integrating video and audio is proposed, overcoming limitations of purely visual systems in obstructed or low-visibility conditions. The findings of all papers highlight the effectiveness of scene context, hierarchical classification, and multimodal fusion in developing robust, high-accuracy surveillance models suitable for real-world environments.
Place, publisher, year, edition, pages
Malmö: Malmö University Press, 2025. p. 35
Series
Studies in Computer Science ; 34
Keywords
Object detection, Scene classification, Vision based systems, Multimodal learning, Context-aware learning
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:mau:diva-75404 (URN)10.24834/isbn.9789178776238 (DOI)978-91-7877-622-1 (ISBN)978-91-7877-623-8 (ISBN)
Presentation
2025-04-24, B1, Niagara, Malmö University, Malmö, 10:00 (English)
Opponent
Supervisors
2025-04-162025-04-142025-04-17Bibliographically approved