YOLOv11-LCA: YOLOv11 Enhanced with the Low-Complexity Attention Mechanisms for a Robust Waterway-Floating Trash Detection
DOI:
https://doi.org/10.37385/dpqnng18
Keywords:
Floating Trash Detection, YOLOv11, Attention Mechanism, Deep Learning, Waterway Monitoring
Abstract
Accurate trash detection in aquatic environments remains a significant challenge: existing detection models have persistent difficulty identifying small and partially submerged objects. In addition, there is a notable gap in methodologies for fine-tuning a detection model to optimize its performance for a specific waterway. This study addresses both limitations through two objectives. The first is to develop a detection model with improved performance on small and partially submerged trash; the second is to establish a framework for efficiently adapting that model to achieve high accuracy in local waterways. For the first objective, the YOLOv11 architecture is enhanced by integrating the LCAM and LCBHAM attention mechanisms and pre-trained on various combinations of public datasets to establish a robust baseline model. For the second objective, this baseline model is adapted using a data-efficient framework. This process introduces the BojongTrash dataset, captured from a specific waterway, and systematically fine-tunes the model on incremental subsets of this data to determine the minimum number of images and training epochs required to achieve high accuracy in the target environment. The proposed YOLOv11s-LCA architecture demonstrated a statistically validated improvement over its baseline, increasing the mAP50 score from 0.779 to 0.836 on the FloW-Img dataset with only a 0.1% increase in parameters. Furthermore, the fine-tuning framework proved highly efficient: a peak mAP50 of 0.908 was achieved by fine-tuning on 1,000 images for only 3–5 epochs. This research therefore validates lightweight attention mechanisms as an efficient strategy for enhancing detection in complex environments and provides a practical framework that enables the rapid deployment of tailored, high-accuracy monitoring systems.
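The incremental fine-tuning search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `tolerance` parameter, and the `train_and_eval` callable (which stands in for one fine-tuning run that returns an mAP50 score) are all hypothetical, and the selection rule (fewest images, then fewest epochs, among configurations close to the best score) is an assumption about how a "minimum quantity" might be chosen.

```python
import random
from typing import Callable, Dict, List, Sequence, Tuple


def nested_subsets(
    image_ids: Sequence[str], sizes: Sequence[int], seed: int = 0
) -> Dict[int, List[str]]:
    """Build incremental subsets; each larger subset contains every smaller one."""
    shuffled = list(image_ids)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the subsets reproducible
    # Sizes larger than the available data are skipped rather than padded.
    return {n: shuffled[:n] for n in sorted(sizes) if n <= len(shuffled)}


def smallest_sufficient_config(
    subsets: Dict[int, List[str]],
    epoch_grid: Sequence[int],
    train_and_eval: Callable[[List[str], int], float],
    tolerance: float = 0.01,
) -> Tuple[Tuple[int, int], Dict[Tuple[int, int], float]]:
    """Score every (subset size, epochs) pair and pick the cheapest near-best one.

    train_and_eval is a hypothetical hook: it should fine-tune the pre-trained
    baseline on the given image IDs for the given number of epochs and return
    the resulting mAP50 on a held-out set from the target waterway.
    """
    scores: Dict[Tuple[int, int], float] = {}
    for n, subset in subsets.items():
        for epochs in epoch_grid:
            scores[(n, epochs)] = train_and_eval(subset, epochs)

    best = max(scores.values())
    near_best = [cfg for cfg, score in scores.items() if score >= best - tolerance]
    # Tuples compare lexicographically: fewest images first, then fewest epochs.
    return min(near_best), scores
```

In practice, `train_and_eval` would wrap whatever training and validation routine is used for the detector; keeping it as a plain callable lets the search logic be checked with a stub before any GPU time is spent.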








