Deep learning-based oriented object detection in remote sensing imagery: YOLOv7-OBB

Santos, Pietro Terra Pizzutti dos

dc.contributor.advisor	Gomes, Natanael Rodrigues
dc.creator	Santos, Pietro Terra Pizzutti dos
dc.date.accessioned	2023-02-03T16:00:40Z
dc.date.available	2023-02-03T16:00:40Z
dc.date.issued	2023-01-25
dc.date.submitted	2023
dc.identifier.citation	SANTOS, P. T. P. dos. Deep Learning-Based Oriented Object Detection in Remote Sensing Imagery: YOLOv7-OBB. 2023. 86 p. Trabalho de Conclusão de Curso (Graduação em Engenharia de Telecomunicações) - Universidade Federal de Santa Maria, Santa Maria, RS, 2023.	por
dc.identifier.uri	http://repositorio.ufsm.br/handle/1/27722
dc.description	Trabalho de conclusão de curso (graduação) - Universidade Federal de Santa Maria, Centro de Tecnologia, Curso de Engenharia de Telecomunicações, RS, 2023.	por
dc.description.abstract	Remote sensing (RS) is the act of processing and extracting meaningful features about the ground and objects observed at a distance, usually from a much higher position from aircraft and satellites. Due to the large field of coverage in RS imagery, object detection in these images can be really useful, gathering a broad and concise notion of the objects present in certain areas. Due to their great capability of assimilating intricate patterns, Deep Learning (DL) models have achieved state-of-the-art (SOTA) performance in computer vision tasks. In this project, an extensive research is conducted on current DL-based object detection models and a suitable model, YOLOv7, is chosen to serve as a baseline for modifications to enable a high performance oriented bounding-box (OBB) detector in RS imagery. In supervised DL models, their final performance is very dependent on the quality of their training. To improve it, large datasets covering the specific task are pursued, converging to the use of DOTA dataset. Moreover, the concept of transfer learning is employed to allow the use of a pre-trained model on a very large dataset with different tasks. The final model is evaluated on common object detection metrics, such as the confusion matrix, precision, and recall curves. They validate the detector, capable of identifying 16 object classes with SOTA performance: high accuracy, fast and with the latest oriented bounding-box. Comparing the confusion matrices of the developed model and YOLOv5-OBB (KAIXUAN, 2022), for instance, it correctly identifies with a probability of 0.97, 0.89, 0.67 and 0.67% the following classes: plane, baseball diamond, bridge and ground track field. Meanwhile the YOLOv5-OBB obtains 0.96, 0.83, 0.6 and 0.6% for the same respective classes. Another interesting point is the reduction from 0.73 to 0.69% in the probability of mistaking the background for a small-vehicle. The model can further be trained on custom datasets for detection in agriculture, livestock, militarily, etc., bringing implications for many areas and activities. The repository containing all the codes used and developed in this project is available at (SANTOS, 2022).	eng
dc.language	eng	por
dc.publisher	Universidade Federal de Santa Maria	por
dc.rights	Acesso Aberto	por
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Machine learning	eng
dc.subject	Deep learning	eng
dc.subject	Remote sensing imagery	eng
dc.subject	Object detection	eng
dc.subject	Oriented bounding-box	eng
dc.subject	YOLOv7	eng
dc.subject	DOTA dataset	eng
dc.title	Deep learning-based oriented object detection in remote sensing imagery: YOLOv7-OBB	por
dc.title.alternative	Deep learning-based oriented object detection in remote sensing imagery: YOLOv7-OBB	eng
dc.type	Trabalho de Conclusão de Curso de Graduação	por
dc.degree.local	Santa Maria, RS, Brasil.	por
dc.degree.graduation	Engenharia de Telecomunicações	por
dc.description.resumo	Remote sensing (RS) is the act of processing and extracting meaningful features about the ground and objects observed at a distance, usually from a much higher position from aircraft and satellites. Due to the large field of coverage in RS imagery, object detection in these images can be really useful, gathering a broad and concise notion of the objects present in certain areas. Due to their great capability of assimilating intricate patterns, Deep Learning (DL) models have achieved state-of-the-art (SOTA) performance in computer vision tasks. In this project, an extensive research is conducted on current DL-based object detection models and a suitable model, YOLOv7, is chosen to serve as a baseline for modifications to enable a high performance oriented bounding-box (OBB) detector in RS imagery. In supervised DL models, their final performance is very dependent on the quality of their training. To improve it, large datasets covering the specific task are pursued, converging to the use of DOTA dataset. Moreover, the concept of transfer learning is employed to allow the use of a pre-trained model on a very large dataset with different tasks. The final model is evaluated on common object detection metrics, such as the confusion matrix, precision, and recall curves. They validate the detector, capable of identifying 16 object classes with SOTA performance: high accuracy, fast and with the latest oriented bounding-box. Comparing the confusion matrices of the developed model and YOLOv5-OBB (KAIXUAN, 2022), for instance, it correctly identifies with a probability of 0.97, 0.89, 0.67 and 0.67% the following classes: plane, baseball diamond, bridge and ground track field. Meanwhile the YOLOv5-OBB obtains 0.96, 0.83, 0.6 and 0.6% for the same respective classes. Another interesting point is the reduction from 0.73 to 0.69% in the probability of mistaking the background for a small-vehicle. The model can further be trained on custom datasets for detection in agriculture, livestock, militarily, etc., bringing implications for many areas and activities. The repository containing all the codes used and developed in this project is available at (SANTOS, 2022).	por
dc.publisher.country	Brasil	por
dc.publisher.initials	UFSM	por
dc.subject.cnpq	CNPQ::ENGENHARIAS::ENGENHARIA ELETRICA::TELECOMUNICACOES	por
dc.publisher.unidade	Centro de Tecnologia	por

Arquivos deste item

Nome:: Santos_Pietro_Terra_Pizzutti_d ...
Tamanho:: 58.87Mb
Formato:: PDF
Descrição:: TCC de Graduação

Visualizar/Abrir

Nome:: license_rdf
Tamanho:: 805bytes
Formato:: application/rdf+xml

Visualizar/Abrir

Este item aparece na(s) seguinte(s) coleção(s)

TCC Engenharia de Telecomunicações [13]

Mostrar registro simples

Exceto quando indicado o contrário, a licença deste item é descrito como Acesso Aberto