Contribuição à abordagem de problemas de classificação por redes convolucionais profundas [Contribution to the approach to classification problems using deep convolutional networks]

Work

Year: 2018

Type: Thesis

Funding agency: CNPq

Degree: Doctorate

Discipline: Automation

University (higher education institution): UNICAMP

School/Department: Faculdade de Engenharia Elétrica e de Computação

Program: Doctorate in Electrical Engineering

Data source: UNICAMP DSpace

Author: Marques, Alan Caio Rodrigues

Advisor: Christiano Lyra Filho

Subject: Neural networks (Computing), Machine learning

Summary: Deep learning, a recent research topic in machine learning, has succeeded in producing models for pattern classification with large numbers of labels, for artificial intelligence applied to games, for speech transcription, for translation, and for other hard computational problems. Much of this prominence is due to convolutional networks, neural networks specialized in data whose parameters depend on their neighborhoods. Audio and images are examples of such data, since the parameters carry information only when evaluated together, forming patterns that can be recognized. This thesis develops applications based on convolutional networks for pattern identification in areas where machine learning techniques have been little explored. Specifically, it develops systems for three different classification tasks: classification of face shapes, classification of ant taxonomic genera, and classification of filters used to manipulate images. The main contributions resulting from the development of these applications concern the treatment of data before the training phase and the use of results from different models to improve classification quality. In the first application, the experiments showed that learning can be steered through changes to the input data, which helps in understanding and controlling the patterns the network extracts during learning. The second application shows that classification robustness can be increased by using multiple views (multiview) reinforced with an ensemble. In the third application, a methodology was developed to identify the information loss caused by applying filters to images; in addition, a methodology was developed to identify which filter-based manipulation process was applied.

Abstract: Deep learning is a recent area of investigation in machine learning. It has received much interest for achieving good results in classification tasks, mainly those with a large number of labels. Its domain of applications includes artificial intelligence applied to games, speech transcription, translation, and other challenging computational problems. Most of the successful applications are based on convolutional networks, neural network architectures specialized in data whose parameters depend on interactions with their neighbors. Audio and images are examples of such data, because their parameters only carry information when evaluated together, defining recognizable patterns. This thesis investigates applications of convolutional neural networks to identify patterns in areas where the use of machine learning techniques has not been fully explored. Three different systems for classification tasks are developed: classification of face shapes, classification of ant taxonomic genera, and classification of filters used to manipulate images. The main contributions resulting from these projects concern the procedures for treating data before the training phase of the networks and the use of results from different models to enhance the quality of the classification output. The first project shows that changes in the input data can be used to guide the learning process. The second project shows that it is possible to increase the robustness of the classification by using multiple views (multiview) reinforced with an ensemble. The third project develops a methodology for identifying the information loss caused by applying filters to images. Furthermore, it develops a methodology to identify which filtering process was applied to the images.

Funding: CNPq, process 141308/2014-1

Reference: MARQUES, Alan Caio Rodrigues. Contribuição à abordagem de problemas de classificação por redes convolucionais profundas. 2018. 1 online resource (121 p.). Thesis (doctorate) - Universidade Estadual de Campinas, Faculdade de Engenharia Elétrica e de Computação, Campinas, SP. Available at:
