
17 September 2020

Computer Vision with Deep Learning

Juan Manuel Rendón Mancha

rendon@uaem.mx

UNIVERSIDAD AUTÓNOMA DEL ESTADO DE MORELOS


Computer Vision

Digital images are sets of numbers
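This point can be made concrete with a few lines of NumPy (an illustrative sketch, not code from the talk): a grayscale image is a 2-D array of pixel intensities, and a color image adds a third axis for the RGB channels.

import numpy as np

# A grayscale digital image is just a 2-D array of numbers:
# each entry is the brightness of one pixel (0 = black, 255 = white).
gray = np.zeros((4, 4), dtype=np.uint8)  # a tiny 4 x 4 black image
gray[1:3, 1:3] = 255                     # a white 2 x 2 square in the center
print(gray)

# A color image is a 3-D array: height x width x 3 (R, G, B channels).
rgb = np.zeros((4, 4, 3), dtype=np.uint8)
rgb[..., 0] = 255                        # every pixel pure red
print(rgb.shape)                         # (4, 4, 3)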

Computer Vision

Classical Techniques vs. Deep Learning

Artificial neuron
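A single artificial neuron computes a weighted sum of its inputs plus a bias and passes the result through a nonlinear activation. A minimal NumPy sketch (with made-up values, not code from the talk):

import numpy as np

def neuron(x, w, b):
    # Weighted sum of the inputs plus a bias term...
    z = np.dot(w, x) + b
    # ...passed through a sigmoid activation to produce the output.
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.5, -1.2, 3.0])   # hypothetical input values
w = np.array([0.4, 0.6, -0.1])   # hypothetical learned weights
print(neuron(x, w, b=0.2))       # the neuron's activation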

Deep Learning

Recent work

Detection of fig tree leaves in aerial images

Remote Sens. 2019, 11, 1157

4.1. Approach

Our CNN is inspired by the SegNet architecture [14], which uses the principles of an encoder-decoder architecture to perform pixel-wise semantic segmentation. Unlike SegNet, our architecture comprises only 7 learnable layers, as follows. The encoder section has 4 convolutional layers and 3 pooling layers to generate a low-resolution representation. The decoder part has 3 convolutional layers and 3 upsampling layers for producing pixel-wise predictions. In Figure 3 we present a scheme of our approach. We use fewer convolutional layers and a smaller number of trainable parameters than SegNet-Basic, the smaller version of SegNet, making it a simpler model. Likewise, we discard the multi-class softmax classifier as the final layer because we expect only 2 classes. Instead, a sigmoid layer is used at the output to predict the probability that each pixel belongs to one class or the other.


Figure 3. A scheme of the proposed architecture.

The input of our CNN is a 128 × 128 RGB patch and the output is a 128 × 128 greyscale patch; details about image sizes are presented in the next section. For training, each pixel value is first normalized to the range 0 to 1. The patch is passed through a set of different convolutional layers, where we use relatively large receptive fields (7 × 7) for the first layer and very small receptive fields (3 × 3) for the rest. The convolution stride is fixed to 1 pixel and zero-padding is used for all layers. Two activation functions are used: a sigmoid activation is applied after the last convolutional layer, and for the rest a Rectified Linear Unit (ReLU) activation is employed in order to introduce nonlinearities. Max-pooling is done over 2 × 2 windows with stride 2. Upsampling is performed by a factor of 2 × 2 to increase the resolution of the image. In Table 2 we present a summary of our proposed CNN architecture. The convolutional layer parameters are denoted as “[receptive field size] [number of channels] [image size]”.

Table 2. Summary of the CNN configuration. The first column indicates the size of the filters, while the second column details the output of the layer.

Details of Our CNN

Input (128 × 128 × 3 image)

[7 × 7]      [32] [128 × 128]
pooling
[3 × 3]      [8] [64 × 64]
pooling
[3 × 3]      [8] [32 × 32]
pooling
[3 × 3]      [8] [16 × 16]
upsampling
[3 × 3]      [8] [32 × 32]
upsampling
[3 × 3]      [16] [64 × 64]
upsampling
[3 × 3]      [1] [128 × 128]

Output (128 × 128 × 1 image)

The best set of hyperparameters defining the structure of the network (e.g., number of layers, number and size of filters) was determined from experience and by performing a series of experiments.
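To make the architecture concrete, here is a minimal sketch of the Table 2 configuration in Keras. It is reconstructed only from the description above; the framework, optimizer, and loss function are assumptions, since the excerpt does not say which were used.

from tensorflow import keras
from tensorflow.keras import layers

def build_model():
    # 128 x 128 RGB patch; pixel values assumed pre-normalized to [0, 1].
    inputs = keras.Input(shape=(128, 128, 3))

    # Encoder: 4 convolutional layers and 3 max-pooling layers (Table 2).
    x = layers.Conv2D(32, 7, padding="same", activation="relu")(inputs)  # [7 x 7] [32] [128 x 128]
    x = layers.MaxPooling2D(2, strides=2)(x)
    x = layers.Conv2D(8, 3, padding="same", activation="relu")(x)        # [3 x 3] [8] [64 x 64]
    x = layers.MaxPooling2D(2, strides=2)(x)
    x = layers.Conv2D(8, 3, padding="same", activation="relu")(x)        # [3 x 3] [8] [32 x 32]
    x = layers.MaxPooling2D(2, strides=2)(x)
    x = layers.Conv2D(8, 3, padding="same", activation="relu")(x)        # [3 x 3] [8] [16 x 16]

    # Decoder: 3 convolutional layers and 3 upsampling layers.
    x = layers.UpSampling2D(2)(x)
    x = layers.Conv2D(8, 3, padding="same", activation="relu")(x)        # [3 x 3] [8] [32 x 32]
    x = layers.UpSampling2D(2)(x)
    x = layers.Conv2D(16, 3, padding="same", activation="relu")(x)       # [3 x 3] [16] [64 x 64]
    x = layers.UpSampling2D(2)(x)

    # Final layer: a sigmoid (not a softmax) gives the per-pixel
    # probability of belonging to one class or the other.
    outputs = layers.Conv2D(1, 3, padding="same", activation="sigmoid")(x)  # [3 x 3] [1] [128 x 128]
    return keras.Model(inputs, outputs)

model = build_model()
# Binary cross-entropy and Adam are assumptions; the excerpt does not name them.
model.compile(optimizer="adam", loss="binary_crossentropy")

With zero-padding and stride 1 in every convolution, the spatial sizes follow Table 2 exactly: 128 → 64 → 32 → 16 through the encoder and back up to 128 through the decoder.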


Current projects

• Face detection (undergraduate thesis: Fredy Marín Flores)

In collaboration with Dr. Jorge Alberto Fuentes Pacheco:

• Plant classification (master's thesis: Roger Salinas González; doctoral thesis: Osvaldo Sáez Lombira)

• Autonomous drone navigation (doctoral theses: Victor Hugo Santos Pucheta and Alain Serrano Roque)
