Título del Documento: Investigación sobre Aprendizaje Profundo en Visión Computacional

Este documento presenta un resumen de los principales avances en aprendizaje profundo aplicado a visión por computadora. Se incluyen referencias a trabajos seminales y publicaciones recientes.

Referencias Principales:

1. AlexNet: Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Advances in neural information processing systems, 25. DOI: 10.1145/3065386

2. VGG: Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.

3. ResNet: He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778). DOI: 10.1109/CVPR.2016.90

4. Transformers para visión: Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., ... & Houlsby, N. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929.

5. YOLO: Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779-788).

Libros de Referencia:

- Deep Learning (Adaptive Computation and Machine Learning series) de Ian Goodfellow, Yoshua Bengio, Aaron Courville. MIT Press, 2016. ISBN: 978-0262035613

- Pattern Recognition and Machine Learning de Christopher M. Bishop. Springer, 2006. ISBN: 978-0387310732

- Computer Vision: Algorithms and Applications de Richard Szeliski. Springer, 2010. ISBN: 978-1848829343

Recursos en Línea:

- Tutorial sobre redes neuronales convolucionales: https://cs231n.github.io/convolutional-networks/
- Repositorio de implementaciones: https://github.com/pytorch/vision
- Dataset ImageNet: https://www.image-net.org/
- Papers with Code: https://paperswithcode.com/

Artículos Adicionales:

6. MobileNet: Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., ... & Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861.

7. EfficientNet: Tan, M., & Le, Q. V. (2019). Efficientnet: Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning (pp. 6105-6114). PMLR.

8. UNet: Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention (pp. 234-241). Springer, Cham. DOI: 10.1007/978-3-319-24574-4_28

9. StyleGAN: Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 4401-4410).

Referencias para Búsqueda Avanzada:

- DOI: 10.1038/s41586-021-03819-2
- DOI: 10.1109/TPAMI.2020.2983666
- DOI: 10.1007/s11263-020-01407-x

IDs de arXiv para descarga directa:
- arXiv:1506.02640 (YOLO v1)
- arXiv:1512.03385 (ResNet)
- arXiv:1704.04861 (MobileNet)
- arXiv:1905.11946 (EfficientNet)

URLs Académicas:
- https://ieeexplore.ieee.org/document/9356352
- https://dl.acm.org/doi/10.1145/3394171.3413521
- https://www.sciencedirect.com/science/article/pii/S0167865521001039
- https://link.springer.com/chapter/10.1007/978-3-030-58592-1_5

Identificadores Varios:

- PubMed ID: 33745078 (revisión sobre visión computacional en medicina)
- PubMed ID: 33423081 (aplicaciones de deep learning en radiología)
- ISBN adicional: 978-1492032649 (Deep Learning with Python)

Conferencias Relevantes:

- CVPR (Conference on Computer Vision and Pattern Recognition): https://cvpr2022.thecvf.com/
- ICCV (International Conference on Computer Vision): https://iccv2021.thecvf.com/
- ECCV (European Conference on Computer Vision): https://eccv2022.ecva.net/

Más Referencias en Texto Libre:

El trabajo seminal de LeCun et al. sobre backpropagation (DOI: 10.1038/323533a0) sentó las bases. Más recientemente, los transformers han dominado el campo NLP (arXiv:1706.03762) y ahora se aplican a visión.

Para implementaciones prácticas, consultar los repositorios oficiales:
- TensorFlow Models: https://github.com/tensorflow/models
- Detectron2: https://github.com/facebookresearch/detectron2
- MMDetection: https://github.com/open-mmlab/mmdetection

Referencias de Surveys:

10. Liu, L., Ouyang, W., Wang, X., Fieguth, P., Chen, J., Liu, X., & Pietikäinen, M. (2020). Deep learning for generic object detection: A survey. International journal of computer vision, 128(2), 261-318. DOI: 10.1007/s11263-019-01247-4

11. Minaee, S., Boykov, Y., Porikli, F., Plaza, A., Kehtarnavaz, N., & Terzopoulos, D. (2021). Image segmentation using deep learning: A survey. IEEE transactions on pattern analysis and machine intelligence, 44(7), 3523-3542. DOI: 10.1109/TPAMI.2021.3059968

Lista de DOIs para Pruebas Masivas:

10.1145/3065386
10.1109/CVPR.2016.90
10.1007/978-3-319-24574-4_28
10.1038/s41586-021-03819-2
10.1109/TPAMI.2020.2983666
10.1007/s11263-020-01407-x
10.1007/s11263-019-01247-4
10.1109/TPAMI.2021.3059968
10.1038/323533a0
10.1016/j.neunet.2020.01.001

Información de Contacto:

Para más detalles sobre esta investigación, contactar al autor principal:
Dr. Investigador Ejemplo
Universidad de Ejemplo
Correo: investigador@ejemplo.edu
Sitio web: https://www.ejemplo.edu/investigador

Repositorio de datos: https://zenodo.org/record/1234567
Código fuente: https://github.com/ejemplo/codigo-vision

---

Fin del documento de ejemplo.
Este archivo contiene múltiples tipos de referencias académicas:
- 15+ DOIs
- 5+ arXiv IDs
- 10+ URLs académicas
- 4+ ISBNs
- 2+ PubMed IDs
- Referencias en formato APA y texto libre

Última actualización: 2024