From 8056f79eed1b477bd32da3f3e9b88f60a6a2f765 Mon Sep 17 00:00:00 2001 From: Davide Rigoni Date: Mon, 31 Oct 2022 14:53:16 +0100 Subject: [PATCH 1/2] New visual grounding paper Add reference to a new visual grounding paper --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index f82c3e3..bc84ff5 100644 --- a/README.md +++ b/README.md @@ -256,6 +256,8 @@ ConvNet (SCRC)* [[Paper]](https://www.cv-foundation.org/openaccess/content_cvpr_ 1. Kamath, Aishwarya, et al. **MDETR--Modulated Detection for End-to-End Multi-Modal Understanding.** arXiv preprint arXiv:2104.12763 (2021). [[Paper]](https://arxiv.org/pdf/2104.12763) +1. Rigoni, Davide, Luciano Serafini, and Alessandro Sperduti. "A better loss for visual-textual grounding." Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing. 2022. [[Paper]](https://dl.acm.org/doi/pdf/10.1145/3477314.3507047?casa_token=w2ChGKkPGtIAAAAA:jC1s0t1rjQC8PX-ttq9Utn9cnDqKmsGHrck8MjQYDkWhS03BdiYhYko-vXE7DeMWBizqRRb_KjvD) [[Code]](https://github.com/drigoni/Loss_VT_Grounding) + ### Natural Language Object Retrieval (Images) 1. Guadarrama, Sergio, et al. **Open-vocabulary Object Retrieval.** Robotics: science and systems. Vol. 2. No. 5. 2014. [[Paper]](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.434.3000&rep=rep1&type=pdf) [[Code]](http://openvoc.berkeleyvision.org/) From de7395779c96de486fe63789d48294beb6bd2614 Mon Sep 17 00:00:00 2001 From: Davide Rigoni Date: Mon, 31 Oct 2022 15:07:29 +0100 Subject: [PATCH 2/2] Update font Update title with bold --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index bc84ff5..efde26d 100644 --- a/README.md +++ b/README.md @@ -256,7 +256,7 @@ ConvNet (SCRC)* [[Paper]](https://www.cv-foundation.org/openaccess/content_cvpr_ 1. Kamath, Aishwarya, et al. **MDETR--Modulated Detection for End-to-End Multi-Modal Understanding.** arXiv preprint arXiv:2104.12763 (2021). [[Paper]](https://arxiv.org/pdf/2104.12763) -1. Rigoni, Davide, Luciano Serafini, and Alessandro Sperduti. "A better loss for visual-textual grounding." Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing. 2022. [[Paper]](https://dl.acm.org/doi/pdf/10.1145/3477314.3507047?casa_token=w2ChGKkPGtIAAAAA:jC1s0t1rjQC8PX-ttq9Utn9cnDqKmsGHrck8MjQYDkWhS03BdiYhYko-vXE7DeMWBizqRRb_KjvD) [[Code]](https://github.com/drigoni/Loss_VT_Grounding) +1. Rigoni, Davide, Luciano Serafini, and Alessandro Sperduti. **A better loss for visual-textual grounding.** Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing. 2022. [[Paper]](https://dl.acm.org/doi/pdf/10.1145/3477314.3507047?casa_token=w2ChGKkPGtIAAAAA:jC1s0t1rjQC8PX-ttq9Utn9cnDqKmsGHrck8MjQYDkWhS03BdiYhYko-vXE7DeMWBizqRRb_KjvD) [[Code]](https://github.com/drigoni/Loss_VT_Grounding) ### Natural Language Object Retrieval (Images)