-
Notifications
You must be signed in to change notification settings - Fork 6
/
2020.05.26.txt
1419 lines (1164 loc) · 109 KB
/
2020.05.26.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
==========New Papers==========
1, TITLE: Rethinking of Pedestrian Attribute Recognition: Realistic Datasets with Efficient Method
http://arxiv.org/abs/2005.11909
AUTHORS: Jian Jia ; Houjing Huang ; Wenjie Yang ; Xiaotang Chen ; Kaiqi Huang
COMMENTS: 12 pages, 4 figures
HIGHLIGHT: To address this problem, we propose two realistic datasets PETA\textsubscript{$zs$} and RAPv2\textsubscript{$zs$} following zero-shot setting of pedestrian identities based on PETA and RAPv2 datasets.
2, TITLE: Adaptive Adversarial Logits Pairing
http://arxiv.org/abs/2005.11904
AUTHORS: Shangxi Wu ; Jitao Sang ; Kaiyuan Xu ; Guanhua Zheng ; Changsheng Xu
HIGHLIGHT: Based on the analysis of state-of-the-art defense solution Adversarial Logits Pairing (ALP), we observed in this work that: (1) The inference of adversarially robust models tends to rely on fewer high-contribution features compared with vulnerable ones.
3, TITLE: mr2NST: Multi-Resolution and Multi-Reference Neural Style Transfer for Mammography
http://arxiv.org/abs/2005.11926
AUTHORS: Sheng Wang ; Jiayu Huo ; Xi Ouyang ; Jifei Che ; Xuhua Ren ; Zhong Xue ; Qian Wang ; Jie-Zhi Cheng
HIGHLIGHT: In this study, we explicitly address style variety issue with the proposed multi-resolution and multi-reference neural style transfer (mr2NST) network.
4, TITLE: Visual Localization Using Semantic Segmentation and Depth Prediction
http://arxiv.org/abs/2005.11922
AUTHORS: Huanhuan Fan ; Yuhao Zhou ; Ang Li ; Shuang Gao ; Jijunnan Li ; Yandong Guo
HIGHLIGHT: In this paper, we propose a monocular visual localization pipeline leveraging semantic and depth cues.
5, TITLE: A Bayesian-inspired, deep learning, semi-supervised domain adaptation technique for land cover mapping
http://arxiv.org/abs/2005.11930
AUTHORS: Benjamin Lucas ; Charlotte Pelletier ; Daniel Schmidt ; Geoffrey I. Webb ; François Petitjean
HIGHLIGHT: In this paper we present Sourcerer, a Bayesian-inspired, deep learning-based, semi-supervised DA technique for producing land cover maps from SITS data.
6, TITLE: Multi-Margin based Decorrelation Learning for Heterogeneous Face Recognition
http://arxiv.org/abs/2005.11945
AUTHORS: Bing Cao ; Nannan Wang ; Xinbo Gao ; Jie Li ; Zhifeng Li
COMMENTS: IJCAI 2019
HIGHLIGHT: This paper presents a deep neural network approach namely Multi-Margin based Decorrelation Learning (MMDL) to extract decorrelation representations in a hyperspherical space for cross-domain face images.
7, TITLE: Interlayer and Intralayer Scale Aggregation for Scale-invariant Crowd Counting
http://arxiv.org/abs/2005.11943
AUTHORS: Mingjie Wang ; Hao Cai ; Jun Zhou ; Minglun Gong
HIGHLIGHT: To overcome these limitations, a Single-column Scale-invariant Network (ScSiNet) is presented in this paper, which extracts sophisticated scale-invariant features via the combination of interlayer multi-scale integration and a novel intralayer scale-invariant transformation (SiT).
8, TITLE: An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling
http://arxiv.org/abs/2005.11950
AUTHORS: Bi-Cheng Yan ; Meng-Che Wu ; Hsiao-Tsung Hung ; Berlin Chen
COMMENTS: 5 pages, 2 figures, Submitted to INTERSPEECH 2020
HIGHLIGHT: In view of this, we propose to conduct MDD with a novel end- to-end automatic speech recognition (E2E-based ASR) approach.
9, TITLE: Non-Destructive Sample Generation From Conditional Belief Functions
http://arxiv.org/abs/2005.11963
AUTHORS: Mieczysław A. Kłopotek
HIGHLIGHT: This paper presents a new approach to generate samples from conditional belief functions for a restricted but non trivial subset of conditional belief functions.
10, TITLE: Keypoints Localization for Joint Vertebra Detection and Fracture Severity Quantification
http://arxiv.org/abs/2005.11960
AUTHORS: Maxim Pisov ; Vladimir Kondratenko ; Alexey Zakharov ; Alexey Petraikin ; Victor Gombolevskiy ; Sergey Morozov ; Mikhail Belyaev
COMMENTS: Accepted to MICCAI-2020
HIGHLIGHT: We propose a new two-step algorithm to localize the vertebral column in 3D CT images and then to simultaneously detect individual vertebrae and quantify fractures in 2D.
11, TITLE: On Irrelevance of Attributes in Flexible Prediction
http://arxiv.org/abs/2005.11979
AUTHORS: Mieczyslaw A. Klopotek ; Andrzej Matuszewski
HIGHLIGHT: On Irrelevance of Attributes in Flexible Prediction
12, TITLE: Hyperspectral Image Classification with Attention Aided CNNs
http://arxiv.org/abs/2005.11977
AUTHORS: Renlong Hang ; Zhu Li ; Qingshan Liu ; Pedram Ghamisi ; Shuvra S. Bhattacharyya
HIGHLIGHT: Along this direction, we propose an attention aided CNN model for spectral-spatial classification of hyperspectral images.
13, TITLE: Deep Learning Models for Automatic Summarization
http://arxiv.org/abs/2005.11988
AUTHORS: Pirmin Lemberger
COMMENTS: 13 pages, 5 figures
HIGHLIGHT: We will discuss in particular applications of pointer networks, hierarchical Transformers and Reinforcement Learning.
14, TITLE: Pointwise Paraphrase Appraisal is Potentially Problematic
http://arxiv.org/abs/2005.11996
AUTHORS: Hannah Chen ; Yangfeng Ji ; David Evans
HIGHLIGHT: This pointwise-based evaluation method does not match well the objective of most real world applications, so the goal of our work is to understand how models which perform well under pointwise evaluation may fail in practice and find better methods for evaluating paraphrase identification models.
15, TITLE: Eye Gaze Controlled Robotic Arm for Persons with SSMI
http://arxiv.org/abs/2005.11994
AUTHORS: Vinay Krishna Sharma ; L. R. D. Murthy ; KamalPreet Singh Saluja ; Vimal Mollyn ; Gourav Sharma ; Pradipta Biswas
COMMENTS: Citation: VK Sharma, KPS Saluja, LRD Murthy, G Sharma and P Biswas, Webcam Controlled Robotic Arm for Persons with SSMI, Technology and Disability 32 (3), IOS Press 2020 [Official journal of EU AAATE association]
HIGHLIGHT: Objective: We aimed to automate the eye tracking process electronically by using commercially available tablet, computer or laptop and without requiring any dedicated hardware for eye gaze tracking.
16, TITLE: Coronavirus: Comparing COVID-19, SARS and MERS in the eyes of AI
http://arxiv.org/abs/2005.11524
AUTHORS: Anas Tahir ; Yazan Qiblawey ; Amith Khandakar ; Tawsifur Rahman ; Uzair Khurshid ; Farayi Musharavati ; Serkan Kiranyaz ; Muhammad E. H. Chowdhury
COMMENTS: 10 Figures, 4 Tables
HIGHLIGHT: In this work, authors used deep machine learning algorithms along with innovative image pre-processing techniques to distinguish COVID-19 images from SARS and MERS images.
17, TITLE: Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
http://arxiv.org/abs/2005.11424
AUTHORS: Xiangjue Dong ; Changmao Li ; Jinho D. Choi
COMMENTS: To be published in ACL FigLang2020
HIGHLIGHT: We present a transformer-based sarcasm detection model that accounts for the context from the entire conversation thread for more robust predictions.
18, TITLE: Hashing-based Non-Maximum Suppression for Crowded Object Detection
http://arxiv.org/abs/2005.11426
AUTHORS: Jianfeng Wang ; Xi Yin ; Lijuan Wang ; Lei Zhang
HIGHLIGHT: In this paper, we propose an algorithm, named hashing-based non-maximum suppression (HNMS) to efficiently suppress the non-maximum boxes for object detection.
19, TITLE: Multi-view polarimetric scattering cloud tomography and retrieval of droplet size
http://arxiv.org/abs/2005.11423
AUTHORS: Aviad Levis ; Yoav Y. Schechner ; Anthony B. Davis ; Jesse Loveridge
HIGHLIGHT: This work defines and derives a framework for a full 3D tomography of cloud droplets for both their mass concentration in space and their distribution across a range of sizes.
20, TITLE: S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation
http://arxiv.org/abs/2005.11437
AUTHORS: Yizhe Zhu ; Martin Renqiang Min ; Asim Kadav ; Hans Peter Graf
COMMENTS: to appear in CVPR2020
HIGHLIGHT: We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e.g., videos and audios) under self-supervision.
21, TITLE: Fine-Grain Few-Shot Vision via Domain Knowledge as Hyperspherical Priors
http://arxiv.org/abs/2005.11450
AUTHORS: Bijan Haney ; Alexander Lavin
HIGHLIGHT: We describe how to construct a hypersphere of prototypes that embed a-priori domain information, and demonstrate the effectiveness of the approach on challenging benchmark datasets for fine-grain classification, with top results for one-shot classification and 5x speedups in training time.
22, TITLE: Designing with Static Capabilities and Effects: Use, Mention, and Invariants
http://arxiv.org/abs/2005.11444
AUTHORS: Colin S. Gordon
COMMENTS: Preprint of ECOOP 2020 paper
HIGHLIGHT: Along the way, we highlight how seemingly-minor aspects of type systems -- weakening/framing and the mere existence of type contexts -- play a subtle role in the efficacy of these systems.
23, TITLE: COVID-19 Public Opinion and Emotion Monitoring System Based on Time Series Thermal New Word Mining
http://arxiv.org/abs/2005.11458
AUTHORS: Yixian Zhang ; Jieren Chen ; Boyi Liu ; Yifan Yang ; Haocheng Li ; Xinyi Zheng ; Xi Chen ; Tenglong Ren ; Naixue Xiong
HIGHLIGHT: We designed and implemented the COVID-19 public opinion monitoring system based on time series thermal new word mining.
24, TITLE: Delving into the Imbalance of Positive Proposals in Two-stage Object Detection
http://arxiv.org/abs/2005.11472
AUTHORS: Zheng Ge ; Zequn Jie ; Xin Huang ; Chengzheng Li ; Osamu Yoshie
HIGHLIGHT: In this work, we observe two crucial yet never discussed imbalance issues.
25, TITLE: Learning from Naturalistic Driving Data for Human-like Autonomous Highway Driving
http://arxiv.org/abs/2005.11470
AUTHORS: Donghao Xu ; Zhezhang Ding ; Xu He ; Huijing Zhao ; Mathieu Moze ; François Aioun ; Franck Guillemard
COMMENTS: 14 pages, 9 figures. Submitted to T.ITS
HIGHLIGHT: In this study, a method of learning cost parameters of a motion planner from naturalistic driving data is proposed.
26, TITLE: Misalignment Resilient Diffractive Optical Networks
http://arxiv.org/abs/2005.11464
AUTHORS: Deniz Mengu ; Yifan Zhao ; Nezih T. Yardimci ; Yair Rivenson ; Mona Jarrahi ; Aydogan Ozcan
COMMENTS: 15 Pages, 6 Figures
HIGHLIGHT: Here, we introduce and experimentally demonstrate a new training scheme that significantly increases the robustness of diffractive networks against 3D misalignments and fabrication tolerances in the physical implementation of a trained diffractive network.
27, TITLE: Attention-guided Context Feature Pyramid Network for Object Detection
http://arxiv.org/abs/2005.11475
AUTHORS: Junxu Cao ; Qi Chen ; Jun Guo ; Ruichao Shi
HIGHLIGHT: In this paper, to tackle this issue, we build a novel architecture, called Attention-guided Context Feature Pyramid Network (AC-FPN), that exploits discriminative information from various large receptive fields via integrating attention-guided multi-path features.
28, TITLE: From Witch's Shot to Music Making Bones -- Resources for Medical Laymen to Technical Language and Vice Versa
http://arxiv.org/abs/2005.11494
AUTHORS: Laura Seiffe ; Oliver Marten ; Michael Mikhailov ; Sven Schmeier ; Sebastian Möller ; Roland Roller
COMMENTS: In Proceedings of LREC 2020
HIGHLIGHT: This work presents baseline data sources in order to address this challenge for German. We introduce a new data set which annotates medical laymen and technical expressions in a patient forum, along with a set of medical synonyms and definitions, and present first baseline results on the data.
29, TITLE: Summarizing and Exploring Tabular Data in Conversational Search
http://arxiv.org/abs/2005.11490
AUTHORS: Shuo Zhang ; Zhuyun Dai ; Krisztian Balog ; Jamie Callan
COMMENTS: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2020), 2020
HIGHLIGHT: We propose to generate natural language summaries as answers to describe the complex information contained in a table. Through crowdsourcing experiments, we build a new conversation-oriented, open-domain table summarization dataset.
30, TITLE: Self-Training for Domain Adaptive Scene Text Detection
http://arxiv.org/abs/2005.11487
AUTHORS: Yudi Chen ; Wei Wang ; Yu Zhou ; Fei Yang ; Dongbao Yang ; Weiping Wang
HIGHLIGHT: To address this problem, we propose a self-training framework to automatically mine hard examples with pseudo-labels from unannotated videos or images.
31, TITLE: AnimGAN: A Spatiotemporally-Conditioned Generative Adversarial Network for Character Animation
http://arxiv.org/abs/2005.11489
AUTHORS: Maryam Sadat Mirzaei ; Kourosh Meshgi ; Etienne Frigo ; Toyoaki Nishida
COMMENTS: Submitted to ICIP 2020
HIGHLIGHT: We proposed a spatiotemporally-conditioned GAN that generates a sequence that is similar to a given sequence in terms of semantics and spatiotemporal dynamics.
32, TITLE: Stable Style Transformer: Delete and Generate Approach with Encoder-Decoder for Text Style Transfer
http://arxiv.org/abs/2005.12086
AUTHORS: Joosung Lee
COMMENTS: 9 pages, 3 figures
HIGHLIGHT: In this work, we introduce a method that follows two stages in non-parallel datasets.
33, TITLE: Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
http://arxiv.org/abs/2005.12094
AUTHORS: Daniel Hershcovich ; Miryam de Lhoneux ; Artur Kulmizev ; Elham Pejhan ; Joakim Nivre
COMMENTS: IWPT shared task 2020
HIGHLIGHT: We present K{\o}psala, the Copenhagen-Uppsala system for the Enhanced Universal Dependencies Shared Task at IWPT 2020.
34, TITLE: Knee Point Identification Based on Trade-Off Utility
http://arxiv.org/abs/2005.11600
AUTHORS: Ke Li ; Haifeng Nie ; Huifu Gao ; Xin Yao
HIGHLIGHT: In this paper, we propose a simple and effective knee point identification method based on trade-off utility, dubbed KPITU, to help decision makers identify knee points from a given set of trade-off solutions.
35, TITLE: Geometric algorithms for predicting resilience and recovering damage in neural networks
http://arxiv.org/abs/2005.11603
AUTHORS: Guruprasad Raghavan ; Jiayi Li ; Matt Thomson
COMMENTS: 10 pages and 4 figures
HIGHLIGHT: In this paper, we establish a mathematical framework to analyze the resilience of artificial neural networks through the lens of differential geometry.
36, TITLE: Bayesian Neural Networks at Scale: A Performance Analysis and Pruning Study
http://arxiv.org/abs/2005.11619
AUTHORS: Himanshu Sharma ; Elise Jennings
HIGHLIGHT: We present a performance and scalability comparison of training the VGG-16 and Resnet-18 models on a Cray-XC40 cluster.
37, TITLE: One-Shot Unsupervised Cross-Domain Detection
http://arxiv.org/abs/2005.11610
AUTHORS: Antonio D'Innocente ; Francesco Cappio Borlino ; Silvia Bucci ; Barbara Caputo ; Tatiana Tommasi
HIGHLIGHT: This paper addresses this setting, presenting an object detection algorithm able to perform unsupervised adaption across domains by using only one target sample, seen at test time.
38, TITLE: ShapeAdv: Generating Shape-Aware Adversarial 3D Point Clouds
http://arxiv.org/abs/2005.11626
AUTHORS: Kibok Lee ; Zhuoyuan Chen ; Xinchen Yan ; Raquel Urtasun ; Ersin Yumer
COMMENTS: 3D Point Clouds, Adversarial Learning
HIGHLIGHT: We introduce ShapeAdv, a novel framework to study shape-aware adversarial perturbations that reflect the underlying shape variations (e.g., geometric deformations and structural differences) in the 3D point cloud space.
39, TITLE: Unsupervised Geometric Disentanglement for Surfaces via CFAN-VAE
http://arxiv.org/abs/2005.11622
AUTHORS: N. Joseph Tatro ; Stefan C. Schonsheck ; Rongjie Lai
HIGHLIGHT: This work introduces a novel mesh feature, the conformal factor and normal feature (CFAN), for use in mesh convolutional autoencoders.
40, TITLE: RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images
http://arxiv.org/abs/2005.11623
AUTHORS: Zhihao Duan ; M. Ozan Tezcan ; Hayato Nakamura ; Prakash Ishwar ; Janusz Konrad
COMMENTS: CVPR 2020 OmniCV Workshop paper extended version
HIGHLIGHT: In this work, we develop an end-to-end rotation-aware people detection method, named RAPiD, that detects people using arbitrarily-oriented bounding boxes. We have also created a new dataset with spatio-temporal annotations of rotated bounding boxes, for people detection as well as other vision tasks in overhead fisheye videos.
41, TITLE: Effective and Efficient Computation with Multiple-timescale Spiking Recurrent Neural Networks
http://arxiv.org/abs/2005.11633
AUTHORS: Bojian Yin ; Federico Corradi ; Sander M. Bohté
COMMENTS: 11 pages,5 figures
HIGHLIGHT: From this, we calculate a $>$100x energy improvement for our SRNNs over classical RNNs on the harder tasks.
42, TITLE: PoliteCamera: Respecting Strangers' Privacy in Mobile Photographing
http://arxiv.org/abs/2005.11634
AUTHORS: Ang Li ; Wei Du ; Qinghua Li
HIGHLIGHT: In this paper, we propose a cooperative mobile photographing scheme called PoliteCamera to protect strangers' privacy.
43, TITLE: MVStylizer: An Efficient Edge-Assisted Video Photorealistic Style Transfer System for Mobile Phones
http://arxiv.org/abs/2005.11630
AUTHORS: Ang Li ; Chunpeng Wu ; Yiran Chen ; Bin Ni
HIGHLIGHT: To address this challenge, we propose MVStylizer, an efficient edge-assisted photorealistic video style transfer system for mobile phones.
44, TITLE: miniKanren as a Tool for Symbolic Computation in Python
http://arxiv.org/abs/2005.11644
AUTHORS: Brandon T. Willard
HIGHLIGHT: In this article, we give a brief overview of the current state and future potential of symbolic computation within the Python statistical modeling and machine learning community.
45, TITLE: Master-Auxiliary: an efficient aggregation strategy for video anomaly detection
http://arxiv.org/abs/2005.11645
AUTHORS: Zhiguo Wang ; Zhongliang Yang ; Yujin Zhang
COMMENTS: 8 pages
HIGHLIGHT: This paper proposes an efficient strategy to aggregate multiple detectors together.
46, TITLE: Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
http://arxiv.org/abs/2005.11640
AUTHORS: Chen Liu ; Su Zhu ; Zijian Zhao ; Ruisheng Cao ; Lu Chen ; Kai Yu
COMMENTS: Submitted to INTERSPEECH 2020
HIGHLIGHT: In this paper, a novel BERT based SLU model (WCN-BERT SLU) is proposed to encode WCNs and the dialogue context jointly.
47, TITLE: Robust Object Detection under Occlusion with \\Context-Aware CompositionalNets
http://arxiv.org/abs/2005.11643
AUTHORS: Angtian Wang ; Yihong Sun ; Adam Kortylewski ; Alan Yuille
COMMENTS: Accepted to CVPR 2020
HIGHLIGHT: In this work, we propose to overcome two limitations of CompositionalNets which will enable them to detect partially occluded objects: 1) CompositionalNets, as well as other DCNN architectures, do not explicitly separate the representation of the context from the object itself.
48, TITLE: A Note on the Concrete Hardness of the Shortest Independent Vectors Problem in Lattices
http://arxiv.org/abs/2005.11654
AUTHORS: Divesh Aggarwal ; Eldon Chung
HIGHLIGHT: In order to formally define this requirement on the $\mathsf{CVP}$ instance, we introduce a new computational problem called the Gap Closest Vector Problem with Bounded Minima.
49, TITLE: Self-supervised Robust Object Detectors from Partially Labelled datasets
http://arxiv.org/abs/2005.11549
AUTHORS: Mahdieh Abbasi ; Denis Laurendeau ; Christian Gagne
HIGHLIGHT: With the goal of training \emph{one integrated robust object detector with high generalization performance}, we propose a training framework to overcome missing-label challenge of the merged datasets.
50, TITLE: ProAlignNet : Unsupervised Learning for Progressively Aligning Noisy Contours
http://arxiv.org/abs/2005.11546
AUTHORS: VSR Veeravasarapu ; Abhishek Goel ; Deepak Mittal ; Maneesh Singh
COMMENTS: Accepted at CVPR 2020
HIGHLIGHT: This work presents a novel ConvNet, "ProAlignNet" that accounts for large scale misalignments and complex transformations between the contour shapes.
51, TITLE: Finding Small Satisfying Assignments Faster Than Brute Force: A Fine-grained Perspective into Boolean Constraint Satisfaction
http://arxiv.org/abs/2005.11541
AUTHORS: Marvin Künnemann ; Dániel Marx
HIGHLIGHT: More precisely, we aim to determine, for any finite constraint family, the optimal running time $f(k)n^{g(k)}$ required to find satisfying assignments that set precisely $k$ of the $n$ variables to $1$.
52, TITLE: Invariant 3D Shape Recognition using Predictive Modular Neural Networks
http://arxiv.org/abs/2005.11558
AUTHORS: Vasileios Petridis
COMMENTS: 17 pages, 2 figures
HIGHLIGHT: The methods presented in this paper can be applied to many problems such as gesture recognition, action recognition, dynamic texture recognition etc.
53, TITLE: Underwater object detection using Invert Multi-Class Adaboost with deep learning
http://arxiv.org/abs/2005.11552
AUTHORS: Long Chen ; Zhihua Liu ; Lei Tong ; Zheheng Jiang ; Shengke Wang ; Junyu Dong ; Huiyu Zhou
HIGHLIGHT: In recent years, deep learning based methods have achieved promising performance in standard object detection.
54, TITLE: Evolution of Cooperative Hunting in Artificial Multi-layered Societies
http://arxiv.org/abs/2005.11580
AUTHORS: Honglin Bao ; Wolfgang Banzhaf
COMMENTS: 29 pages, 6 figures, an extension of our ALife 2018 conference paper
HIGHLIGHT: In this paper, an agent-based model is proposed to study the evolution of cooperative hunting behaviors in an artificial society.
55, TITLE: Hierarchical Feature Embedding for Attribute Recognition
http://arxiv.org/abs/2005.11576
AUTHORS: Jie Yang ; Jiarou Fan ; Yiru Wang ; Yige Wang ; Weihao Gan ; Lin Liu ; Wei Wu
COMMENTS: CVPR 2020
HIGHLIGHT: To address this problem, we propose a hierarchical feature embedding (HFE) framework, which learns a fine-grained feature embedding by combining attribute and ID information.
56, TITLE: Revisiting Street-to-Aerial View Image Geo-localization and Orientation Estimation
http://arxiv.org/abs/2005.11592
AUTHORS: Sijie Zhu ; Taojiannan Yang ; Chen Chen
HIGHLIGHT: In this paper, we revisit this problem and point out the ignored issue about image alignment information.
57, TITLE: MaxSAT Resolution and Subcube Sums
http://arxiv.org/abs/2005.11589
AUTHORS: Yuval Filmus ; Meena Mahajan ; Gaurav Sood ; Marc Vinyals
HIGHLIGHT: We study the MaxRes rule in the context of certifying unsatisfiability.
58, TITLE: A Preliminary Study for Identification of Additive Manufactured Objects with Transmitted Images
http://arxiv.org/abs/2005.12027
AUTHORS: Kenta Yamamoto ; Ryota Kawamura ; Kazuki Takazawa ; Hiroyuki Osone ; Yoichi Ochiai
HIGHLIGHT: In this study, we have developed a product identification system that does not require embedding barcodes inside.
59, TITLE: A Joint Pixel and Feature Alignment Framework for Cross-dataset Palmprint Recognition
http://arxiv.org/abs/2005.12044
AUTHORS: Huikai Shao ; Dexing Zhong
COMMENTS: 12 pages, 7 figures
HIGHLIGHT: Therefore, we propose a novel Joint Pixel and Feature Alignment (JPFA) framework for such cross-dataset palmprint recognition scenarios.
60, TITLE: Knowledge Graph Simple Question Answering for Unseen Domains
http://arxiv.org/abs/2005.12040
AUTHORS: Georgios Sidiropoulos ; Nikos Voskarides ; Evangelos Kanoulas
COMMENTS: Accepted at AKBC 2020
HIGHLIGHT: We propose a data-centric domain adaptation framework that consists of a KGSQA system that is applicable to new domains, and a sequence to sequence question generation method that automatically generates question-answer pairs for the new domain.
61, TITLE: An interpretable automated detection system for FISH-based HER2 oncogene amplification testing in histo-pathological routine images of breast and gastric cancer diagnostics
http://arxiv.org/abs/2005.12066
AUTHORS: Sarah Schmell ; Falk Zakrzewski ; Walter de Back ; Martin Weigert ; Uwe Schmidt ; Torsten Wenke ; Silke Zeugner ; Robert Mantey ; Christian Sperling ; Ingo Roeder ; Pia Hoenscheid ; Daniela Aust ; Gustavo Baretton
HIGHLIGHT: Here, we develop an interpretable, deep learning (DL)-based pipeline which automates the evaluation of FISH images with respect to HER2 gene amplification testing.
62, TITLE: Combinatorics of a Discrete Trajectory Space for Robot Motion Planning
http://arxiv.org/abs/2005.12064
AUTHORS: Felix Wiebe ; Shivesh Kumar ; Daniel Harnack ; Malte Langosz ; Hendrik Wöhrle ; Frank Kirchner
COMMENTS: 8 pages, 3 figures, to be published in the proceedings of 2nd IMA Conference on Mathematics of Robotics 2021
HIGHLIGHT: Using lattice path methods, we provide estimates for the complexity of motion planning by counting the number of possible trajectories in a discrete robot configuration space.
63, TITLE: Happy Are Those Who Grade without Seeing: A Multi-Task Learning Approach to Grade Essays Using Gaze Behaviour
http://arxiv.org/abs/2005.12078
AUTHORS: Sandeep Mathias ; Rudra Murthy ; Diptesh Kanojia ; Abhijit Mishra ; Pushpak Bhattacharyya
HIGHLIGHT: In this paper, we propose a way to improve automatic essay grading using gaze behaviour, where the gaze features are learnt at run time using a multi-task learning framework.
64, TITLE: Visual Attention: Deep Rare Features
http://arxiv.org/abs/2005.12073
AUTHORS: Mancas Matei ; Kong Phutphalla ; Gosselin Bernard
COMMENTS: 6 pages, double-colmun, accepted to IVPR2020
HIGHLIGHT: In this paper, we propose a model called DeepRare2019 (DR) which uses the power of DNNs feature extraction and the genericity of feature-engineered algorithms.
65, TITLE: Egocentric Human Segmentation for Mixed Reality
http://arxiv.org/abs/2005.12074
AUTHORS: Andrija Gajic ; Ester Gonzalez-Sosa ; Diego Gonzalez-Morin ; Marcos Escudero-Viñolo ; Alvaro Villegas
COMMENTS: Accepted for presentation at EPIC@CVPR2020 workshop
HIGHLIGHT: The objective of this work is to segment human body parts from egocentric video using semantic segmentation networks. Our contribution is two-fold: i) we create a semi-synthetic dataset composed of more than 15, 000 realistic images and associated pixel-wise labels of egocentric human body parts, such as arms or legs including different demographic factors; ii) building upon the ThunderNet architecture, we implement a deep learning semantic segmentation algorithm that is able to perform beyond real-time requirements (16 ms for 720 x 720 images).
66, TITLE: Policy Entropy for Out-of-Distribution Classification
http://arxiv.org/abs/2005.12069
AUTHORS: Andreas Sedlmeier ; Robert Müller ; Steffen Illium ; Claudia Linnhoff-Popien
HIGHLIGHT: In this work, we propose PEOC, a new policy entropy based out-of-distribution classifier that reliably detects unencountered states in deep reinforcement learning.
67, TITLE: A Novel Distributed Representation of News (DRNews) for Stock Market Predictions
http://arxiv.org/abs/2005.11706
AUTHORS: Ye Ma ; Lu Zong ; Peiwan Wang
COMMENTS: 25 pages
HIGHLIGHT: In this study, a novel Distributed Representation of News (DRNews) model is developed and applied in deep learning-based stock market predictions.
68, TITLE: When does MAML Work the Best? An Empirical Study on Model-Agnostic Meta-Learning in NLP Applications
http://arxiv.org/abs/2005.11700
AUTHORS: Zequn Liu ; Ruiyi Zhang ; Yiping Song ; Ming Zhang
HIGHLIGHT: In this paper, we conduct an empirical study to investigate these impacting factors and conclude when MAML works the best based on the experimental results.
69, TITLE: A Lightweight CNN and Joint Shape-Joint Space (JS2) Descriptor for Radiological Osteoarthritis Detection
http://arxiv.org/abs/2005.11715
AUTHORS: Neslihan Bayramoglu ; Miika T. Nieminen ; Simo Saarakkala
COMMENTS: MIUA2020
HIGHLIGHT: In this study, we propose a fully automated novel method, based on combination of joint shape and convolutional neural network (CNN) based bone texture features, to distinguish between the knee radiographs with and without radiographic osteoarthritis.
70, TITLE: Multi-view Alignment and Generation in CCA via Consistent Latent Encoding
http://arxiv.org/abs/2005.11716
AUTHORS: Yaxin Shi ; Yuangang Pan ; Donna Xu ; Ivor W. Tsang
COMMENTS: 37 pages, 22 figures
HIGHLIGHT: Delving into the impairments of inconsistent encodings, we propose to recover correspondence of the multi-view inputs by matching the marginalization of the joint distribution of multi-view random variables under different forms of factorization.
71, TITLE: Featherweight Go
http://arxiv.org/abs/2005.11710
AUTHORS: Robert Griesemer ; Raymond Hu ; Wen Kokke ; Julien Lange ; Ian Lance Taylor ; Bernardo Toninho ; Philip Wadler ; Nobuko Yoshida
COMMENTS: a full version
HIGHLIGHT: We describe a design for generics in Go inspired by previous work on Featherweight Java by Igarashi, Pierce, and Wadler.
72, TITLE: Learning Camera Miscalibration Detection
http://arxiv.org/abs/2005.11711
AUTHORS: Andrei Cramariuc ; Aleksandar Petrov ; Rohit Suri ; Mayank Mittal ; Roland Siegwart ; Cesar Cadena
COMMENTS: ICRA 2020
HIGHLIGHT: Our contributions include a proposed miscalibration metric for RGB cameras and a novel semi-synthetic dataset generation pipeline based on this metric.
73, TITLE: GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
http://arxiv.org/abs/2005.11729
AUTHORS: Jianfeng Liu ; Feiyang Pan ; Ling Luo
HIGHLIGHT: In this paper, we propose Goal-oriented Chatbots (GoChat), a framework for end-to-end training chatbots to maximize the longterm return from offline multi-turn dialogue datasets.
74, TITLE: Query Resolution for Conversational Search with Limited Supervision
http://arxiv.org/abs/2005.11723
AUTHORS: Nikos Voskarides ; Dan Li ; Pengjie Ren ; Evangelos Kanoulas ; Maarten de Rijke
COMMENTS: SIGIR 2020 full conference paper
HIGHLIGHT: In this work we focus on multi-turn passage retrieval as a crucial component of conversational search.
75, TITLE: Adversarial NLI for Factual Correctness in Text Summarisation Models
http://arxiv.org/abs/2005.11739
AUTHORS: Mario Barrantes ; Benedikt Herudek ; Richard Wang
HIGHLIGHT: We apply the Adversarial NLI dataset to train the NLI model and show that the model has the potential to enhance factual correctness in abstract summarization.
76, TITLE: Automatic Discovery of Interpretable Planning Strategies
http://arxiv.org/abs/2005.11730
AUTHORS: Julian Skirzynski ; Frederic Becker ; Falk Lieder
COMMENTS: Submitted to the Special Issue on Reinforcement Learning for Real Life in Machine Learning Journal. Code available at https://github.com/RationalityEnhancement/InterpretableStrategyDiscovery
HIGHLIGHT: We conclude that the methods and findings presented in this article are an important step towards leveraging automatic strategy discovery to improve human decision-making.
77, TITLE: Domain Specific, Semi-Supervised Transfer Learning for Medical Imaging
http://arxiv.org/abs/2005.11746
AUTHORS: Jitender Singh Virk ; Deepti R. Bathula
COMMENTS: 9 pages 4 figures
HIGHLIGHT: Consequently, we propose a lightweight architecture that uses mixed asymmetric kernels (MAKNet) to reduce the number of parameters significantly.
78, TITLE: High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling
http://arxiv.org/abs/2005.11742
AUTHORS: Yu Zeng ; Zhe Lin ; Jimei Yang ; Jianming Zhang ; Eli Shechtman ; Huchuan Lu
HIGHLIGHT: To address this challenge, we propose an iterative inpainting method with a feedback mechanism. Furthermore, to mimic real object removal scenarios, we collect a large object mask dataset and synthesize more realistic training data that better simulates user inputs.
79, TITLE: On the impact of treewidth in the computational complexity of freezing dynamics
http://arxiv.org/abs/2005.11758
AUTHORS: Eric Goles ; Pedro Montealegre ; Martín Ríos-Wilson ; Guillaume Theyssier
HIGHLIGHT: In this paper we establish how treewidth and maximum degree of the underlying graph are key parameters which influence the overall computational complexity of finite freezing automata networks.
80, TITLE: Lite Audio-Visual Speech Enhancement
http://arxiv.org/abs/2005.11769
AUTHORS: Shang-Yi Chuang ; Yu Tsao ; Chen-Chou Lo ; Hsin-Min Wang
COMMENTS: Submitted to Interspeech 2020
HIGHLIGHT: In this study, we propose a Lite AVSE (LAVSE) system to address these problems.
81, TITLE: The Weisfeiler-Leman dimension of distance-hereditary graphs
http://arxiv.org/abs/2005.11766
AUTHORS: Alexander L. Gavrilyuk ; Roman Nedela ; Ilia Ponomarenko
HIGHLIGHT: We prove that the ordinary Weisfeiler-Leman algorithm correctly tests the isomorphism of any two graphs if one of them is distance-hereditary; more precisely, the Weisfeiler-Leman dimension of the class of finite distance-hereditary graphs is equal to $2$.
82, TITLE: KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for Comprehension And Generation
http://arxiv.org/abs/2005.11768
AUTHORS: Jiajing Wan ; Xinting Huang
COMMENTS: 6 pages, 1 figure
HIGHLIGHT: This paper presents our strategies in SemEval 2020 Task 4: Commonsense Validation and Explanation.
83, TITLE: Deep Convolutional Neural Network-based Bernoulli Heatmap for Head Pose Estimation
http://arxiv.org/abs/2005.11780
AUTHORS: Zhongxu Hu ; Yang Xing ; Chen Lv ; Peng Hang ; Jie Liu
HIGHLIGHT: This paper proposes a novel Bernoulli heatmap for head pose estimation from a single RGB image.
84, TITLE: Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection
http://arxiv.org/abs/2005.11777
AUTHORS: Murong Ma ; Haiwei Wu ; Xuyang Wang ; Lin Yang ; Junjie Wang ; Ming Li
HIGHLIGHT: In this paper, we propose a deep convolutional neural network-based acoustic word embedding system on code-switching query by example spoken term detection.
85, TITLE: Deep learning approach to describe and classify fungi microscopic images
http://arxiv.org/abs/2005.11772
AUTHORS: Bartosz Zieliński ; Agnieszka Sroka-Oleksiak ; Dawid Rymarczyk ; Adam Piekarczyk ; Monika Brzychczy-Włoch
HIGHLIGHT: In this paper, we apply a machine learning approach based on deep neural networks and Fisher Vector (advanced bag-of-words method) to classify microscopic images of various fungi species.
86, TITLE: Benefits of temporal information for appearance-based gaze estimation
http://arxiv.org/abs/2005.11670
AUTHORS: Cristina Palmero ; Oleg V. Komogortsev ; Sachin S. Talathi
COMMENTS: In ACM Symposium on Eye Tracking Research & Applications (ETRA), 2020
HIGHLIGHT: In this paper, we investigate whether temporal sequences of eye images, captured using a high-resolution, high-frame rate head-mounted virtual reality system, can be leveraged to enhance the accuracy of an end-to-end appearance-based deep-learning model for gaze estimation.
87, TITLE: Quadratic Sieve Factorization Quantum Algorithm and its Simulation
http://arxiv.org/abs/2005.11668
AUTHORS: Amandeep Singh Bhatia ; Ajay Kumar
COMMENTS: 8 pages, 3 figures
HIGHLIGHT: In this paper, we have designed a quantum variant of the second fastest classical factorization algorithm named "Quadratic Sieve".
88, TITLE: A Question Type Driven and Copy Loss Enhanced Frameworkfor Answer-Agnostic Neural Question Generation
http://arxiv.org/abs/2005.11665
AUTHORS: Xiuyu Wu ; Nan Jiang ; Yunfang Wu
HIGHLIGHT: In this paper, we propose two new strategies to deal with this task: question type prediction and copy loss mechanism.
89, TITLE: Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques
http://arxiv.org/abs/2005.11682
AUTHORS: Thomas Drugman ; Thomas Dubuisson ; Alexis Moinet ; Nicolas D'Alessandro ; Thierry Dutoit
HIGHLIGHT: This paper addresses the problem of estimating the voice source directly from speech waveforms.
90, TITLE: Networks with pixels embedding: a method to improve noise resistance in images classification
http://arxiv.org/abs/2005.11679
AUTHORS: Chi-Chun Zhou ; Hai-Long Tu ; Yi Liua ; Fu-Lin Zhang
HIGHLIGHT: In this work, we provide a noise-resistance network in images classification by introducing a technique of pixels embedding.
91, TITLE: Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
http://arxiv.org/abs/2005.11676
AUTHORS: Andros Tjandra ; Sakriani Sakti ; Satoshi Nakamura
COMMENTS: Submitted to INTERSPEECH 2020
HIGHLIGHT: In this paper, we report our submitted system for the ZeroSpeech 2020 challenge on Track 2019.
92, TITLE: Integrated Node Encoder for Labelled Textual Networks
http://arxiv.org/abs/2005.11694
AUTHORS: Ye Ma ; Lu Zong
COMMENTS: 7 pages
HIGHLIGHT: In this study, we design an integrated node encoder (INE) for textual networks which is jointly trained on the structure-based and label-based objectives.
93, TITLE: Derivation of Symmetric PCA Learning Rules from a Novel Objective Function
http://arxiv.org/abs/2005.11689
AUTHORS: Ralf Möller
HIGHLIGHT: Here we introduce an alternative objective function where it is not necessary to introduce fixed weight factors; instead, the alternative objective function uses squared summands.
94, TITLE: MASK: A flexible framework to facilitate de-identification of clinical texts
http://arxiv.org/abs/2005.11687
AUTHORS: Nikola Milosevic ; Gangamma Kalappa ; Hesam Dadafarin ; Mahmoud Azimaee ; Goran Nenadic
HIGHLIGHT: In this paper, we present MASK, a software package that is designed to perform the de-identification task.
95, TITLE: Physics-based polynomial neural networks for one-short learning of dynamical systems from one or a few samples
http://arxiv.org/abs/2005.11699
AUTHORS: Andrei Ivanov ; Uwe Iben ; Anna Golovkina
HIGHLIGHT: The paper describes practical results on real experiments with both a simple pendulum and one of the largest worldwide X-ray source.
96, TITLE: The efficiency of deep learning algorithms for detecting anatomical reference points on radiological images of the head profile
http://arxiv.org/abs/2005.12110
AUTHORS: Konstantin Dobratulin ; Andrey Gaidel ; Irina Aupova ; Anna Ivleva ; Aleksandr Kapishnikov ; Pavel Zelter
HIGHLIGHT: In this article we investigate the efficiency of deep learning algorithms in solving the task of detecting anatomical reference points on radiological images of the head in lateral projection using a fully convolutional neural network and a fully convolutional neural network with an extended architecture for biomedical image segmentation - U-Net.
97, TITLE: NILE : Natural Language Inference with Faithful Natural Language Explanations
http://arxiv.org/abs/2005.12116
AUTHORS: Sawan Kumar ; Partha Talukdar
COMMENTS: 13 pages, 3 figures, Accepted to ACL 2020
HIGHLIGHT: We propose Natural-language Inference over Label-specific Explanations (NILE), a novel NLI method which utilizes auto-generated label-specific NL explanations to produce labels along with its faithful explanation.
98, TITLE: Degree-Aware Alignment for Entities in Tail
http://arxiv.org/abs/2005.12132
AUTHORS: Weixin Zeng ; Xiang Zhao ; Wei Wang ; Jiuyang Tang ; Zhen Tan
COMMENTS: Accepted by SIGIR 2020
HIGHLIGHT: For pre-alignment, we propose to amplify long-tail entities, which are of relatively weak structural information, with entity name information that is generally available (but overlooked) in the form of concatenated power mean word embeddings.
99, TITLE: Learning to Simulate Dynamic Environments with GameGAN
http://arxiv.org/abs/2005.12126
AUTHORS: Seung Wook Kim ; Yuhao Zhou ; Jonah Philion ; Antonio Torralba ; Sanja Fidler
COMMENTS: CVPR 2020
HIGHLIGHT: In this paper, we aim to learn a simulator by simply watching an agent interact with an environment.
100, TITLE: An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
http://arxiv.org/abs/2005.12142
AUTHORS: Chia-Chih Kuo ; Shang-Bao Luo ; Kuan-Yu Chen
HIGHLIGHT: Consequently, an audio-enriched BERT-based SMCQA framework is proposed.
101, TITLE: Adapting End-to-End Speech Recognition for Readable Subtitles
http://arxiv.org/abs/2005.12143
AUTHORS: Danni Liu ; Jan Niehues ; Gerasimos Spanakis
COMMENTS: IWSLT 2020
HIGHLIGHT: The experiments show that with limited data far less than needed for training a model from scratch, we can adapt a Transformer-based ASR model to incorporate both transcription and compression capabilities.
102, TITLE: AGVNet: Attention Guided Velocity Learning for 3D Human Motion Prediction
http://arxiv.org/abs/2005.12155
AUTHORS: Xiaoli Liu ; Jianqin Yin ; Jun Liu
HIGHLIGHT: In this paper, we propose a novel feedforward network, AGVNet (Attention Guided Velocity Learning Network), to predict future poses, which explicitly models the velocities at both Encoder and Decoder.
103, TITLE: NENET: An Edge Learnable Network for Link Prediction in Scene Text
http://arxiv.org/abs/2005.12147
AUTHORS: Mayank Kumar Singh ; Sayan Banerjee ; Shubhasis Chaudhuri
COMMENTS: 9 pages
HIGHLIGHT: This necessitates the need to link adjacent characters, which we propose in this paper using a novel Graph Neural Network (GNN) architecture that allows us to learn both node and edge features as opposed to only the node features under the typical GNN.
104, TITLE: Formal Methods with a Touch of Magic
http://arxiv.org/abs/2005.12175
AUTHORS: Parand Alizadeh Alamdari ; Guy Avni ; Thomas A. Henzinger ; Anna Lukina
HIGHLIGHT: In this work, we address the controller-design problem with a combination of techniques from both fields.
105, TITLE: Depth-2 QAC circuits cannot simulate quantum parity
http://arxiv.org/abs/2005.12169
AUTHORS: Daniel Padé ; Stephen Fenner ; Daniel Grier ; Thomas Thierauf
COMMENTS: 21 pages, 2 figures
HIGHLIGHT: We show that the quantum parity gate on $n > 3$ qubits cannot be cleanly simulated by a quantum circuit with two layers of arbitrary C-SIGN gates of any arity and arbitrary 1-qubit unitary gates, regardless of the number of allowed ancilla qubits.
106, TITLE: AMR quality rating with a lightweight CNN
http://arxiv.org/abs/2005.12187
AUTHORS: Juri Opitz
HIGHLIGHT: In our experiments, we show that the method can rate the quality of AMR graphs more accurately than a strong baseline, with respect to several dimensions of interest.
107, TITLE: Automating the Surveillance of Mosquito Vectors from Trapped Specimens Using Computer Vision Techniques
http://arxiv.org/abs/2005.12188
AUTHORS: Mona Minakshi ; Pratool Bharti ; Willie B. McClinton III ; Jamshidbek Mirzakhalov ; Ryan M. Carney ; Sriram Chellappan
HIGHLIGHT: This paper presents an innovative solution to this problem.
108, TITLE: Monitoring and Diagnosability of Perception Systems
http://arxiv.org/abs/2005.11816
AUTHORS: Pasquale Antonante ; David I. Spivak ; Luca Carlone
HIGHLIGHT: In this work, we propose a mathematical model for runtime monitoring and fault detection of perception systems.
109, TITLE: Learning visual servo policies via planner cloning
http://arxiv.org/abs/2005.11810
AUTHORS: Ulrich Viereck ; Kate Saenko ; Robert Platt
HIGHLIGHT: We propose Penalized Q Cloning (PQC), a new behavior cloning algorithm.
110, TITLE: Recognizing Families through Images with Pretrained Encoder
http://arxiv.org/abs/2005.11811
AUTHORS: Tuan-Duy H. Nguyen ; Huu-Nghia H. Nguyen ; Hieu Dao
COMMENTS: Will appear as part of RFIW2020 in the Proceedings of 2020 International Conference on Automatic Face and Gesture Recognition (IEEE AMFG)
HIGHLIGHT: We employ 3 methods, FaceNet, Siamese VGG-Face, and a combination of FaceNet and VGG-Face models as feature extractors, to achieve the 9th standing for kinship verification and the 5th standing for kinship retrieval in the Recognizing Family in The Wild 2020 competition.
111, TITLE: A Proof Assistant Based Formalisation of Core Erlang
http://arxiv.org/abs/2005.11821
AUTHORS: Péter Bereczky ; Dániel Horpácsi ; Simon Thompson
COMMENTS: 21st International Symposium on Trends in Functional Programming
HIGHLIGHT: In this paper, we present our proof-assistant-based formalisation of a subset of Erlang, intended to serve as a base for proving refactorings correct.
112, TITLE: Tezla, an Intermediate Representation for Static Analysis of Michelson Smart Contracts
http://arxiv.org/abs/2005.11839
AUTHORS: João Santos Reis ; Paul Crocker ; Simão Melo de Sousa
HIGHLIGHT: This paper introduces Tezla, an intermediate representation of Michelson smart contracts that eases the design of static smart contract analysers.
113, TITLE: How Does That Sound? Multi-Language SpokenName2Vec Algorithm Using Speech Generation and Deep Learning
http://arxiv.org/abs/2005.11838
AUTHORS: Aviad Elyashar ; Rami Puzis ; Michael Fire
HIGHLIGHT: In this paper, we propose SpokenName2Vec, a novel and generic approach which addresses the similar name suggestion problem by utilizing automated speech generation, and deep learning to produce spoken name embeddings.
114, TITLE: Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model
http://arxiv.org/abs/2005.11849
AUTHORS: Satoru Katsumata ; Mamoru Komachi
HIGHLIGHT: In this study, we explored two generic pretrained encoder-decoder (Enc-Dec) models, including BART, which reported the state-of-the-art (SOTA) results for several Seq2Seq tasks other than GEC.
115, TITLE: An efficient iterative method for reconstructing surface from point clouds
http://arxiv.org/abs/2005.11864
AUTHORS: Dong Wang
COMMENTS: 23 pages, 15 figures
HIGHLIGHT: In this paper, we develop an efficient iterative method on a variational model for the surface reconstruction from point clouds.
116, TITLE: ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
http://arxiv.org/abs/2005.11861
AUTHORS: Maha Elbayad ; Ha Nguyen ; Fethi Bougares ; Natalia Tomashenko ; Antoine Caubrière ; Benjamin Lecouteux ; Yannick Estève ; Laurent Besacier
HIGHLIGHT: We propose an algorithm to control the latency of the ASR+MT cascade and achieve a good latency-quality trade-off on both subtasks.
117, TITLE: Bayesian Conditional GAN for MRI Brain Image Synthesis
http://arxiv.org/abs/2005.11875
AUTHORS: Gengyan Zhaoa ; Mary E. Meyerand ; Rasmus M. Birn
COMMENTS: 26 pages, 7 figures
HIGHLIGHT: In this work, we propose to use Bayesian conditional generative adversarial network (GAN) with concrete dropout to improve image synthesis accuracy.
118, TITLE: Novel Human-Object Interaction Detection via Adversarial Domain Generalization
http://arxiv.org/abs/2005.11406
AUTHORS: Yuhang Song ; Wenbo Li ; Lei Zhang ; Jianwei Yang ; Emre Kiciman ; Hamid Palangi ; Jianfeng Gao ; C. -C. Jay Kuo ; Pengchuan Zhang
HIGHLIGHT: We study in this paper the problem of novel human-object interaction (HOI) detection, aiming at improving the generalization ability of the model to unseen scenarios.
119, TITLE: AutoSUM: Automating Feature Extraction and Multi-user Preference Simulation for Entity Summarization
http://arxiv.org/abs/2005.11888
AUTHORS: Dongjun Wei ; Yaxin Liu ; Fuqing Zhu ; Liangjun Zang ; Wei Zhou ; Yijun Lu ; Songlin Hu
COMMENTS: 11 pages, accepted in PAKDD'2020
HIGHLIGHT: In this paper, a novel integration method called AutoSUM is proposed for automatic feature extraction and multi-user preference simulation to overcome the drawbacks of previous methods.
120, TITLE: One of these (Few) Things is Not Like the Others
http://arxiv.org/abs/2005.11405
AUTHORS: Nat Roth ; Justin Wagle
HIGHLIGHT: We propose a model which can both classify new images based on a small number of examples and recognize images which do not belong to any previously seen group.
121, TITLE: Sentiment Analysis: Detecting Valence, Emotions, and Other Affectual States from Text
http://arxiv.org/abs/2005.11882
AUTHORS: Saif M. Mohammad
COMMENTS: This is the author's manuscript of what is slated to appear in the Second Edition of Emotion Measurement, 2020
HIGHLIGHT: This article presents a sweeping overview of sentiment analysis research that includes: the origins of the field, the rich landscape of tasks, challenges, a survey of the methods and resources used, and applications.
122, TITLE: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
http://arxiv.org/abs/2005.11401
AUTHORS: Patrick Lewis ; Ethan Perez ; Aleksandara Piktus ; Fabio Petroni ; Vladimir Karpukhin ; Naman Goyal ; Heinrich Küttler ; Mike Lewis ; Wen-tau Yih ; Tim Rocktäschel ; Sebastian Riedel ; Douwe Kiela
HIGHLIGHT: We introduce RAG models where the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia, accessed with a pre-trained neural retriever.
123, TITLE: Approaching Bio Cellular Classification for Malaria Infected Cells Using Machine Learning and then Deep Learning to compare & analyze K-Nearest Neighbours and Deep CNNs
http://arxiv.org/abs/2005.11417
AUTHORS: Rishabh Malhotra ; Dhron Joshi ; Ku Young Shin
COMMENTS: 7 Pages
HIGHLIGHT: Here, we implement two models of classification; a convolutional neural network, and the k nearest neighbours algorithm.
124, TITLE: Reinforcement Learning with Iterative Reasoning for Merging in Dense Traffic
http://arxiv.org/abs/2005.11895
AUTHORS: Maxime Bouton ; Alireza Nakhaei ; David Isele ; Kikuo Fujimura ; Mykel J. Kochenderfer
COMMENTS: 6pages, 5 figures
HIGHLIGHT: In this work, we propose a combination of reinforcement learning and game theory to learn merging behaviors.
125, TITLE: Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
http://arxiv.org/abs/2005.11787
AUTHORS: Anne Lauscher ; Olga Majewska ; Leonardo F. R. Ribeiro ; Iryna Gurevych ; Nikolai Rozanov ; Goran Glavaš
HIGHLIGHT: In this work, we investigate models for complementing the distributional knowledge of BERT with conceptual knowledge from ConceptNet and its corresponding Open Mind Common Sense (OMCS) corpus, respectively, using adapter training.
126, TITLE: Vision-based control of a knuckle boom crane with online cable length estimation
http://arxiv.org/abs/2005.11794
AUTHORS: Geir Ole Tysse ; Andrej Cibicik ; Olav Egeland
COMMENTS: This paper was first submitted to IEEE/ASME Transactions on Mechatronics
HIGHLIGHT: The length of the payload cable is also estimated using a least squares technique with projection.
127, TITLE: Walrasian Equilibria in Markets with Small Demands
http://arxiv.org/abs/2005.11796
AUTHORS: Argyrios Deligkas ; Themistoklis Melissourgos ; Paul G. Spirakis
HIGHLIGHT: Put differently, we give the first parallel algorithm that finds a Walrasian equilibrium in polylogarithmic time.
128, TITLE: Functional Space Variational Inference for Uncertainty Estimation in Computer Aided Diagnosis
http://arxiv.org/abs/2005.11797
AUTHORS: Pranav Poduval ; Hrushikesh Loya ; Amit Sethi
COMMENTS: Meaningful priors on the functional space rather than the weight space, result in well calibrated uncertainty estimates
HIGHLIGHT: In this work, by taking skin lesion classification as an example task, we show that by shifting Bayesian inference to the functional space we can craft meaningful priors that give better calibrated uncertainty estimates at a much lower computational cost.
129, TITLE: Pulmonary Nodule Malignancy Classification Using its Temporal Evolution with Two-Stream 3D Convolutional Neural Networks
http://arxiv.org/abs/2005.11341
AUTHORS: Xavier Rafael-Palou ; Anton Aubanell ; Ilaria Bonavita ; Mario Ceresa ; Gemma Piella ; Vicent Ribas ; Miguel A. González Ballester
HIGHLIGHT: In this work, we propose a two-stream 3D convolutional neural network that predicts malignancy by jointly analyzing two pulmonary nodule volumes from the same patient taken at different time-points.
130, TITLE: Single-Agent Optimization Through Policy Iteration Using Monte-Carlo Tree Search
http://arxiv.org/abs/2005.11335
AUTHORS: Arta Seify ; Michael Buro
COMMENTS: Poster presentation at RL in Games Workshop, AAAI 2020
HIGHLIGHT: In this paper, we describe a search algorithm that uses a variant of MCTS which we enhanced by 1) a novel action value normalization mechanism for games with potentially unbounded rewards (which is the case in many optimization problems), 2) defining a virtual loss function that enables effective search parallelization, and 3) a policy network, trained by generations of self-play, to guide the search.
131, TITLE: SentPWNet: A Unified Sentence Pair Weighting Network for Task-specific Sentence Embedding
http://arxiv.org/abs/2005.11347
AUTHORS: Li Zhang ; Han Wang ; Lingxiao Li
HIGHLIGHT: In this paper, our theoretical analysis shows that existing works severely suffered from a good pair sampling and instance weighting strategy.
132, TITLE: The Discussion Tracker Corpus of Collaborative Argumentation
http://arxiv.org/abs/2005.11344
AUTHORS: Christopher Olshefski ; Luca Lugini ; Ravneet Singh ; Diane Litman ; Amanda Godley
COMMENTS: In Proceedings of The 12th Language Resources and Evaluation Conference (LREC), Marseille, France, May 2020
HIGHLIGHT: The transcripts were annotated for three dimensions of collaborative argumentation: argument moves (claims, evidence, and explanations), specificity (low, medium, high) and collaboration (e.g., extensions of and disagreements about others' ideas). In addition to providing descriptive statistics on the corpus, we provide performance benchmarks and associated code for predicting each dimension separately, illustrate the use of the multiple annotations in the corpus to improve performance via multi-task learning, and finally discuss other ways the corpus might be used to further NLP research.
133, TITLE: Stable and expressive recurrent vision models
http://arxiv.org/abs/2005.11362
AUTHORS: Drew Linsley ; Alekh Karkada Ashok ; Lakshmi Narasimhan Govindarajan ; Rex Liu ; Thomas Serre
HIGHLIGHT: Here, we develop a new learning algorithm, "contractor recurrent back-propagation" (C-RBP), which alleviates these issues by achieving constant O(1) memory-complexity with steps of recurrent processing.
134, TITLE: Open-Retrieval Conversational Question Answering
http://arxiv.org/abs/2005.11364
AUTHORS: Chen Qu ; Liu Yang ; Cen Chen ; Minghui Qiu ; W. Bruce Croft ; Mohit Iyyer
COMMENTS: Accepted to SIGIR'20
HIGHLIGHT: To address this limitation, we introduce an open-retrieval conversational question answering (ORConvQA) setting, where we learn to retrieve evidence from a large collection before extracting answers, as a further step towards building functional conversational search systems.
135, TITLE: JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans
http://arxiv.org/abs/2005.12209
AUTHORS: Fengze Liu ; Jingzheng Cai ; Yuankai Huo ; Le Lu ; Adam P Harrison
HIGHLIGHT: In this work, we propose a novel multi-task learning system, JSSR, based on an end-to-end 3D convolutional neural network that is composed of a generator, a register and a segmentor, for the tasks of synthesis, registration and segmentation, respectively.
136, TITLE: Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation
http://arxiv.org/abs/2005.11355
AUTHORS: Aakanksha Naik ; Carolyn Rosé
COMMENTS: To appear at ACL 2020
HIGHLIGHT: Our work leverages the adversarial domain adaptation (ADA) framework to introduce domain-invariance.
137, TITLE: Gleason Grading of Histology Prostate Images through Semantic Segmentation via Residual U-Net
http://arxiv.org/abs/2005.11368
AUTHORS: Amartya Kalapahar ; Julio Silva-Rodríguez ; Adrián Colomer ; Fernando López-Mir ; Valery Naranjo
HIGHLIGHT: Gleason Grading of Histology Prostate Images through Semantic Segmentation via Residual U-Net
138, TITLE: Image Translation by Latent Union of Subspaces for Cross-Domain Plaque Detection
http://arxiv.org/abs/2005.11384
AUTHORS: Yingying Zhu ; Daniel C. Elton ; Sungwon Lee ; Perry J. Pickhardt ; Ronald M. Summers
COMMENTS: accepted as a short paper in the 2020 Medical Imaging with Deep Learning (MIDL) conference
HIGHLIGHT: Motivated by this, we propose an image translation network using a shared union of subspaces constraint and show our approach preserves subtle structures (plaques) better than the conventional method.
139, TITLE: A review of sentiment analysis research in Arabic language
http://arxiv.org/abs/2005.12240
AUTHORS: Oumaima Oueslati ; Erik Cambria ; Moez Ben HajHmida ; Habib Ounelli
HIGHLIGHT: In this paper, we carry out an in-depth qualitative study of the most important research works in this context by presenting limits and strengths of existing approaches.
140, TITLE: Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning
http://arxiv.org/abs/2005.12254
AUTHORS: Jaskirat Singh ; Liang Zheng
HIGHLIGHT: To this end, we propose a dynamic value estimation (DVE) technique for these multiple-MDP environments, motivated by the clustering effect observed in the value function distribution across different scenes.
141, TITLE: Attention-based Neural Bag-of-Features Learning for Sequence Data
http://arxiv.org/abs/2005.12250
AUTHORS: Dat Thanh Tran ; Nikolaos Passalis ; Anastasios Tefas ; Moncef Gabbouj ; Alexandros Iosifidis
HIGHLIGHT: In this paper, we propose 2D-Attention (2DA), a generic attention formulation for sequence data, which acts as a complementary computation block that can detect and focus on relevant sources of information for the given learning objective.
142, TITLE: Demoting Racial Bias in Hate Speech Detection
http://arxiv.org/abs/2005.12246
AUTHORS: Mengzhou Xia ; Anjalie Field ; Yulia Tsvetkov
COMMENTS: Accepted at SocialNLP Workshop @ACL 2020
HIGHLIGHT: In this paper, we use adversarial training to mitigate this bias, introducing a hate speech classifier that learns to detect toxic sentences while demoting confounds corresponding to AAE texts.
143, TITLE: Neural Topological SLAM for Visual Navigation
http://arxiv.org/abs/2005.12256
AUTHORS: Devendra Singh Chaplot ; Ruslan Salakhutdinov ; Abhinav Gupta ; Saurabh Gupta
COMMENTS: Published in CVPR 2020. See the project webpage at https://devendrachaplot.github.io/projects/Neural-Topological-SLAM
HIGHLIGHT: We describe supervised learning-based algorithms that can build, maintain and use such representations under noisy actuation.
==========Updates to Previous Papers==========
1, TITLE: RelDenClu:A Relative Density based Biclustering Method for identifying non-linear feature relations with an Application to identify factors effecting spread of COVID-19
http://arxiv.org/abs/1811.04661
AUTHORS: Namita Jain ; Susmita Ghosh ; C. A. Murthy
HIGHLIGHT: The proposed method, RelDenClu uses the local variations in marginal and joint densities for each pair of features to find the subset of observations, which forms the bases of the relation between them.
2, TITLE: A case study of Consistent Vehicle Routing Problem with Time Windows
http://arxiv.org/abs/1912.05929
AUTHORS: Hernán Lespay ; Karol Suchan
HIGHLIGHT: We develop a heuristic solution method for the Consistent Vehicle Routing Problem with Time Windows (ConVRPTW), motivated by a real-world application at a distribution center of a food company.
3, TITLE: A type of generalization error induced by initialization in deep neural networks
http://arxiv.org/abs/1905.07777
AUTHORS: Yaoyu Zhang ; Zhi-Qin John Xu ; Tao Luo ; Zheng Ma
COMMENTS: Accepted by MSML
HIGHLIGHT: In this work, by exploiting the linearity of DNN training dynamics in the NTK regime \citep{jacot2018neural,lee2019wide}, we provide an explicit and quantitative answer to this problem.
4, TITLE: Learning Spatial-Spectral Prior for Super-Resolution of Hyperspectral Imagery
http://arxiv.org/abs/2005.08752
AUTHORS: Junjun Jiang ; He Sun ; Xianming Liu ; Jiayi Ma
COMMENTS: Accepted for publication at IEEE Transactions on Computational Imaging
HIGHLIGHT: In this paper, we make a step forward by investigating how to adapt state-of-the-art residual learning based single gray/RGB image super-resolution approaches for computationally efficient single hyperspectral image super-resolution, referred as SSPSR.
5, TITLE: Cell Segmentation and Tracking using CNN-Based Distance Predictions and a Graph-Based Matching Strategy
http://arxiv.org/abs/2004.01486
AUTHORS: Tim Scherr ; Katharina Löffler ; Moritz Böhland ; Ralf Mikut
COMMENTS: 16 pages, 14 figures, methods of the team KIT-Sch-GE for the IEEE ISBI 2020 Cell Tracking Challenge
HIGHLIGHT: In this paper, we present a method for the segmentation of touching cells in microscopy images.
6, TITLE: NPLDA: A Deep Neural PLDA Model for Speaker Verification
http://arxiv.org/abs/2002.03562
AUTHORS: Shreyas Ramoji ; Prashant Krishnan ; Sriram Ganapathy
COMMENTS: Published in Odyssey 2020, the Speaker and Language Recognition Workshop (VOiCES Special Session). Link to GitHub Implementation: https://github.com/iiscleap/NeuralPlda. arXiv admin note: substantial text overlap with arXiv:2001.07034
HIGHLIGHT: In this work, we propose a neural network approach for backend modeling in speaker recognition.
7, TITLE: On the Idiosyncrasies of the Mandarin Chinese Classifier System
http://arxiv.org/abs/1902.10193
AUTHORS: Shijia Liu ; Hongyuan Mei ; Adina Williams ; Ryan Cotterell
HIGHLIGHT: In this paper, we introduce an information-theoretic approach to measuring idiosyncrasy; we examine how much the uncertainty in Mandarin Chinese classifiers can be reduced by knowing semantic information about the nouns that the classifiers modify.
8, TITLE: SEAN: Image Synthesis with Semantic Region-Adaptive Normalization
http://arxiv.org/abs/1911.12861
AUTHORS: Peihao Zhu ; Rameen Abdal ; Yipeng Qin ; Peter Wonka
COMMENTS: Accepted as a CVPR 2020 oral paper. The interactive demo is available at https://youtu.be/0Vbj9xFgoUw
HIGHLIGHT: We propose semantic region-adaptive normalization (SEAN), a simple but effective building block for Generative Adversarial Networks conditioned on segmentation masks that describe the semantic regions in the desired output image.
9, TITLE: Using Computer Vision to enhance Safety of Workforce in Manufacturing in a Post COVID World
http://arxiv.org/abs/2005.05287
AUTHORS: Prateek Khandelwal ; Anuj Khandelwal ; Snigdha Agarwal ; Deep Thomas ; Naveen Xavier ; Arun Raghuraman
COMMENTS: 6 pages, 7 figure, 1 table
HIGHLIGHT: This paper describes an efficient and economic approach of using AI to create a safe environment in a manufacturing setup.
10, TITLE: Network Representation Learning for Link Prediction: Are we improving upon simple heuristics?
http://arxiv.org/abs/2002.11522
AUTHORS: Alexandru Mara ; Jefrey Lijffijt ; Tijl De Bie
HIGHLIGHT: In this work, we analyse 17 network embedding methods on 7 real-world datasets and find, using a consistent evaluation pipeline, only thin progress over the recent years.
11, TITLE: Wake Word Detection with Alignment-Free Lattice-Free MMI
http://arxiv.org/abs/2005.08347
AUTHORS: Yiming Wang ; Hang Lv ; Daniel Povey ; Lei Xie ; Sanjeev Khudanpur
COMMENTS: Submitted to Interspeech 2020. 5 pages, 3 figures
HIGHLIGHT: We present novel methods to train a hybrid DNN/HMM wake word detection system from partially labeled training data, and to use it in on-line applications: (i) we remove the prerequisite of frame-level alignments in the LF-MMI training algorithm, permitting the use of un-transcribed training examples that are annotated only for the presence/absence of the wake word; (ii) we show that the classical keyword/filler model must be supplemented with an explicit non-speech (silence) model for good performance; (iii) we present an FST-based decoder to perform online detection.
12, TITLE: Learning to Ask Medical Questions using Reinforcement Learning
http://arxiv.org/abs/2004.00994
AUTHORS: Uri Shaham ; Tom Zahavy ; Cesar Caraballo ; Shiwani Mahajan ; Daisy Massey ; Harlan Krumholz
HIGHLIGHT: We propose a novel reinforcement learning-based approach for adaptive and iterative feature selection.
13, TITLE: Long-Term Progress and Behavior Complexification in Competitive Co-Evolution
http://arxiv.org/abs/1909.08303
AUTHORS: Luca Simione ; Stefano Nolfi
HIGHLIGHT: We propose a new competitive algorithm that produces long-term global progress by identifying and by filtering out opportunistic variations, i.e. variations leading to progress against current competitors and retrogression against other competitors.
14, TITLE: MBA-RainGAN: Multi-branch Attention Generative Adversarial Network for Mixture of Rain Removal from Single Images
http://arxiv.org/abs/2005.10582
AUTHORS: Yiyang Shen ; Yidan Feng ; Sen Deng ; Dong Liang ; Jing Qin ; Haoran Xie ; Mingqiang Wei
HIGHLIGHT: We observe three intriguing phenomenons that, 1) rain is a mixture of raindrops, rain streaks and rainy haze; 2) the depth from the camera determines the degrees of object visibility, where objects nearby and faraway are visually blocked by rain streaks and rainy haze, respectively; and 3) raindrops on the glass randomly affect the object visibility of the whole image space.
15, TITLE: Should Answer Immediately or Wait for Further Information? A Novel Wait-or-Answer Task and Its Predictive Approach
http://arxiv.org/abs/2002.09616
AUTHORS: Zehao Lin ; Shaobo Cui ; Xiaoming Kang ; Guodun Li ; Feng Ji ; Haiqing Chen ; Yin Zhang
HIGHLIGHT: Motivated by such interesting quandary, we define a novel task: Wait-or-Answer to better tackle this dilemma faced by dialogue systems.
16, TITLE: A Simple Language Model for Task-Oriented Dialogue
http://arxiv.org/abs/2005.00796
AUTHORS: Ehsan Hosseini-Asl ; Bryan McCann ; Chien-Sheng Wu ; Semih Yavuz ; Richard Socher
COMMENTS: Version 2: Adding error analysis; 20 Pages, 1 figure, 18 tables
HIGHLIGHT: SimpleTOD is a simple approach to task-oriented dialogue that uses a single causal language model trained on all sub-tasks recast as a single sequence prediction problem.
17, TITLE: A novel centroid update approach for clustering-based superpixel methods and superpixel-based edge detection
http://arxiv.org/abs/1910.08439
AUTHORS: Houwang Zhang ; Chong Wu ; Le Zhang ; Hanying Zheng
COMMENTS: This paper has been accepted by ICIP2020
HIGHLIGHT: Besides, we propose a novel superpixel-based edge detection method.
18, TITLE: A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
http://arxiv.org/abs/2005.10113
AUTHORS: Linhao Dong ; Cheng Yi ; Jianzong Wang ; Shiyu Zhou ; Shuang Xu ; Xueli Jia ; Bo Xu
COMMENTS: 4 pages, 2 figures
HIGHLIGHT: In this work, we make a detailed comparison on a representative label-synchronous model (transformer) and a soft frame-synchronous model (continuous integrate-and-fire (CIF) based model).
19, TITLE: Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images
http://arxiv.org/abs/2005.03832
AUTHORS: Kelei He ; Wei Zhao ; Xingzhi Xie ; Wen Ji ; Mingxia Liu ; Zhenyu Tang ; Feng Shi ; Yang Gao ; Jun Liu ; Junfeng Zhang ; Dinggang Shen
HIGHLIGHT: To this end, we propose a synergistic learning framework for automated severity assessment of COVID-19 in 3D CT images, by jointly performing lung lobe segmentation and multi-instance classification.
20, TITLE: Predicting Online Item-choice Behavior: A Shape-restricted Regression Perspective
http://arxiv.org/abs/2004.08519
AUTHORS: Naoki Nishimura ; Noriyoshi Sukegawa ; Yuichi Takano ; Jiro Iwanaga
HIGHLIGHT: We propose a shape-restricted optimization model that accurately estimates item-choice probabilities for all possible PV sequences.
21, TITLE: Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT
http://arxiv.org/abs/1908.04943
AUTHORS: Han He ; Jinho D. Choi
COMMENTS: Accepted to the International Florida Artificial Intelligence Research Society Conference, FLAIRS 2020
HIGHLIGHT: This paper presents new state-of-the-art models for three tasks, part-of-speech tagging, syntactic parsing, and semantic parsing, using the cutting-edge contextualized embedding framework known as BERT.
22, TITLE: Approximation Algorithms for Multi-Robot Patrol-Scheduling with Min-Max Latency
http://arxiv.org/abs/2005.02530
AUTHORS: Peyman Afshani ; Mark De Berg ; Kevin Buchin ; Jie Gao ; Maarten Loffler ; Amir Nayyeri ; Benjamin Raichel ; Rik Sarkar ; Haotian Wang ; Hao-Tsung Yang
COMMENTS: Proceedings of the 14th International Workshop on the Algorithmic Foundations of Robotics (WAFR 20)
HIGHLIGHT: We present a polynomial-time algorithm with an approximation factor of $O(k^2 \log \frac{w_{\max}}{w_{\min}})$ to the optimal solution, where $w_{\max}$ and $w_{\min}$ are the maximum and minimum weight of the sites respectively.
23, TITLE: Understanding Graph Isomorphism Network for rs-fMRI Functional Connectivity Analysis
http://arxiv.org/abs/2001.03690
AUTHORS: Byung-Hoon Kim ; Jong Chul Ye
COMMENTS: This paper is accepted for Frontiers in Neuroscience
HIGHLIGHT: One of the important contributions of this paper is the observation that the GIN is a dual representation of convolutional neural network (CNN) in the graph space where the shift operation is defined using the adjacency matrix.
24, TITLE: Classification of Arrhythmia by Using Deep Learning with 2-D ECG Spectral Image Representation
http://arxiv.org/abs/2005.06902
AUTHORS: Amin Ullah ; Syed M. Anwar ; Muhammad Bilal ; Raja M Mehmood
COMMENTS: 14 pages, 5 figures, accepted for future publication in Remote Sensing MDPI Journal
HIGHLIGHT: In this study, we propose a two-dimensional (2-D) convolutional neural network (CNN) model for the classification of ECG signals into eight classes; namely, normal beat, premature ventricular contraction beat, paced beat, right bundle branch block beat, left bundle branch block beat, atrial premature contraction beat, ventricular flutter wave beat, and ventricular escape beat.
25, TITLE: Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning
http://arxiv.org/abs/2005.01259
AUTHORS: Liyan Xu ; Julien Hogan ; Rachel E. Patzer ; Jinho D. Choi
COMMENTS: Accepted to the ACL Workshop on Biomedical Natural Language Processing, BioNLP 2020
HIGHLIGHT: This paper presents a reinforcement learning approach to extract noise in long clinical documents for the task of readmission prediction after kidney transplant.
26, TITLE: DJEnsemble: On the Selection of a Disjoint Ensemble of Deep Learning Black-Box Spatio-temporal Models
http://arxiv.org/abs/2005.11093
AUTHORS: Yania Molina Souto ; Rafael Pereira ; Rocío Zorrilla ; Anderson Chaves ; Brian Tsan ; Florin Rusu ; Eduardo Ogasawara ; Artur Ziviani ; Fabio Porto
HIGHLIGHT: In this paper, we present a cost-based approach for the automatic selection and allocation of a disjoint ensemble of black-box predictors to answer predictive spatio-temporal queries.
27, TITLE: A Closer Look at Deep Policy Gradients
http://arxiv.org/abs/1811.02553
AUTHORS: Andrew Ilyas ; Logan Engstrom ; Shibani Santurkar ; Dimitris Tsipras ; Firdaus Janoos ; Larry Rudolph ; Aleksander Madry
COMMENTS: ICLR 2020 version
HIGHLIGHT: To this end, we propose a fine-grained analysis of state-of-the-art methods based on key elements of this framework: gradient estimation, value prediction, and optimization landscapes.
28, TITLE: CRVOS: Clue Refining Network for Video Object Segmentation
http://arxiv.org/abs/2002.03651
AUTHORS: Suhwan Cho ; MyeongAh Cho ; Tae-young Chung ; Heansung Lee ; Sangyoun Lee
COMMENTS: ICIP 2020 camera-ready version
HIGHLIGHT: In this work, we propose a simple specifier, referred to as the Clue, which consists of the previous frame's coarse mask and coordinates information.
29, TITLE: Algorithm for Finding the Maximum Clique Based on Continuous Time Quantum Walk
http://arxiv.org/abs/1912.02728
AUTHORS: Xi Li ; Mingyou Wu ; Hanwu Chen
HIGHLIGHT: In this work, we consider the application of continuous time quantum walking(CTQW) to the Maximum Clique(MC) Problem.
30, TITLE: DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning
http://arxiv.org/abs/1909.02982
AUTHORS: Theo Jaunet ; Romain Vuillemot ; Christian Wolf
HIGHLIGHT: We present DRLViz, a visual analytics interface to interpret the internal memory of an agent (e.g. a robot) trained using deep reinforcement learning.
31, TITLE: CompLex: A New Corpus for Lexical Complexity Predicition from Likert Scale Data
http://arxiv.org/abs/2003.07008
AUTHORS: Matthew Shardlow ; Michael Cooper ; Marcos Zampieri
COMMENTS: Proceedings of the 1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI). pp. 57-62
HIGHLIGHT: With a few exceptions, previous studies have approached the task as a binary classification task in which systems predict a complexity value (complex vs. non-complex) for a set of target words in a text.
32, TITLE: Principal Neighbourhood Aggregation for Graph Nets
http://arxiv.org/abs/2004.05718
AUTHORS: Gabriele Corso ; Luca Cavalleri ; Dominique Beaini ; Pietro Liò ; Petar Veličković
HIGHLIGHT: Accordingly, we propose Principal Neighbourhood Aggregation (PNA), a novel architecture combining multiple aggregators with degree-scalers (which generalize the sum aggregator).
33, TITLE: Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
http://arxiv.org/abs/2002.08037
AUTHORS: Tianpei Yang ; Jianye Hao ; Zhaopeng Meng ; Zongzhang Zhang ; Yujing Hu ; Yingfeng Cheng ; Changjie Fan ; Weixun Wang ; Wulong Liu ; Zhaodong Wang ; Jiajie Peng
COMMENTS: Accepted by IJCAI'2020
HIGHLIGHT: In this paper, we propose a novel Policy Transfer Framework (PTF) to accelerate RL by taking advantage of this idea.
34, TITLE: Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient
http://arxiv.org/abs/1807.00737
AUTHORS: Rui Zhao ; Volker Tresp
COMMENTS: Published in IEEE Spoken Language Technology (SLT 2018), Athens, Greece
HIGHLIGHT: To mitigate this problem, we propose a class of novel temperature-based extensions for policy gradient methods, which are referred to as Tempered Policy Gradients (TPGs).
35, TITLE: Simple Dataset for Proof Method Recommendation in Isabelle/HOL (Dataset Description)
http://arxiv.org/abs/2004.10667
AUTHORS: Yutaka Nagashima
COMMENTS: This is the preprint of our short paper accepted at the 13th Conference on Intelligent Computer Mathematics (CICM 2020)
HIGHLIGHT: In this data description, we present a simple dataset that contains data on over 400k proof method applications along with over 100 extracted features for each in a format that can be processed easily without any knowledge about formal logic.
36, TITLE: LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis
http://arxiv.org/abs/2002.02735
AUTHORS: Shreyas Ramoji ; Prashant Krishnan ; Bhargavram Mysore ; Prachi Singh ; Sriram Ganapathy
COMMENTS: Published In Proc. Odyssey 2020, the Speaker and Language Recognition Workshop. Link to GitHub Implementation: https://github.com/iiscleap/NeuralPlda
HIGHLIGHT: In this paper, we provide a detailed account of the LEAP SRE system submitted to the CTS challenge focusing on the novel components in the back-end system modeling.
37, TITLE: Improved BiGAN training with marginal likelihood equalization
http://arxiv.org/abs/1911.01425
AUTHORS: Pablo Sánchez-Martín ; Pablo M. Olmos ; Fernando Perez-Cruz
HIGHLIGHT: We propose a novel training procedure for improving the performance of generative adversarial networks (GANs), especially to bidirectional GANs.
38, TITLE: Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere
http://arxiv.org/abs/2005.10242
AUTHORS: Tongzhou Wang ; Phillip Isola
HIGHLIGHT: In this work, we identify two key properties related to the contrastive loss: (1) alignment (closeness) of features from positive pairs, and (2) uniformity of the induced distribution of the (normalized) features on the hypersphere.