-
Notifications
You must be signed in to change notification settings - Fork 6
/
2020.04.21.txt
1614 lines (1328 loc) · 123 KB
/
2020.04.21.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
==========New Papers==========
1, TITLE: Complex-Object Visual Inspection via Multiple Lighting Configurations
http://arxiv.org/abs/2004.09374
AUTHORS: Maya Aghaei ; Matteo Bustreo ; Pietro Morerio ; Nicolo Carissimi ; Alessio Del Bue ; Vittorio Murino
COMMENTS: 8 pages, 7 figures, submitted to ICPR2020
HIGHLIGHT: Along with an exhaustive set of experiments, in this paper, we demonstrate the suitability of the proposed setup for effective illumination of complex-objects, defined as manufactured items with variable surface characteristics that cannot be determined a priori.
2, TITLE: Counterexamples to the Low-Degree Conjecture
http://arxiv.org/abs/2004.08454
AUTHORS: Justin Holmgren ; Alexander S. Wein
COMMENTS: 10 pages
HIGHLIGHT: In this work, we refute the conjecture of Hopkins.
3, TITLE: Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking
http://arxiv.org/abs/2004.09305
AUTHORS: Peiliang Li ; Jieqi Shi ; Shaojie Shen
COMMENTS: cvpr2020
HIGHLIGHT: To benefit from both the powerful object understanding skill from deep neural network meanwhile tackle precise geometry modeling for consistent trajectory estimation, we propose a joint spatial-temporal optimization-based stereo 3D object tracking method.
4, TITLE: Reducing Commutativity Verification to Reachability with Differencing Abstractions
http://arxiv.org/abs/2004.08450
AUTHORS: Eric Koskinen ; Kshitij Bansal
HIGHLIGHT: In this paper, we describe techniques to automatically prove the correctness of method commutativity conditions from data structure implementations.
5, TITLE: Adversarial Attack on Deep Learning-Based Splice Localization
http://arxiv.org/abs/2004.08443
AUTHORS: Andras Rozsa ; Zheng Zhong ; Terrance E. Boult
COMMENTS: This is a pre-print of the original paper accepted at the CVPR Workshop on Media Forensics 2020
HIGHLIGHT: Regarding image forensics, researchers have proposed various approaches to detect and/or localize manipulations, such as splices.
6, TITLE: Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills
http://arxiv.org/abs/2004.08449
AUTHORS: Eric Michael Smith ; Mary Williamson ; Kurt Shuster ; Jason Weston ; Y-Lan Boureau
COMMENTS: accepted to ACL 2020 (long paper)
HIGHLIGHT: In this work, we investigate several ways to combine models trained towards isolated capabilities, ranging from simple model aggregation schemes that require minimal additional training, to various forms of multi-task training that encompass several skills at all training stages. Previous work has introduced tasks and datasets that aim to help agents to learn those qualities in isolation and gauge how well they can express them. We further propose a new dataset, BlendedSkillTalk, to analyze how these capabilities would mesh together in a natural conversation, and compare the performance of different architectures and training schemes.
7, TITLE: Parallelization Techniques for Verifying Neural Networks
http://arxiv.org/abs/2004.08440
AUTHORS: Haoze Wu ; Alex Ozdemir ; Aleksandar Zeljić ; Ahmed Irfan ; Kyle Julian ; Divya Gopinath ; Sadjad Fouladighaleh ; Guy Katz ; Corina Pasareanu ; Clark Barrett
HIGHLIGHT: We introduce an algorithm based on partitioning the verification problem in an iterative manner and explore two partitioning strategies, that work by partitioning the input space or by case splitting on the phases of the neuron activations, respectively.
8, TITLE: Robust Partial Matching for Person Search in the Wild
http://arxiv.org/abs/2004.09329
AUTHORS: Yingji Zhong ; Xiaoyu Wang ; Shiliang Zhang
COMMENTS: 9 pages, 7 figures, accepted to CVPR 2020. The dataset will be released soon
HIGHLIGHT: To alleviate this issue, this paper proposes an Align-to-Part Network (APNet) for person detection and re-Identification (reID).
9, TITLE: Combining multimodal information for Metal Artefact Reduction: An unsupervised deep learning framework
http://arxiv.org/abs/2004.09321
AUTHORS: Marta B. M. Ranzini ; Irme Groothuis ; Kerstin Kläser ; M. Jorge Cardoso ; Johann Henckel ; Sébastien Ourselin ; Alister Hart ; Marc Modat
COMMENTS: Accepted at IEEE International Symposium on Biomedical Imaging (ISBI) 2020
HIGHLIGHT: In this work, we hypothesise that a multimodal approach to MAR would improve both CT and MRI.
10, TITLE: A Novel Multi-Agent System for Complex Scheduling Problems
http://arxiv.org/abs/2004.09312
AUTHORS: Peter Hillmann ; Tobias Uhlig ; Gabi Dreo Rodosek ; Oliver Rose
HIGHLIGHT: The objective of this paper is the conception and implementation of a multi-agent system that is applicable in various problem domains.
11, TITLE: How Do Neural Networks Estimate Optical Flow? A Neuropsychology-Inspired Study
http://arxiv.org/abs/2004.09317
AUTHORS: D. B. de Jong ; F. Paredes-Vallés ; G. C. H. E. de Croon
COMMENTS: 15 pages, 15 figures
HIGHLIGHT: Instead, in this article, we investigate how deep neural networks estimate optical flow.
12, TITLE: Design and Control of Roller Grasper V2 for In-Hand Manipulation
http://arxiv.org/abs/2004.08499
AUTHORS: Shenli Yuan ; Lin Shao ; Connor L. Yako ; Alex Gruebele ; J. Kenneth Salisbury
COMMENTS: Under Review: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2020
HIGHLIGHT: In this work, we present a novel non-anthropomorphic robot grasper with the ability to manipulate objects by means of active surfaces at the fingertips.
13, TITLE: BReG-NeXt: Facial Affect Computing Using Adaptive Residual Networks With Bounded Gradient
http://arxiv.org/abs/2004.08495
AUTHORS: Behzad Hasani ; Pooran Singh Negi ; Mohammad H. Mahoor
COMMENTS: To appear in IEEE Transactions on Affective Computing journal
HIGHLIGHT: This paper introduces BReG-NeXt, a residual-based network architecture using a function wtih bounded derivative instead of a simple shortcut path (a.k.a. identity mapping) in the residual units for automatic recognition of facial expressions based on the categorical and dimensional models of affect.
14, TITLE: ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers
http://arxiv.org/abs/2004.09367
AUTHORS: Jung-Woo Ha ; Kihyun Nam ; Jin Gu Kang ; Sang-Woo Lee ; Sohee Yang ; Hyunhoon Jung ; Eunmi Kim ; Hyeji Kim ; Soojin Kim ; Hyun Ah Kim ; Kyoungtae Doh ; Chan Kyu Lee ; Sunghun Kim
COMMENTS: 5 pages, 1 figure, The first two authors equally contributed to this work
HIGHLIGHT: Here we introduce a new large-scale Korean call-based speech corpus under a goal-oriented dialog scenario from more than 11,000 people, i.e., ClovaCall corpus.
15, TITLE: Deep-COVID: Predicting COVID-19 From Chest X-Ray Images Using Deep Transfer Learning
http://arxiv.org/abs/2004.09363
AUTHORS: Shervin Minaee ; Rahele Kafieh ; Milan Sonka ; Shakib Yazdani ; Ghazaleh Jamalipour Soufi
HIGHLIGHT: We evaluated these models on the remaining 3,000 images, and most of these networks achieved a sensitivity rate of 97\%($\pm$ 5\%), while having a specificity rate of around 90\%.
16, TITLE: Traffic Lane Detection using FCN
http://arxiv.org/abs/2004.08977
AUTHORS: Shengchang Zhang ; Ahmed EI Koubia ; Khaled Abdul Karim Mohammed
COMMENTS: 6 pages, 6 figures
HIGHLIGHT: In this project, we designed an Encoder- Decoder, Fully Convolutional Network for lane detection.
17, TITLE: Spectral GUI for Automated Tissue and Lesion Segmentation of T1 Weighted Breast MR Images
http://arxiv.org/abs/2004.08960
AUTHORS: Prajval Koul
HIGHLIGHT: We present Spectral GUI, a multiplatform breast MR image analysis tool designed to facilitate the segmentation of fibro glandular tissues and lesions in T1 weighted breast MR images via a graphical user interface (GUI).
18, TITLE: Machine Learning based Pallets Detection and Tracking in AGVs
http://arxiv.org/abs/2004.08965
AUTHORS: Shengchang Zhang ; Jie Xiang ; Weijian Han
COMMENTS: 6 pages, 8 figures, 1 table
HIGHLIGHT: In this project, we constructed a deep learning-based pallets detection and tracking architecture for pallets detection and position tracking.
19, TITLE: Local Search is a Remarkably Strong Baseline for Neural Architecture Search
http://arxiv.org/abs/2004.08996
AUTHORS: T. Den Ottelander ; A. Dushatskiy ; M. Virgolin ; P. A. N. Bosman
COMMENTS: 17 pages, 12 figures
HIGHLIGHT: In this work we consider, for the first time, a simple Local Search (LS) algorithm for NAS.
20, TITLE: Exclusive Hierarchical Decoding for Deep Keyphrase Generation
http://arxiv.org/abs/2004.08511
AUTHORS: Wang Chen ; Hou Pong Chan ; Piji Li ; Irwin King
COMMENTS: ACL 2020
HIGHLIGHT: Keyphrase generation (KG) aims to summarize the main ideas of a document into a set of keyphrases.
21, TITLE: Adversarial Training for Large Neural Language Models
http://arxiv.org/abs/2004.08994
AUTHORS: Xiaodong Liu ; Hao Cheng ; Pengcheng He ; Weizhu Chen ; Yu Wang ; Hoifung Poon ; Jianfeng Gao
COMMENTS: 13 pages, 9 tables, 2 figures
HIGHLIGHT: In this paper, we show that adversarial pre-training can improve both generalization and robustness.
22, TITLE: JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection
http://arxiv.org/abs/2004.08515
AUTHORS: Keren Fu ; Deng-Ping Fan ; Ge-Peng Ji ; Qijun Zhao
HIGHLIGHT: This paper proposes a novel joint learning and densely-cooperative fusion (JL-DCF) architecture for RGB-D salient object detection.
23, TITLE: Semi-Supervised Semantic Segmentation via Dynamic Self-Training and Class-Balanced Curriculum
http://arxiv.org/abs/2004.08514
AUTHORS: Zhengyang Feng ; Qianyu Zhou ; Guangliang Cheng ; Xin Tan ; Jianping Shi ; Lizhuang Ma
COMMENTS: Code is available at https://github.com/voldemortX/DST-CBC
HIGHLIGHT: In this work, we propose a novel and concise approach for semi-supervised semantic segmentation.
24, TITLE: ImagePairs: Realistic Super Resolution Dataset via Beam Splitter Camera Rig
http://arxiv.org/abs/2004.08513
AUTHORS: Hamid Reza Vaezi Joze ; Ilya Zharkov ; Karlton Powell ; Carl Ringler ; Luming Liang ; Andy Roulston ; Moshe Lutz ; Vivek Pradeep
HIGHLIGHT: Herein, we are proposing a new data acquisition technique for gathering real image data set which could be used as an input for super resolution, noise cancellation and quality enhancement techniques.
25, TITLE: Finding Berries: Segmentation and Counting of Cranberries using Point Supervision and Shape Priors
http://arxiv.org/abs/2004.08501
AUTHORS: Peri Akiva ; Kristin Dana ; Peter Oudemans ; Michael Mars
COMMENTS: to be published in proceeding of CVPR 2020 in Agriculture Vision Workshop
HIGHLIGHT: In this work, we present a deep learning method for simultaneous segmentation and counting of cranberries to aid in yield estimation and sun exposure predictions. To train and evaluate the network, we have collected the CRanberry Aerial Imagery Dataset (CRAID), the largest dataset of aerial drone imagery from cranberry fields.
26, TITLE: A Formal Hierarchy of RNN Architectures
http://arxiv.org/abs/2004.08500
AUTHORS: William Merrill ; Gail Weiss ; Yoav Goldberg ; Roy Schwartz ; Noah A. Smith ; Eran Yahav
COMMENTS: To appear at ACL 2020. 10 pages with appendix
HIGHLIGHT: We develop a formal hierarchy of the expressive capacity of RNN architectures.
27, TITLE: Predicting Online Item-choice Behavior: A Shape-restricted Regression Perspective
http://arxiv.org/abs/2004.08519
AUTHORS: Naoki Nishimura ; Noriyoshi Sukegawa ; Yuichi Takano ; Jiro Iwanaga
HIGHLIGHT: We propose a shape-restricted optimization model to accurately estimate item-choice probabilities for all possible PV sequences.
28, TITLE: Super-Resolution-based Snake Model -- An Unsupervised Method for Large-Scale Building Extraction using Airborne LiDAR Data and Optical Image
http://arxiv.org/abs/2004.08522
AUTHORS: Thanh Huy Nguyen ; Sylvie Daniel ; Didier Gueriot ; Christophe Sintes ; Jean-Marc Le Caillec
COMMENTS: 30 pages, 15 figures. Submitted to the MDPI Remote Sensing
HIGHLIGHT: This paper presents an efficient snake model to overcome such challenge, called Super-Resolution-based Snake Model (SRSM).
29, TITLE: Effect of Text Color on Word Embeddings
http://arxiv.org/abs/2004.08526
AUTHORS: Masaya Ikoma ; Brian Kenji Iwana ; Seiichi Uchida
COMMENTS: to appear at the 14th International Workshop on Document Analysis Systems (DAS) 2020
HIGHLIGHT: In this paper, we try two tasks to verify the usefulness of text color in understanding the meanings of words, especially in identifying synonyms and antonyms.
30, TITLE: Class Distribution Alignment for Adversarial Domain Adaptation
http://arxiv.org/abs/2004.09403
AUTHORS: Wanqi Yang ; Tong Ling ; Chengmei Yang ; Lei Wang ; Yinghuan Shi ; Luping Zhou ; Ming Yang
HIGHLIGHT: To address this issue, we propose a novel approach called Conditional ADversarial Image Translation (CADIT) to explicitly align the class distributions given samples between the two domains.
31, TITLE: A Comprehensive Survey on Traffic Prediction
http://arxiv.org/abs/2004.08555
AUTHORS: Xueyan Yin ; Genze Wu ; Jinze Wei ; Yanming Shen ; Heng Qi ; Baocai Yin
HIGHLIGHT: The purpose of this paper is to provide a comprehensive survey for traffic prediction. Third, We collect and organize several related public datasets in the existing literature.
32, TITLE: Learning to Dehaze From Realistic Scene with A Fast Physics Based Dehazing Network
http://arxiv.org/abs/2004.08554
AUTHORS: Ruoteng Li ; Xiaoyi Zhang ; Shaodi You ; Yu Li
HIGHLIGHT: In this paper, we complement the exiting datasets with a new, large, and diverse dehazing dataset containing real outdoor scenes from HD 3D videos.
33, TITLE: The Notorious Difficulty of Comparing Human and Machine Perception
http://arxiv.org/abs/2004.09406
AUTHORS: Christina M. Funke ; Judy Borowski ; Karolina Stosio ; Wieland Brendel ; Thomas S. A. Wallis ; Matthias Bethge
HIGHLIGHT: Here, we highlight common shortcomings that can easily lead to fragile conclusions.
34, TITLE: Accurate Tumor Tissue Region Detection with Accelerated Deep Convolutional Neural Networks
http://arxiv.org/abs/2004.08552
AUTHORS: Gabriel Tjio ; Xulei Yang ; Jia Mei Hong ; Sum Thai Wong ; Vanessa Ding ; Andre Choo ; Yi Su
COMMENTS: 9 pages, 6 figures, 3 tables
HIGHLIGHT: In computer vision approaches using deep learning methods, the input image is subdivided into patches which are separately passed through the neural network.
35, TITLE: Kernels for time series with irregularly-spaced multivariate observations
http://arxiv.org/abs/2004.08545
AUTHORS: Ahmed Guecioueur ; Franz J. Király
COMMENTS: 11 pages
HIGHLIGHT: In this manuscript, we show that a "series kernel" that is general enough to represent irregularly-spaced multivariate time series may be built out of well-known "vector kernels".
36, TITLE: Color Image Segmentation using Adaptive Particle Swarm Optimization and Fuzzy C-means
http://arxiv.org/abs/2004.08547
AUTHORS: Narayana Reddy A ; Ranjita Das
COMMENTS: 4 pages, 2 figures, Included in conference proceedings of "International conference in Recent Trends on Electronics & Computer Science (ICRTECS-2019)" organised by National Institute of Technology, Silchar
HIGHLIGHT: This paper presents a new image segmentation algorithm called Adaptive Particle Swarm Optimization and Fuzzy C-means Clustering Algorithm (APSOF), which is based on Adaptive Particle Swarm Optimization (APSO) and Fuzzy C-means clustering.
37, TITLE: FedNAS: Federated Deep Learning via Neural Architecture Search
http://arxiv.org/abs/2004.08546
AUTHORS: Chaoyang He ; Murali Annavaram ; Salman Avestimehr
COMMENTS: accepted to CVPR 2020 workshop on neural architecture search and beyond for representation learning
HIGHLIGHT: We propose a Federated NAS (FedNAS) algorithm to help scattered workers collaboratively searching for a better architecture with higher accuracy.
38, TITLE: Moire Image Restoration using Multi Level Hyper Vision Net
http://arxiv.org/abs/2004.08541
AUTHORS: D. Sabari Nathan ; M. Parisa Beham ; S. M. Md Mansoor Roomi
COMMENTS: 8 pages , 5 figures
HIGHLIGHT: These challenges makes the demoireing more difficult than many other image restoration tasks.
39, TITLE: Autonomous task planning and situation awareness in robotic surgery
http://arxiv.org/abs/2004.08911
AUTHORS: Michele Ginesi ; Daniele Meli ; Andrea Roberti ; Nicola Sansonetto ; Paolo Fiorini
COMMENTS: Submitted to IROS 2020 conference
HIGHLIGHT: In this paper, we propose a framework to implement surgical task automation.
40, TITLE: The Cost of Training NLP Models: A Concise Overview
http://arxiv.org/abs/2004.08900
AUTHORS: Or Sharir ; Barak Peleg ; Yoav Shoham
HIGHLIGHT: We review the cost of training large-scale language models, and the drivers of these costs.
41, TITLE: Tree Echo State Autoencoders with Grammars
http://arxiv.org/abs/2004.08925
AUTHORS: Benjamin Paassen ; Irena Koprinska ; Kalina Yacef
COMMENTS: accepted at the 2020 International Joint Conference on Neural Networks (IJCNN 2020)
HIGHLIGHT: In this paper, we propose tree echo state autoencoders (TES-AE), which are guided by a tree grammar and can be trained within seconds by virtue of reservoir computing.
42, TITLE: 3D rectification with Visual Sphere perspective: an algebraic alternative for P4P pose estimation
http://arxiv.org/abs/2004.08933
AUTHORS: Jakub Maksymilian Fober
COMMENTS: 12 pages, 5 figures
HIGHLIGHT: Presented algorithm solves co-planar P4P problem of parallel lines viewed in perspective with algebraic equation.
43, TITLE: MER-GCN: Micro Expression Recognition Based on Relation Modeling with Graph Convolutional Network
http://arxiv.org/abs/2004.08915
AUTHORS: Ling Lo ; Hong-Xia Xie ; Hong-Han Shuai ; Wen-Huang Cheng
COMMENTS: Accepted by IEEE MIPR 2020
HIGHLIGHT: Inspired by the nodes relationship building Graph Convolutional Networks (GCN), we propose an end-to-end AU-oriented graph classification network, namely MER-GCN, which uses 3D ConvNets to extract AU features and applies GCN layers to discover the dependency laying between AU nodes for ME categorization.
44, TITLE: MuBiNN: Multi-Level Binarized Recurrent Neural Network for EEG signal Classification
http://arxiv.org/abs/2004.08914
AUTHORS: Seyed Ahmad Mirsalari ; Sima Sinaei ; Mostafa E. Salehi ; Masoud Daneshtalab
COMMENTS: To appear in IEEE International Symposium on Circuits & Systems in 2020. arXiv admin note: text overlap with arXiv:1807.04093 by other authors
HIGHLIGHT: In this paper, we propose a multi-level binarized LSTM, which significantly reduces computations whereas ensuring an accuracy pretty close to the full precision LSTM.
45, TITLE: A practical approach to testing random number generators in computer algebra systems
http://arxiv.org/abs/2004.08913
AUTHORS: Migran N. Gevorkyan ; Dmitry S. Kulyabov ; Anastasia V. Demidova ; Anna V. Korolkova
COMMENTS: in English, in Russian
HIGHLIGHT: Up to now, a large number of libraries and weakly supported mathematical packages use outdated algorithms for random number generation.
46, TITLE: Quantum algorithms for computational geometry problems
http://arxiv.org/abs/2004.08949
AUTHORS: Andris Ambainis ; Nikita Larka
COMMENTS: 10 pages
HIGHLIGHT: We study quantum algorithms for problems in computational geometry, such as POINT-ON-3-LINES problem.
47, TITLE: Desmoking laparoscopy surgery images using an image-to-image translation guided by an embedded dark channel
http://arxiv.org/abs/2004.08947
AUTHORS: Sebastián Salazar-Colores ; Hugo Alberto-Moreno ; César Javier Ortiz-Echeverri ; Gerardo Flores
HIGHLIGHT: In this paper, a novel computational approach to remove the smoke effects is introduced.
48, TITLE: ResNeSt: Split-Attention Networks
http://arxiv.org/abs/2004.08955
AUTHORS: Hang Zhang ; Chongruo Wu ; Zhongyue Zhang ; Yi Zhu ; Zhi Zhang ; Haibin Lin ; Yue Sun ; Tong He ; Jonas Mueller ; R. Manmatha ; Mu Li ; Alexander Smola
HIGHLIGHT: We present a simple and modular Split-Attention block that enables attention across feature-map groups.
49, TITLE: Exploring Racial Bias within Face Recognition via per-subject Adversarially-Enabled Data Augmentation
http://arxiv.org/abs/2004.08945
AUTHORS: Seyma Yucer ; Samet Akçay ; Noura Al-Moubayed ; Toby P. Breckon
COMMENTS: CVPR 2020 - Fair, Data Efficient and Trusted Computer Vision Workshop
HIGHLIGHT: In this study, we propose a novel adversarial derived data augmentation methodology that aims to enable dataset balance at a per-subject level via the use of image-to-image transformation for the transfer of sensitive racial characteristic facial features.
50, TITLE: Unsupervised Vehicle Counting via Multiple Camera Domain Adaptation
http://arxiv.org/abs/2004.09251
AUTHORS: Luca Ciampi ; Carlos Santiago ; Joao Paulo Costeira ; Claudio Gennaro ; Giuseppe Amato
COMMENTS: Submitted to the ECAI 2020: "1st International Workshop on New Foundations for Human-Centered AI"
HIGHLIGHT: We propose and discuss a new methodology to design image-based vehicle density estimators with few labeled data via multiple camera domain adaptations.
51, TITLE: A Revised Generative Evaluation of Visual Dialogue
http://arxiv.org/abs/2004.09272
AUTHORS: Daniela Massiceti ; Viveka Kulharia ; Puneet K. Dokania ; N. Siddharth ; Philip H. S. Torr
COMMENTS: 16 pages, 5 figures
HIGHLIGHT: We propose a revised evaluation scheme for the VisDial dataset leveraging metrics from the NLP literature to measure consensus between answers generated by the model and a set of relevant answers. We construct these relevant answer sets using a simple and effective semi-supervised method based on correlation, which allows us to automatically extend and scale sparse relevance annotations from humans to the entire dataset. We release these sets and code for the revised evaluation scheme as DenseVisDial, and intend them to be an improvement to the dataset in the face of its existing constraints and design choices.
52, TITLE: An Efficient Method for Computing Liouvillian First Integrals of Planar Polynomial Vector Fields
http://arxiv.org/abs/2004.09298
AUTHORS: L. G. S. Duarte ; L. A. C. P. da Mota
HIGHLIGHT: Here we present an efficient method to compute Darboux polynomials for polynomial vector fields in the plane.
53, TITLE: MPNet: Masked and Permuted Pre-training for Language Understanding
http://arxiv.org/abs/2004.09297
AUTHORS: Kaitao Song ; Xu Tan ; Tao Qin ; Jianfeng Lu ; Tie-Yan Liu
HIGHLIGHT: In this paper, we propose MPNet, a novel pre-training method that inherits the advantages of BERT and XLNet and avoids their limitations.
54, TITLE: A Study of Cross-Lingual Ability and Language-specific Information in Multilingual BERT
http://arxiv.org/abs/2004.09205
AUTHORS: Chi-Liang Liu ; Tsung-Yuan Hsu ; Yung-Sung Chuang ; Hung-Yi Lee
HIGHLIGHT: In this work, we provide an in-depth experimental study to supplement the existing literature of cross-lingual ability.
55, TITLE: End-to-End Learning for Video Frame Compression with Self-Attention
http://arxiv.org/abs/2004.09226
AUTHORS: Nannan Zou ; Honglei Zhang ; Francesco Cricri ; Hamed R. Tavakoli ; Jani Lainema ; Emre Aksu ; Miska Hannuksela ; Esa Rahtu
HIGHLIGHT: In this paper, we propose an end-to-end learned system for compressing video frames.
56, TITLE: Unsupervised Person Re-identification via Multi-label Classification
http://arxiv.org/abs/2004.09228
AUTHORS: Dongkai Wang ; Shiliang Zhang
COMMENTS: CVPR2020
HIGHLIGHT: This paper formulates unsupervised person ReID as a multi-label classification task to progressively seek true labels.
57, TITLE: Towards Understanding Normalization in Neural ODEs
http://arxiv.org/abs/2004.09222
AUTHORS: Julia Gusak ; Larisa Markeeva ; Talgat Daulbaev ; Alexandr Katrutsa ; Andrzej Cichocki ; Ivan Oseledets
HIGHLIGHT: This paper investigates how different normalization techniques affect the performance of neural ODEs.
58, TITLE: 4D Deep Learning for Multiple Sclerosis Lesion Activity Segmentation
http://arxiv.org/abs/2004.09216
AUTHORS: Nils Gessert ; Marcel Bengs ; Julia Krüger ; Roland Opfer ; Ann-Christin Ostwaldt ; Praveena Manogaran ; Sven Schippling ; Alexander Schlaefer
COMMENTS: Accepted at MIDL 2020
HIGHLIGHT: In this work, we investigate whether extending this problem to full 4D deep learning using a history of MRI volumes and thus an extended baseline can improve performance.
59, TITLE: CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition
http://arxiv.org/abs/2004.09215
AUTHORS: Zhengwei Wang ; Qi She ; Tejo Chalasani ; Aljosa Smolic
COMMENTS: CVPR 2020 Workshop at Continual Learning (CLVISION)
HIGHLIGHT: In this work, we demonstrate a lifelong 3D convolutional framework -- c(C)la(a)ss increment(t)al net(Net)work (CatNet), which considers temporal information in videos and enables lifelong learning for egocentric gesture video recognition by learning the feature representation of an exemplar set selected from previous class samples.
60, TITLE: Learning Geometric Word Meta-Embeddings
http://arxiv.org/abs/2004.09219
AUTHORS: Pratik Jawanpuria ; N T V Satya Dev ; Anoop Kunchukuttan ; Bamdev Mishra
HIGHLIGHT: We propose a geometric framework for learning meta-embeddings of words from different embedding sources.
61, TITLE: A Practical Guide to Studying Emergent Communication through Grounded Language Games
http://arxiv.org/abs/2004.09218
AUTHORS: Jens Nevens ; Paul Van Eecke ; Katrien Beuls
COMMENTS: This paper was officially published at the 'Language Learning for Artificial Agents (L2A2) Symposium' of the 2019 Artificial Intelligence and Simulation of Behaviour (AISB) Convention
HIGHLIGHT: The aim of this paper is twofold.
62, TITLE: CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
http://arxiv.org/abs/2004.09249
AUTHORS: Shinji Watanabe ; Michael Mandel ; Jon Barker ; Emmanuel Vincent
HIGHLIGHT: This paper provides a baseline description of the CHiME-6 challenge for both segmented multispeaker speech recognition (Track 1) and unsegmented multispeaker speech recognition (Track 2).
63, TITLE: The complexity of approximating averages on bounded-degree graphs
http://arxiv.org/abs/2004.09238
AUTHORS: Andreas Galanis ; Daniel Stefankovic ; Eric Vigoda
HIGHLIGHT: Our work extends to the antiferromagnetic Ising model and generalizes to all 2-spin antiferromagnetic models, establishing hardness of computing the average magnetization in the tree non-uniqueness region.
64, TITLE: Uncertainty-Aware Consistency Regularization for Cross-Domain Semantic Segmentation
http://arxiv.org/abs/2004.08878
AUTHORS: Qianyu Zhou ; Zhengyang Feng ; Guangliang Cheng ; Xin Tan ; Jianping Shi ; Lizhuang Ma
HIGHLIGHT: In this paper, we propose an uncertainty-aware consistency regularization method to tackle this issue for semantic segmentation.
65, TITLE: Make E Smart Again
http://arxiv.org/abs/2004.08858
AUTHORS: Zarathustra Amadeus Goertzel
COMMENTS: 8 pages, 2 figures, IJCAR2020
HIGHLIGHT: In this work in progress, we demonstrate a new use-case for the ENIGMA system.
66, TITLE: Role-Wise Data Augmentation for Knowledge Distillation
http://arxiv.org/abs/2004.08861
AUTHORS: Jie Fu ; Xue Geng ; Zhijian Duan ; Bohan Zhuang ; Xingdi Yuan ; Adam Trischler ; Jie Lin ; Chris Pal ; Hao Dong
HIGHLIGHT: We compare our approach with existing KD methods on training popular neural architectures and demonstrate that role-wise data augmentation improves the effectiveness of KD over strong prior approaches.
67, TITLE: Space Debris Ontology for ADR Capture Methods Selection
http://arxiv.org/abs/2004.08866
AUTHORS: Marko Jankovic ; Mehmed Yüksel ; Mohammad Mohammadzadeh Babr ; Francesca Letizia ; Vitali Braun
COMMENTS: 32 pages, 7 figures and 6 tables
HIGHLIGHT: To bridge this gap we present a domain-ontology of intact derelict objects, i.e. payloads and rocket bodies, for ADR capture methods selection.
68, TITLE: A Biologically Interpretable Two-stage Deep Neural Network (BIT-DNN) For Hyperspectral Imagery Classification
http://arxiv.org/abs/2004.08886
AUTHORS: Yue Shi ; Liangxiu Han ; Wenjiang Huang ; Sheng Chang ; Yingying Dong ; Darren Dancey ; Lianghao Han
COMMENTS: 13 pages, 11 figures
HIGHLIGHT: This study proposes an interpretable deep learning model -- a biologically interpretable two-stage deep neural network (BIT-DNN), by integrating biochemical and biophysical associated information into the proposed framework, capable of achieving both high accuracy and interpretability on HSI based classification tasks.
69, TITLE: A Weighted Population Update Rule for PACO Applied to the Single Machine Total Weighted Tardiness Problem
http://arxiv.org/abs/2004.08433
AUTHORS: Daniel Abitz ; Tom Hartmann ; Martin Middendorf
HIGHLIGHT: In this paper a new population update rule for population based ant colony optimization (PACO) is proposed.
70, TITLE: Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks
http://arxiv.org/abs/2004.08423
AUTHORS: Xin Chen ; Lingxi Xie ; Jun Wu ; Longhui Wei ; Yuhui Xu ; Qi Tian
COMMENTS: 17 pages, 2 figures and 3 tables
HIGHLIGHT: To accelerate it, researchers proposed weight-sharing methods which first train a super-network to reuse computation among different operators, from which exponentially many sub-networks can be sampled and efficiently evaluated.
71, TITLE: Organ at Risk Segmentation for Head and Neck Cancer using Stratified Learning and Neural Architecture Search
http://arxiv.org/abs/2004.08426
AUTHORS: Dazhou Guo ; Dakai Jin ; Zhuotun Zhu ; Tsung-Ying Ho ; Adam P. Harrison ; Chun-Hung Chao ; Jing Xiao ; Alan Yuille ; Chien-Yu Lin ; Le Lu
HIGHLIGHT: This is the goal of our work, where we introduce stratified organ at risk segmentation (SOARS), an approach that stratifies OARs into anchor, mid-level, and small & hard (S&H) categories.
72, TITLE: A Chinese Corpus for Fine-grained Entity Typing
http://arxiv.org/abs/2004.08825
AUTHORS: Chin Lee ; Hongliang Dai ; Yangqiu Song ; Xin Li
COMMENTS: LREC 2020
HIGHLIGHT: In this paper, we introduce a corpus for Chinese fine-grained entity typing that contains 4,800 mentions manually labeled through crowdsourcing.
73, TITLE: Improving Robot Dual-System Motor Learning with Intrinsically Motivated Meta-Control and Latent-Space Experience Imagination
http://arxiv.org/abs/2004.08830
AUTHORS: Muhammad Burhan Hafez ; Cornelius Weber ; Matthias Kerzel ; Stefan Wermter
HIGHLIGHT: In this paper, we present a novel dual-system motor learning approach where a meta-controller arbitrates online between model-based and model-free decisions based on an estimate of the local reliability of the learned model.
74, TITLE: Dynamic Knowledge Graph-based Dialogue Generation with Improved Adversarial Meta-Learning
http://arxiv.org/abs/2004.08833
AUTHORS: Hongcai Xu ; Junpeng Bao ; Gaojie Zhang
COMMENTS: 10 pages, 3 figures
HIGHLIGHT: This paper proposes a dynamic Knowledge graph-based dialogue generation method with improved adversarial Meta-Learning (KDAD).
75, TITLE: Graph-Structured Referring Expression Reasoning in The Wild
http://arxiv.org/abs/2004.08814
AUTHORS: Sibei Yang ; Guanbin Li ; Yizhou Yu
COMMENTS: CVPR 2020 Accepted Oral Paper. Data and code are available at https://github.com/sibeiyang/sgmn
HIGHLIGHT: In this paper, we propose a scene graph guided modular network (SGMN), which performs reasoning over a semantic graph and a scene graph with neural modules under the guidance of the linguistic structure of the expression.
76, TITLE: Human Activity Recognition using Inertial, Physiological and Environmental Sensors: a Comprehensive Survey
http://arxiv.org/abs/2004.08821
AUTHORS: Florenc Demrozi ; Graziano Pravadelli ; Azra Bihorac ; Parisa Rashidi
COMMENTS: 31 Pages, 11 Figures, 6 Tabels Paper under review process
HIGHLIGHT: This survey focuses on critical applications of Machine Learning (ML) in the fields of HAR, largely oriented to Daily Life Activities by presenting an overview of the publications on HAR based on ML and inertial, physiological and environmental sensors.
77, TITLE: Cosmetic-Aware Makeup Cleanser
http://arxiv.org/abs/2004.09147
AUTHORS: Yi Li ; Huaibo Huang ; Junchi Yu ; Ran He ; Tieniu Tan
COMMENTS: Accepted by BTAS 2019 (the 10th IEEE International Conference on Biometrics: Theory, Applications and Systems)
HIGHLIGHT: With the rapid development of deep generative models, this paper proposes a semanticaware makeup cleanser (SAMC) to remove facial makeup under different poses and expressions and achieve verification via generation.
78, TITLE: Spatial Action Maps for Mobile Manipulation
http://arxiv.org/abs/2004.09141
AUTHORS: Jimmy Wu ; Xingyuan Sun ; Andy Zeng ; Shuran Song ; Johnny Lee ; Szymon Rusinkiewicz ; Thomas Funkhouser
COMMENTS: Project webpage: https://spatial-action-maps.cs.princeton.edu
HIGHLIGHT: This paper proposes a new action representation for learning to perform complex mobile manipulation tasks.
79, TITLE: Recurrent Convolutional Neural Networks help to predict location of Earthquakes
http://arxiv.org/abs/2004.09140
AUTHORS: Roman Kail ; Alexey Zaytsev ; Evgeny Burnaev
HIGHLIGHT: Our data-based classification model aims to predict if an earthquake with the magnitude above a threshold takes place at a given area of size $10 \times 10$ kilometers in $30$-$180$ days from a given moment.
80, TITLE: Transformer Reasoning Network for Image-Text Matching and Retrieval
http://arxiv.org/abs/2004.09144
AUTHORS: Nicola Messina ; Fabrizio Falchi ; Andrea Esuli ; Giuseppe Amato
COMMENTS: Submitted to ICPR 2020
HIGHLIGHT: In this work, we consider the problem of accurate image-text matching for the task of multi-modal large-scale information retrieval.
81, TITLE: Variational Inference for Learning Representations of Natural Language Edits
http://arxiv.org/abs/2004.09143
AUTHORS: Edison Marrese-Taylor ; Machel Reid ; Yutaka Matsuo
HIGHLIGHT: With this in mind, we propose a novel approach that employs variational inference to learn a continuous latent space of vector representations to capture the underlying semantic information with regard to the document editing process.
82, TITLE: Joint Distribution and Transitions of Pain and Activity in Critically Ill Patients
http://arxiv.org/abs/2004.09134
AUTHORS: Florenc Demrozi ; Graziano Pravadelli ; Patrick J Tighe ; Azra Bihorac ; Parisa Rashidi
COMMENTS: Accepted for Publication in EMBC 2020
HIGHLIGHT: In this study, we collected activity intensity data from 57 ICU patients, using the Actigraph GT3X device.
83, TITLE: Pose Manipulation with Identity Preservation
http://arxiv.org/abs/2004.09169
AUTHORS: Andrei-Timotei Ardelean ; Lucian Mircea Sasu
COMMENTS: 9 pages, journal article
HIGHLIGHT: This paper describes a new model which generates images in novel poses e.g. by altering face expression and orientation, from just a few instances of a human subject.
84, TITLE: VOC-ReID: Vehicle Re-identification based on Vehicle-Orientation-Camera
http://arxiv.org/abs/2004.09164
AUTHORS: Xiangyu Zhu ; Zhenbo Luo ; Pei Fu ; Xiang Ji
HIGHLIGHT: In this work, we focus on the failure cases caused by similar background and shape.
85, TITLE: CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT
http://arxiv.org/abs/2004.09167
AUTHORS: Akshay Smit ; Saahil Jain ; Pranav Rajpurkar ; Anuj Pareek ; Andrew Y. Ng ; Matthew P. Lungren
HIGHLIGHT: In this work, we investigate BERT-based approaches to medical image report labeling that exploit both the scale of available rule-based systems and the quality of expert annotations.
86, TITLE: Invariant Integration in Deep Convolutional Feature Space
http://arxiv.org/abs/2004.09166
AUTHORS: Matthias Rath ; Alexandru Paul Condurache
COMMENTS: Accepted at ESANN 2020 (European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning)
HIGHLIGHT: In this contribution, we show how to incorporate prior knowledge to a deep neural network architecture in a principled manner.
87, TITLE: On the Encoder-Decoder Incompatibility in Variational Text Modeling and Beyond
http://arxiv.org/abs/2004.09189
AUTHORS: Chen Wu ; Prince Zizhuang Wang ; William Yang Wang
COMMENTS: Accepted to ACL 2020
HIGHLIGHT: To this end, we propose Coupled-VAE, which couples a VAE model with a deterministic autoencoder with the same structure and improves the encoder and decoder parameterizations via encoder weight sharing and decoder signal matching.
88, TITLE: Evolving Diverse Sets of Tours for the Travelling Salesperson Problem
http://arxiv.org/abs/2004.09188
AUTHORS: Anh Viet Do ; Jakob Bossek ; Aneta Neumann ; Frank Neumann
COMMENTS: 11 pages, 3 tables, 3 figures, to be published in GECCO '20
HIGHLIGHT: With this paper, we contribute to this area of research by examining evolutionary diversity optimisation approaches for the classical Traveling Salesperson Problem (TSP).
89, TITLE: GraN: An Efficient Gradient-Norm Based Detector for Adversarial and Misclassified Examples
http://arxiv.org/abs/2004.09179
AUTHORS: Julia Lust ; Alexandru Paul Condurache
COMMENTS: Accepted at ESANN 2020 (European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning)
HIGHLIGHT: This paper therefore proposes GraN, a time- and parameter-efficient method that is easily adaptable to any DNN.
90, TITLE: Landmark Detection and 3D Face Reconstruction for Caricature using a Nonlinear Parametric Model
http://arxiv.org/abs/2004.09190
AUTHORS: Juyong Zhang ; Hongrui Cai ; Yudong Guo ; Zhuang Peng
COMMENTS: The code is available at https://github.com/Juyong/CaricatureFace
HIGHLIGHT: Based on the constructed dataset and the nonlinear parametric model, we propose a neural network based method to regress the 3D face shape and orientation from the input 2D caricature image. To this end, we first build a dataset with various styles of 2D caricatures and their corresponding 3D shapes, and then build a parametric model on vertex based deformation space for 3D caricature face.
91, TITLE: LSM: Learning Subspace Minimization for Low-level Vision
http://arxiv.org/abs/2004.09197
AUTHORS: Chengzhou Tang ; Lu Yuan ; Ping Tan
COMMENTS: To be presented at CVPR2020
HIGHLIGHT: We study the energy minimization problem in low-level vision tasks from a novel perspective.
92, TITLE: Generative Feature Replay For Class-Incremental Learning
http://arxiv.org/abs/2004.09199
AUTHORS: Xialei Liu ; Chenshen Wu ; Mikel Menta ; Luis Herranz ; Bogdan Raducanu ; Andrew D. Bagdanov ; Shangling Jui ; Joost van de Weijer
COMMENTS: Accepted at CVPR2020: Workshop on Continual Learning in Computer Vision
HIGHLIGHT: We propose a solution to the imbalance problem based on generative feature replay which does not require any exemplars.
93, TITLE: CausalVAE: Structured Causal Disentanglement in Variational Autoencoder
http://arxiv.org/abs/2004.08697
AUTHORS: Mengyue Yang ; Furui Liu ; Zhitang Chen ; Xinwei Shen ; Jianye Hao ; Jun Wang
HIGHLIGHT: We thus propose a new VAE based framework named CausalVAE, which includes causal layers to transform independent factors into causal factors that correspond to causally related concepts in data.
94, TITLE: A fast semi-automatic method for classification and counting the number and types of blood cells in an image
http://arxiv.org/abs/2004.08690
AUTHORS: Hamed Sadeghi ; Shahram Shirani ; David W. Capson
HIGHLIGHT: A novel and fast semi-automatic method for segmentation, locating and counting blood cells in an image is proposed.
95, TITLE: Syn-QG: Syntactic and Shallow Semantic Rules for Question Generation
http://arxiv.org/abs/2004.08694
AUTHORS: Kaustubh D. Dhole ; Christopher D. Manning
COMMENTS: In Proceedings of the 2020 Annual Conference of the Association for Computational Linguistics (ACL 2020)
HIGHLIGHT: We implement this observation by developing Syn-QG, a set of transparent syntactic rules leveraging universal dependencies, shallow semantic parsing, lexical resources, and custom rules which transform declarative sentences into question-answer pairs.
96, TITLE: Attention, please: A Spatio-temporal Transformer for 3D Human Motion Prediction
http://arxiv.org/abs/2004.08692
AUTHORS: Emre Aksan ; Peng Cao ; Manuel Kaufmann ; Otmar Hilliges
HIGHLIGHT: In this paper, we propose a novel architecture for the task of 3D human motion modelling.
97, TITLE: A Large Dataset of Historical Japanese Documents with Complex Layouts
http://arxiv.org/abs/2004.08686
AUTHORS: Zejiang Shen ; Kaixuan Zhang ; Melissa Dell
COMMENTS: 8 pages, 8 figures, accepted at CVPR2020 Workshop on Text and Documents in the Deep Learning Era
HIGHLIGHT: To this end, we present HJDataset, a Large Dataset of Historical Japanese Documents with Complex Layouts.
98, TITLE: Compositionality and Generalization in Emergent Languages
http://arxiv.org/abs/2004.09124
AUTHORS: Rahma Chaabouni ; Eugene Kharitonov ; Diane Bouchacourt ; Emmanuel Dupoux ; Marco Baroni
HIGHLIGHT: In this paper, we study whether the language emerging in deep multi-agent simulations possesses a similar ability to refer to novel primitive combinations, and whether it accomplishes this feat by strategies akin to human-language compositionality.
99, TITLE: SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings
http://arxiv.org/abs/2004.08728
AUTHORS: Masoud Jalili Sabet ; Philipp Dufter ; Hinrich Schütze
HIGHLIGHT: We propose word alignment methods that require no parallel data.
100, TITLE: Pro-Russian Biases in Anti-Chinese Tweets about the Novel Coronavirus
http://arxiv.org/abs/2004.08726
AUTHORS: Autumn Toney ; Akshat Pandey ; Wei Guo ; David Broniatowski ; Aylin Caliskan
COMMENTS: 6 pages, 2 tables
HIGHLIGHT: We introduce extensions of the Word Embedding Association Test from Caliskan et.
101, TITLE: Enhancing Pharmacovigilance with Drug Reviews and Social Media
http://arxiv.org/abs/2004.08731
AUTHORS: Brent Biseda ; Katie Mo
HIGHLIGHT: This paper explores whether the use of drug reviews and social media could be leveraged as potential alternative sources for pharmacovigilance of adverse drug reactions (ADRs).
102, TITLE: Fewer colors for perfect simulation of proper colorings
http://arxiv.org/abs/2004.08716
AUTHORS: Mark Huber
COMMENTS: 13 pages
HIGHLIGHT: Here a new randomized algorithm is presented based upon the randomness recycler protocol introduced by the author and Fill at FOCS 2000.
103, TITLE: Device Authentication Codes based on RF Fingerprinting using Deep Learning
http://arxiv.org/abs/2004.08742
AUTHORS: Joshua Bassey ; Xiangfang Li ; Lijun Qian
HIGHLIGHT: In this paper, we propose Device Authentication Code (DAC), a novel method for authenticating IoT devices with wireless interface by exploiting their radio frequency (RF) signatures.
104, TITLE: An end-to-end CNN framework for polarimetric vision tasks based on polarization-parameter-constructing network
http://arxiv.org/abs/2004.08740
AUTHORS: Yong Wang ; Qi Liu ; Hongyu Zu ; Xiao Liu ; Ruichao Xie ; Feng Wang
HIGHLIGHT: In this paper, a novel end-to-end CNN framework for polarization vision tasks is proposed, which enables the networks to take full advantage of polarimetric images.
105, TITLE: Tensor completion using enhanced multiple modes low-rank prior and total variation
http://arxiv.org/abs/2004.08747
AUTHORS: Haijin Zeng ; Xiaozhen Xie ; Jifeng Ning
HIGHLIGHT: In this paper, we propose a novel model to recover a low-rank tensor by simultaneously performing double nuclear norm regularized low-rank matrix factorizations to the all-mode matricizations of the underlying tensor.
106, TITLE: Learning to Evaluate Perception Models Using Planner-Centric Metrics
http://arxiv.org/abs/2004.08745
AUTHORS: Jonah Philion ; Amlan Kar ; Sanja Fidler
COMMENTS: CVPR 2020 poster
HIGHLIGHT: In this paper, we propose a principled metric for 3D object detection specifically for the task of self-driving.
107, TITLE: Are we pretraining it right? Digging deeper into visio-linguistic pretraining
http://arxiv.org/abs/2004.08744
AUTHORS: Amanpreet Singh ; Vedanuj Goswami ; Devi Parikh
COMMENTS: 23 pages, 6 figures. First two authors contributed equally. More info at https://github.com/facebookresearch/pythia
HIGHLIGHT: In this work, we question some of the default choices made in literature.
108, TITLE: Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization
http://arxiv.org/abs/2004.08763
AUTHORS: Homanga Bharadhwaj ; Kevin Xie ; Florian Shkurti
COMMENTS: L4DC 2020; Accepted for presentation in the 2nd Annual Conference on Learning for Dynamics and Control
HIGHLIGHT: We propose a method to solve this planning problem by interleaving CEM and gradient descent steps in optimizing the action sequence.
109, TITLE: TriGAN: Image-to-Image Translation for Multi-Source Domain Adaptation
http://arxiv.org/abs/2004.08769
AUTHORS: Subhankar Roy ; Aliaksandr Siarohin ; Enver Sangineto ; Nicu Sebe ; Elisa Ricci
HIGHLIGHT: In this paper we propose the first approach for Multi-Source Domain Adaptation (MSDA) based on Generative Adversarial Networks.
110, TITLE: Lightweight Mask R-CNN for Long-Range Wireless Power Transfer Systems
http://arxiv.org/abs/2004.08761
AUTHORS: Hao Li ; Aozhou Wu ; Wen Fang ; Qingqing Zhang ; Mingqing Liu ; Qingwen Liu ; Wei Chen
HIGHLIGHT: Thus, we propose a machine learning detection approach which provides a lighter and faster model based on traditional Mask R-CNN.
111, TITLE: Knowledge-graph based Proactive Dialogue Generation with Improved Meta-Learning
http://arxiv.org/abs/2004.08798
AUTHORS: Hongcai Xu ; Junpeng Bao ; Junqing Wang
COMMENTS: 15 pages,7 figures
HIGHLIGHT: To overcome this drawback, we propose a knowledge graph based proactive dialogue generation model (KgDg) with three components, improved model-agnostic meta-learning algorithm (MAML), knowledge selection in knowledge triplets embedding, and knowledge aware proactive response generator.
112, TITLE: When Residual Learning Meets Dense Aggregation: Rethinking the Aggregation of Deep Neural Networks
http://arxiv.org/abs/2004.08796
AUTHORS: Zhiyu Zhu ; Zhen-Peng Bian ; Junhui Hou ; Yi Wang ; Lap-Pui Chau
HIGHLIGHT: To handle these challenging issues, we propose Micro-Dense Nets, a novel architecture with global residual learning and local micro-dense aggregations.
113, TITLE: Extractive Summarization as Text Matching
http://arxiv.org/abs/2004.08795
AUTHORS: Ming Zhong ; Pengfei Liu ; Yiran Chen ; Danqing Wang ; Xipeng Qiu ; Xuanjing Huang
COMMENTS: Accepted by ACL 2020
HIGHLIGHT: This paper creates a paradigm shift with regard to the way we build neural extractive summarization systems.
114, TITLE: On the Unusual Effectiveness of Type-aware Mutations for Testing SMT Solvers
http://arxiv.org/abs/2004.08799
AUTHORS: Dominik Winterer ; Chengyu Zhang ; Zhendong Su
HIGHLIGHT: We propose type-aware mutation testing, a simple and effective approach for testing SMT solvers.The key idea is to mutate operators of same type in SMT formulas to generate type-correct mutant formulas.
115, TITLE: UNet 3+: A Full-Scale Connected UNet for Medical Image Segmentation
http://arxiv.org/abs/2004.08790
AUTHORS: Huimin Huang ; Lanfen Lin ; Ruofeng Tong ; Hongjie Hu ; Qiaowei Zhang ; Yutaro Iwamoto ; Xianhua Han ; Yen-Wei Chen ; Jian Wu
HIGHLIGHT: In this paper, we propose a novel UNet 3+, which takes advantage of full-scale skip connections and deep supervisions.
116, TITLE: Pattern Learning for Detecting Defect Reports and Improvement Requests in App Reviews
http://arxiv.org/abs/2004.08793
AUTHORS: Gino V. H. Mangnoesing ; Maria Mihaela Trusca ; Flavius Frasincar
COMMENTS: Accepted for publication in the 25th International Conference on Natural Language & Information Systems (NLDB 2020), DFKI Saarbr\"ucken Germany, June 24-26 2020
HIGHLIGHT: In this study, we follow novel approaches that target this absence of actionable insights by classifying reviews as defect reports and requests for improvement.
117, TITLE: AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-identification
http://arxiv.org/abs/2004.08787
AUTHORS: Yunpeng Zhai ; Shijian Lu ; Qixiang Ye ; Xuebo Shan ; Jie Chen ; Rongrong Ji ; Yonghong Tian
HIGHLIGHT: This paper presents a novel augmented discriminative clustering (AD-Cluster) technique that estimates and augments person clusters in target domains and enforces the discrimination ability of re-ID models with the augmented clusters.
118, TITLE: BanFakeNews: A Dataset for Detecting Fake News in Bangla
http://arxiv.org/abs/2004.08789
AUTHORS: Md Zobaer Hossain ; Md Ashraful Rahman ; Md Saiful Islam ; Sudipta Kar
COMMENTS: LREC 2020
HIGHLIGHT: In this work, we propose an annotated dataset of ~50K news that can be used for building automated fake news detection systems for a low resource language like Bangla.
119, TITLE: Decision Problems in Information Theory
http://arxiv.org/abs/2004.08783
AUTHORS: Mahmoud Abo Khamis ; Phokion G. Kolaitis ; Hung Q. Ngo ; Dan Suciu
HIGHLIGHT: Here, we initiate an investigation of decision problems about constraints on entropies by placing several different such problems into levels of the arithmetical hierarchy.
120, TITLE: Deep Learning Improves Contrast in Low-Fluence Photoacoustic Imaging
http://arxiv.org/abs/2004.08782
AUTHORS: Ali Hariri ; Kamran Alipour ; Yash Mantri ; Jurgen P. Schulze ; Jesse V. Jokerst
COMMENTS: submitted to Biomedical Optics Express journal
HIGHLIGHT: Here, we propose a denoising method using a multi-level wavelet-convolutional neural network to map low fluence illumination source images to its corresponding high fluence excitation map.
121, TITLE: Efficient Synthesis of Compact Deep Neural Networks
http://arxiv.org/abs/2004.08704
AUTHORS: Wenhan Xia ; Hongxu Yin ; Niraj K. Jha
HIGHLIGHT: In this paper, we review major approaches for automatically synthesizing compact, yet accurate, DNN/LSTM models suitable for real-world applications.
122, TITLE: Adaptive Attention Span in Computer Vision
http://arxiv.org/abs/2004.08708
AUTHORS: Jerrod Parker ; Shakti Kumar ; Joe Roussy
COMMENTS: 6 pages with 1 Algorithm, 4 figures, 1 Table and 1 page appendix
HIGHLIGHT: In this work we propose a novel method for learning the local self-attention kernel size.
123, TITLE: Incorporating External Knowledge through Pre-training for Natural Language to Code Generation
http://arxiv.org/abs/2004.09015
AUTHORS: Frank F. Xu ; Zhengbao Jiang ; Pengcheng Yin ; Bogdan Vasilescu ; Graham Neubig
COMMENTS: Accepted by ACL 2020
HIGHLIGHT: Motivated by the intuition that developers usually retrieve resources on the web when writing code, we explore the effectiveness of incorporating two varieties of external knowledge into NL-to-code generation: automatically mined NL-code pairs from the online programming QA forum StackOverflow and programming language API documentation.
124, TITLE: DeepSDF x Sim(3): Extending DeepSDF for automatic 3D shape retrieval and similarity transform estimation
http://arxiv.org/abs/2004.09048
AUTHORS: Oladapo Afolabi ; Allen Yang ; Shankar S. Sastry
COMMENTS: 10 pages
HIGHLIGHT: In this work, we present a formulation that overcomes this issue by jointly estimating the shape and similarity transformation parameters.
125, TITLE: Learning as Reinforcement: Applying Principles of Neuroscience for More General Reinforcement Learning Agents
http://arxiv.org/abs/2004.09043
AUTHORS: Eric Zelikman ; William Yin ; Kenneth Wang
COMMENTS: Originally completed as part of Stanford's CS 234 "Reinforcement Learning." Presented at the California Cognitive Science Conference 2019
HIGHLIGHT: A significant challenge in developing AI that can generalize well is designing agents that learn about their world without being told what to learn, and apply that learning to challenges with sparse rewards.
126, TITLE: Taming the Expressiveness and Programmability of Graph Analytical Queries
http://arxiv.org/abs/2004.09045
AUTHORS: Lu Qin ; Longbin Lai ; Kongzhang Hao ; Zhongxin Zhou ; Yiwei Zhao ; Yuxing Han ; Xuemin Lin ; Zhengping Qian ; Jingren Zhou
COMMENTS: 22 pages
HIGHLIGHT: We focus on analytical queries in this paper.
127, TITLE: Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense
http://arxiv.org/abs/2004.09044
AUTHORS: Yixin Zhu ; Tao Gao ; Lifeng Fan ; Siyuan Huang ; Mark Edmonds ; Hangxin Liu ; Feng Gao ; Chi Zhang ; Siyuan Qi ; Ying Nian Wu ; Joshua B. Tenenbaum ; Song-Chun Zhu
COMMENTS: For high quality figures, please refer to http://wellyzhang.github.io/attach/dark.pdf
HIGHLIGHT: In this paper, we call for a shift that flips this paradigm upside down. Specifically, we propose a "small data for big tasks" paradigm, wherein a single artificial intelligence (AI) system is challenged to develop "common sense", enabling it to solve a wide range of tasks with little training data.
128, TITLE: X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions
http://arxiv.org/abs/2004.09039
AUTHORS: Michael Danielczuk ; Anelia Angelova ; Vincent Vanhoucke ; Ken Goldberg
COMMENTS: 8 pages, 6 figures
HIGHLIGHT: For mechanical search, we introduce X-Ray, an algorithm based on learned occupancy distributions.
129, TITLE: Gated Convolutional Bidirectional Attention-based Model for Off-topic Spoken Response Detection
http://arxiv.org/abs/2004.09036
AUTHORS: Yefei Zha ; Ruobing Li ; Hui Lin
COMMENTS: 10 pages, 4 figures and 5 tables
HIGHLIGHT: In this paper, we propose a novel approach for off-topic spoken response detection with high off-topic recall on both seen and unseen prompts.
130, TITLE: Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision
http://arxiv.org/abs/2004.09034
AUTHORS: Damien Teney ; Ehsan Abbasnedjad ; Anton van den Hengel
HIGHLIGHT: We propose an auxiliary training objective that improves the generalization capabilities of neural networks by leveraging an overlooked supervisory signal found in existing datasets.
131, TITLE: OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer
http://arxiv.org/abs/2004.09033
AUTHORS: Xiaoxu Li ; Dongliang Chang ; Zhanyu Ma ; Zheng-Hua Tan ; Jing-Hao Xue ; Jie Cao ; Jingyi Yu ; Jun Guo
COMMENTS: TIP 2020. Code available at https://github.com/dongliangchang/OSLNet
HIGHLIGHT: To this end, this paper aims to find a subspace of neural networks that can facilitate a large decision margin.
132, TITLE: Semantic Correspondence via 2D-3D-2D Cycle
http://arxiv.org/abs/2004.09061
AUTHORS: Yang You ; Chengkun Li ; Yujing Lou ; Zhoujun Cheng ; Lizhuang Ma ; Cewu Lu ; Weiming Wang
HIGHLIGHT: In this paper, we propose a new method on predicting semantic correspondences by leveraging it to 3D domain and then project corresponding 3D models back to 2D domain, with their semantic labels.
133, TITLE: Adaptation of a Lexical Organization for Social Engineering Detection and Response Generation
http://arxiv.org/abs/2004.09050
AUTHORS: Archna Bhatia ; Adam Dalton ; Brodie Mather ; Sashank Santhanam ; Samira Shaikh ; Alan Zemel ; Tomek Strzalkowski ; Bonnie J. Dorr
COMMENTS: Accepted at STOC
HIGHLIGHT: We present a paradigm for extensible lexicon development based on Lexical Conceptual Structure to support social engineering detection and response generation.
134, TITLE: Airborne LiDAR Point Cloud Classification with Graph Attention Convolution Neural Network
http://arxiv.org/abs/2004.09057
AUTHORS: Congcong Wen ; Xiang Li ; Xiaojing Yao ; Ling Peng ; Tianhe Chi
HIGHLIGHT: In this paper, we present a graph attention convolution neural network (GACNN) that can be directly applied to the classification of unstructured 3D point clouds obtained by airborne LiDAR.
135, TITLE: Colonoscope tracking method based on shape estimation network
http://arxiv.org/abs/2004.09056
AUTHORS: Masahiro Oda ; Holger R. Roth ; Takayuki Kitasaka ; Kazuhiro Furukawa ; Ryoji Miyahara ; Yoshiki Hirooka ; Nassir Navab ; Kensaku Mori
COMMENTS: Accepted paper as an oral presentation at SPIE Medical Imaging 2019, San Diego, CA, USA
HIGHLIGHT: This paper presents a colonoscope tracking method utilizing a colon shape estimation method.
136, TITLE: Deep Exposure Fusion with Deghosting via Homography Estimation and Attention Learning
http://arxiv.org/abs/2004.09089
AUTHORS: Sheng-Yeh Chen ; Yung-Yu Chuang
COMMENTS: ICASSP 2020
HIGHLIGHT: This paper proposes a deep network for exposure fusion.
137, TITLE: CatSIM: A Categorical Image Similarity Metric
http://arxiv.org/abs/2004.09073
AUTHORS: Geoffrey Z. Thompson ; Ranjan Maitra
COMMENTS: 17 pages, 16 figures, 10 tables
HIGHLIGHT: We introduce CatSIM, a new similarity metric for binary and multinary two- and three-dimensional images and volumes.
138, TITLE: Automatic Grading of Knee Osteoarthritis on the Kellgren-Lawrence Scale from Radiographs Using Convolutional Neural Networks
http://arxiv.org/abs/2004.08572
AUTHORS: Sudeep Kondal ; Viraj Kulkarni ; Ashrika Gaikwad ; Amit Kharat ; Aniruddha Pant
COMMENTS: 5 pages, 3 figures, 5 tables
HIGHLIGHT: In this paper, we propose a novel method using convolutional neural networks to automatically grade knee radiographs on the KL scale.
139, TITLE: On the Synergies between Machine Learning and Stereo: a Survey
http://arxiv.org/abs/2004.08566
AUTHORS: Matteo Poggi ; Fabio Tosi ; Konstantinos Batsos ; Philippos Mordohai ; Stefano Mattoccia
COMMENTS: Paper version of our CVPR 2019 tutorial: "Learning-based depth estimation from stereo and monocular images: successes, limitations and future challenges" (https://sites.google.com/view/cvpr-2019-depth-from-image/home)
HIGHLIGHT: In this paper, we review recent research in the field of learning-based depth estimation from images highlighting the synergies, the successes achieved so far and the open challenges the community is going to face in the immediate future.
140, TITLE: Characters as Graphs: Recognizing Online Handwritten Chinese Characters via Spatial Graph Convolutional Network
http://arxiv.org/abs/2004.09412
AUTHORS: Ji Gan ; Weiqiang Wang ; Ke Lu
COMMENTS: 8 pages, 4 figures. A full version of this paper has been submitted to an international journal
HIGHLIGHT: Instead of viewing characters as either static images or temporal trajectories, here we propose to represent characters as geometric graphs, retaining both spatial structures and temporal orders.
141, TITLE: Shape-Oriented Convolution Neural Network for Point Cloud Analysis
http://arxiv.org/abs/2004.09411
AUTHORS: Chaoyi Zhang ; Yang Song ; Lina Yao ; Weidong Cai
COMMENTS: 8 pages, 6 figures, AAAI2020
HIGHLIGHT: Shape-Oriented Convolution Neural Network for Point Cloud Analysis
142, TITLE: VOWEL: A Local Online Learning Rule for Recurrent Networks of Probabilistic Spiking Winner-Take-All Circuits
http://arxiv.org/abs/2004.09416
AUTHORS: Hyeryung Jang ; Nicolas Skatchkovsky ; Osvaldo Simeone
COMMENTS: 14 pages, submitted for possible conference publication
HIGHLIGHT: In this paper, we develop a variational online local training rule for WTA-SNNs, referred to as VOWEL, that leverages only local pre- and post-synaptic information for visible circuits, and an additional common reward signal for hidden circuits.
143, TITLE: PHINC: A Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation
http://arxiv.org/abs/2004.09447
AUTHORS: Vivek Srivastava ; Mayank Singh
HIGHLIGHT: This paper presents a parallel corpus of the 13,738 code-mixed English-Hindi sentences and their corresponding translation in English. We are releasing the parallel corpus to facilitate future research opportunities in code-mixed machine translation.
144, TITLE: Three Modern Roles for Logic in AI
http://arxiv.org/abs/2004.08599
AUTHORS: Adnan Darwiche
COMMENTS: To be published in PODS 2020
HIGHLIGHT: We consider three modern roles for logic in artificial intelligence, which are based on the theory of tractable Boolean circuits: (1) logic as a basis for computation, (2) logic for learning from a combination of data and knowledge, and (3) logic for reasoning about the behavior of machine learning systems.
145, TITLE: A Spatially Constrained Deep Convolutional Neural Network for Nerve Fiber Segmentation in Corneal Confocal Microscopic Images using Inaccurate Annotations
http://arxiv.org/abs/2004.09443
AUTHORS: Ning Zhang ; Susan Francis ; Rayaz Malik ; Xin Chen
COMMENTS: 4 pages, accepted for publication at IEEE International Symposium on Biomedical Imaging (ISBI) 2020
HIGHLIGHT: In this paper, we propose a spatially constrained deep convolutional neural network (DCNN) to achieve smooth and robust image segmentation using inaccurately annotated labels for training.
146, TITLE: DAPnet: A double self-attention convolutional network for segmentation of point clouds
http://arxiv.org/abs/2004.08596
AUTHORS: Li Chen ; Zewei Xu ; Yongjian Fu ; Haozhe Huang ; Shaowen Wang ; Haifeng Li
COMMENTS: 13 pages, 9 figures
HIGHLIGHT: In this study, we propose a double self-attention convolutional network, called DAPnet, by combining geometric and contextual features to generate better segmentation results.
147, TITLE: Dynamic Feature Integration for Simultaneous Detection of Salient Object, Edge and Skeleton
http://arxiv.org/abs/2004.08595
AUTHORS: Jiang-Jiang Liu ; Qibin Hou ; Ming-Ming Cheng
HIGHLIGHT: In this paper, we solve three low-level pixel-wise vision problems, including salient object segmentation, edge detection, and skeleton extraction, within a unified framework.
148, TITLE: Improving correlation method with convolutional neural networks
http://arxiv.org/abs/2004.09430
AUTHORS: Dmitriy Goncharov ; Rostislav Starikov
COMMENTS: 8 pages, 3 figures, 2 tables, 1 formula
HIGHLIGHT: We present a convolutional neural network for the classification of correlation responses obtained by correlation filters.
149, TITLE: BiFNet: Bidirectional Fusion Network for Road Segmentation
http://arxiv.org/abs/2004.08582
AUTHORS: Haoran Li ; Yaran Chen ; Qichao Zhang ; Dongbin Zhao
HIGHLIGHT: Considering the bird's eye views(BEV) of the LiDAR remains the space structure in horizontal plane, this paper proposes a bidirectional fusion network(BiFNet) to fuse the image and BEV of the point cloud.
150, TITLE: StereoSet: Measuring stereotypical bias in pretrained language models
http://arxiv.org/abs/2004.09456
AUTHORS: Moin Nadeem ; Anna Bethke ; Siva Reddy
COMMENTS: 9 pages, 6 tables, and 3 figures
HIGHLIGHT: We present StereoSet, a large-scale natural dataset in English to measure stereotypical biases in four domains: gender, profession, race, and religion.
151, TITLE: The Quantum Approximate Optimization Algorithm Needs to See the Whole Graph: A Typical Case
http://arxiv.org/abs/2004.09002
AUTHORS: Edward Farhi ; David Gamarnik ; Sam Gutmann
COMMENTS: 19 pages, no figures
HIGHLIGHT: The Quantum Approximate Optimization Algorithm can naturally be applied to combinatorial search problems on graphs.
152, TITLE: Bringing Old Photos Back to Life
http://arxiv.org/abs/2004.09484
AUTHORS: Ziyu Wan ; Bo Zhang ; Dongdong Chen ; Pan Zhang ; Dong Chen ; Jing Liao ; Fang Wen
COMMENTS: CVPR 2020 Oral, project website: http://raywzy.com/Old_Photo/
HIGHLIGHT: We propose to restore old photos that suffer from severe degradation through a deep learning approach.
153, TITLE: Music Gesture for Visual Sound Separation
http://arxiv.org/abs/2004.09476
AUTHORS: Chuang Gan ; Deng Huang ; Hang Zhao ; Joshua B. Tenenbaum ; Antonio Torralba
COMMENTS: CVPR 2020. Project page: http://music-gesture.csail.mit.edu
HIGHLIGHT: To address this, we propose "Music Gesture," a keypoint-based structured representation to explicitly model the body and finger movements of musicians when they perform music.
154, TITLE: Attention Routing: track-assignment detailed routing using attention-based reinforcement learning
http://arxiv.org/abs/2004.09473
AUTHORS: Haiguang Liao ; Qingyi Dong ; Xuliang Dong ; Wentai Zhang ; Wangyang Zhang ; Weiyi Qi ; Elias Fallon ; Levent Burak Kara
HIGHLIGHT: In this work, we propose a new router: attention router, which is the first attempt to solve the track-assignment detailed routing problem using reinforcement learning.
155, TITLE: Accumulator Bet Selection Through Stochastic Diffusion Search
http://arxiv.org/abs/2004.08607
AUTHORS: Nassim Dehouche
HIGHLIGHT: We address this relatively under-studied combinatorial aspect of sports betting, and propose a binary optimization model for the problem of selecting the most promising combinations of matches, in terms of their total potential payout and probability of a win, to form an accumulator bet.
156, TITLE: Feathers dataset for Fine-Grained Visual Categorization
http://arxiv.org/abs/2004.08606
AUTHORS: Alina Belko ; Konstantin Dobratulin ; Andrey Kuznetsov
COMMENTS: 6 pages, 6 figures, 3 tables
HIGHLIGHT: This paper introduces a novel dataset FeatherV1, containing 28,272 images of feathers categorized by 595 bird species.
157, TITLE: Underwater image enhancement with Image Colorfulness Measure
http://arxiv.org/abs/2004.08609
AUTHORS: Hui Li ; Xi Yang ; ZhenMing Li ; TianLun Zhang
HIGHLIGHT: To improve the visual quality of underwater images, we proposed a novel enhancement model, which is a trainable end-to-end neural model.
158, TITLE: Halluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships
http://arxiv.org/abs/2004.08614
AUTHORS: Kuldeep Kulkarni ; Tejas Gokhale ; Rajhans Singh ; Pavan Turaga ; Aswin Sankaranarayanan
COMMENTS: Image synthesis, GAN, Scene completion, Label maps
HIGHLIGHT: We address the new problem of complex scene completion from sparse label maps.
159, TITLE: Time Adaptive Reinforcement Learning
http://arxiv.org/abs/2004.08600
AUTHORS: Chris Reinke
COMMENTS: ICLR 2020 Workshop: Beyond Tabula Rasa in Reinforcement Learning
HIGHLIGHT: We define such problems as Time Adaptive Markov Decision Processes and introduce two model-free, value-based algorithms: the Independent Gamma-Ensemble and the n-Step Ensemble.
160, TITLE: Single-step Adversarial training with Dropout Scheduling
http://arxiv.org/abs/2004.08628
AUTHORS: Vivek B. S. ; R. Venkatesh Babu
COMMENTS: CVPR 2020
HIGHLIGHT: In this work, (i) we show that models trained using single-step adversarial training method learn to prevent the generation of single-step adversaries, and this is due to over-fitting of the model during the initial stages of training, and (ii) to mitigate this effect, we propose a single-step adversarial training method with dropout scheduling.
161, TITLE: Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module
http://arxiv.org/abs/2004.08656
AUTHORS: Yanlu Wei ; Renshuai Tao ; Zhangjie Wu ; Yuqing Ma ; Libo Zhang ; Xianglong Liu
COMMENTS: 10 pages, 8 figures; for data and code, see https://github.com/OPIXray-author/OPIXray
HIGHLIGHT: We evaluate our method on the OPIXray dataset and compare it to several baselines, including popular methods for detection and attention mechanisms. Second, we present a well-directed dataset, of which the images in the dataset are annotated manually by the professional inspectors from Beijing Capital International Airport.
162, TITLE: Motion Segmentation using Frequency Domain Transformer Networks
http://arxiv.org/abs/2004.08638
AUTHORS: Hafez Farazi ; Sven Behnke
COMMENTS: 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), Bruges, Belgium, 2020
HIGHLIGHT: To address this task, we propose a novel end-to-end learnable architecture that predicts the next frame by modeling foreground and background separately while simultaneously estimating and predicting the foreground motion using Frequency Domain Transformer Networks.
163, TITLE: A Deep Learning Approach to Object Affordance Segmentation
http://arxiv.org/abs/2004.08644
AUTHORS: Spyridon Thermos ; Petros Daras ; Gerasimos Potamianos
COMMENTS: 5 pages, 4 figures, ICASSP 2020
HIGHLIGHT: In this paper, we propose a novel approach that exploits the spatio-temporal nature of human-object interaction for affordance segmentation. For evaluation purposes, we introduce the SOR3D-AFF corpus, which consists of human-object interaction sequences and supports 9 types of affordances in terms of pixel-wise annotation, covering typical manipulations of tool-like objects.
164, TITLE: Modeling Survival in model-based Reinforcement Learning
http://arxiv.org/abs/2004.08648
AUTHORS: Saeed Moazami ; Peggy Doerschuk
HIGHLIGHT: To that end, a substitute model for the reward function approximator is introduced that learns to avoid terminal states rather than to maximize accumulated rewards from safe states.
165, TITLE: Macro-Action-Based Deep Multi-Agent Reinforcement Learning
http://arxiv.org/abs/2004.08646
AUTHORS: Yuchen Xiao ; Joshua Hoffman ; Christopher Amato
HIGHLIGHT: This paper proposes two Deep Q-Network (DQN) based methods for learning decentralized and centralized macro-action-value functions with novel macro-action trajectory replay buffers introduced for each case.
166, TITLE: A Hybrid Approach for Aspect-Based Sentiment Analysis Using Deep Contextual Word Embeddings and Hierarchical Attention
http://arxiv.org/abs/2004.08673
AUTHORS: Maria Mihaela Trusca ; Daan Wassenberg ; Flavius Frasincar ; Rommert Dekker
COMMENTS: Accepted for publication in the 20th International Conference on Web Engineering (ICWE 2020), Helsinki Finland, 9-12 June 2020
HIGHLIGHT: In this paper we extend the state-of-the-art Hybrid Approach for Aspect-Based Sentiment Analysis (HAABSA) method in two directions.
167, TITLE: iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots
http://arxiv.org/abs/2004.08672
AUTHORS: Shiqi Zhang ; Peter Stone
HIGHLIGHT: In this article, we present a novel algorithm, called iCORPP, to simultaneously estimate the current world state, reason about world dynamics, and construct task-oriented controllers.
168, TITLE: Dual Embedding Expansion for Vehicle Re-identification
http://arxiv.org/abs/2004.08665
AUTHORS: Clint Sebastian ; Raffaele Imbriaco ; Egor Bondarev ; Peter H. N. de With
HIGHLIGHT: We propose an efficient approach for combining the outputs of multiple models at various scales while exploiting tracklet and neighbor information, called dual embedding expansion (DEx).
169, TITLE: The $(1+(λ,λ))$ Genetic Algorithm for Permutations
http://arxiv.org/abs/2004.08664
AUTHORS: Anton Bassin ; Maxim Buzdalov
COMMENTS: This contribution was submitted to the GECCO 2020 workshop on permutation-based problems
HIGHLIGHT: We aim at improving this situation by proposing an adaptation of the $(1+(\lambda,\lambda))$ genetic algorithm to permutation-based problems.
170, TITLE: The State and Fate of Linguistic Diversity and Inclusion in the NLP World
http://arxiv.org/abs/2004.09095
AUTHORS: Pratik Joshi ; Sebastin Santy ; Amar Budhiraja ; Kalika Bali ; Monojit Choudhury
COMMENTS: Accepted at ACL 2020 (10 pages + 3 pages Appendix)
HIGHLIGHT: In this paper we look at the relation between the types of languages, resources, and their representation in NLP conferences to understand the trajectory that different languages have followed over time.
==========Updates to Previous Papers==========
1, TITLE: Towards personalized diagnosis of Glioblastoma in Fluid-attenuated inversion recovery (FLAIR) by topological interpretable machine learning
http://arxiv.org/abs/1912.08167
AUTHORS: Matteo Rucco ; Lorenzo Falsetti ; Giovanna Viticchi
COMMENTS: 22 pages; 16 figures
HIGHLIGHT: In the present investigation, we intend to demonstrate by means of numerical experiments that topological features combined with textural features can be enrolled for GBM analysis and morphological characterization on FLAIR.
2, TITLE: Generative Adversarial Network Rooms in Generative Graph Grammar Dungeons for The Legend of Zelda
http://arxiv.org/abs/2001.05065
AUTHORS: Jake Gutierrez ; Jacob Schrum
COMMENTS: Congress on Evolutionary Computation 2020
HIGHLIGHT: This paper combines a GAN approach to generating individual rooms with a graph grammar approach to combining rooms into a dungeon.
3, TITLE: Track to Reconstruct and Reconstruct to Track
http://arxiv.org/abs/1910.00130
AUTHORS: Jonathon Luiten ; Tobias Fischer ; Bastian Leibe
COMMENTS: RA-L 2020 and ICRA 2020
HIGHLIGHT: We propose a novel method that closes this loop, first tracking to reconstruct, and then reconstructing to track.
4, TITLE: XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
http://arxiv.org/abs/2004.01401
AUTHORS: Yaobo Liang ; Nan Duan ; Yeyun Gong ; Ning Wu ; Fenfei Guo ; Weizhen Qi ; Ming Gong ; Linjun Shou ; Daxin Jiang ; Guihong Cao ; Xiaodong Fan ; Bruce Zhang ; Rahul Agrawal ; Edward Cui ; Sining Wei ; Taroon Bharti ; Ying Qiao ; Jiun-Hung Chen ; Winnie Wu ; Shuguang Liu ; Fan Yang ; Rangan Majumder ; Ming Zhou
HIGHLIGHT: In this paper, we introduce XGLUE, a new benchmark dataset to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora, and evaluate their performance across a diverse set of cross-lingual tasks.
5, TITLE: Verification of Markov Decision Processes with Risk-Sensitive Measures
http://arxiv.org/abs/1803.00091
AUTHORS: Murat Cubuktepe ; Ufuk Topcu
COMMENTS: 7 pages, to appear in ACC 2018
HIGHLIGHT: Specifically, we use a particular risk-sensitive measure from cumulative prospect theory, which has been previously adopted in psychology and economics.
6, TITLE: MulayCap: Multi-layer Human Performance Capture Using A Monocular Video Camera
http://arxiv.org/abs/2004.05815
AUTHORS: Zhaoqi Su ; Weilin Wan ; Tao Yu ; Lingjie Liu ; Lu Fang ; Wenping Wang ; Yebin Liu
HIGHLIGHT: We introduce MulayCap, a novel human performance capture method using a monocular video camera without the need for pre-scanning.
7, TITLE: A Deep Learning Framework for Detection of Targets in Thermal Images to Improve Firefighting
http://arxiv.org/abs/1910.03617
AUTHORS: Manish Bhattarai ; Manel Martínez-Ramón
HIGHLIGHT: The objective of this research is to create an automated system that is capable of real-time, intelligent object detection and recognition and facilitates the improved situational awareness of firefighters during an emergency response.