-
Notifications
You must be signed in to change notification settings - Fork 6
/
2020.05.19.txt
1741 lines (1433 loc) · 134 KB
/
2020.05.19.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
==========New Papers==========
1, TITLE: Large scale weakly and semi-supervised learning for low-resource video ASR
http://arxiv.org/abs/2005.07850
AUTHORS: Kritika Singh ; Vimal Manohar ; Alex Xiao ; Sergey Edunov ; Ross Girshick ; Vitaliy Liptchinsky ; Christian Fuegen ; Yatharth Saraf ; Geoffrey Zweig ; Abdelrahman Mohamed
HIGHLIGHT: We investigate distillation methods at the frame level and the sequence level for hybrid, encoder-only CTC-based, and encoder-decoder speech recognition systems on Dutch and Romanian languages using 27,000 and 58,000 hours of unlabeled audio respectively.
2, TITLE: Neural Multi-Task Learning for Teacher Question Detection in Online Classrooms
http://arxiv.org/abs/2005.07845
AUTHORS: Gale Yan Huang ; Jiahao Chen ; Haochen Liu ; Weiping Fu ; Wenbiao Ding ; Jiliang Tang ; Songfan Yang ; Guoliang Li ; Zitao Liu
COMMENTS: The 21th International Conference on Artificial Intelligence in Education(AIED), 2020
HIGHLIGHT: Therefore, in this work, we build an end-to-end neural framework that automatically detects questions from teachers' audio recordings.
3, TITLE: COCAS: A Large-Scale Clothes Changing Person Dataset for Re-identification
http://arxiv.org/abs/2005.07862
AUTHORS: Shijie Yu ; Shihua Li ; Dapeng Chen ; Rui Zhao ; Junjie Yan ; Yu Qiao
COMMENTS: Accepted by CVPR2020
HIGHLIGHT: Based on COCAS, we introduce a new person re-id setting for clothes changing problem, where the query includes both a clothes template and a person image taking another clothes. To address the clothes changing person re-id problem, we construct a novel large-scale re-id benchmark named ClOthes ChAnging Person Set (COCAS), which provides multiple images of the same identity with different clothes.
4, TITLE: Attribute2Font: Creating Fonts You Want From Attributes
http://arxiv.org/abs/2005.07865
AUTHORS: Yizhi Wang ; Yue Gao ; Zhouhui Lian
COMMENTS: SIGGRAPH 2020 techniqual paper; Wang and Gao contribute equally; Code: https://hologerry.github.io/Attr2Font/
HIGHLIGHT: Inspired by this fact, we propose a novel model, Attribute2Font, to automatically create fonts by synthesizing visually-pleasing glyph images according to user-specified attributes and their corresponding values.
5, TITLE: Partial Domain Adaptation Using Graph Convolutional Networks
http://arxiv.org/abs/2005.07858
AUTHORS: Seunghan Yang ; Youngeun Kim ; Dongki Jung ; Changick Kim
HIGHLIGHT: To overcome these problems, we propose a graph partial domain adaptation (GPDA) network, which exploits Graph Convolutional Networks for jointly considering data structure and the feature distribution of each class.
6, TITLE: Concept Learning in Deep Reinforcement Learning
http://arxiv.org/abs/2005.07870
AUTHORS: Diego Gomez ; Nicanor Quijano Silva ; Luis Felipe Giraldo
HIGHLIGHT: Deep reinforcement learning techniques have shown to be a promising path to solve very complex tasks that once were thought to be out of the realm of machines.
7, TITLE: MicroNet for Efficient Language Modeling
http://arxiv.org/abs/2005.07877
AUTHORS: Zhongxia Yan ; Hanrui Wang ; Demi Guo ; Song Han
COMMENTS: Accepted by PMLR
HIGHLIGHT: In this paper, we provide the winning solution to the NeurIPS 2019 MicroNet Challenge in the language modeling track.
8, TITLE: Integrating Semantic and Structural Information with Graph Convolutional Network for Controversy Detection
http://arxiv.org/abs/2005.07886
AUTHORS: Lei Zhong ; Juan Cao ; Qiang Sheng ; Junbo Guo ; Ziang Wang
COMMENTS: 12 pages, 3 figures, 6 tables; To appear in ACL 2020 (long paper)
HIGHLIGHT: To overcome the first two limitations, we propose Topic-Post-Comment Graph Convolutional Network (TPC-GCN), which integrates the information from the graph structure and content of topics, posts, and comments for post-level controversy detection.
9, TITLE: Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems
http://arxiv.org/abs/2005.08742
AUTHORS: Tingzhi Mao ; Yerbolat Khassanov ; Van Tung Pham ; Haihua Xu ; Hao Huang ; Eng Siong Chng
HIGHLIGHT: In this paper, we present a series of complementary approaches to improve the recognition of underrepresented named entities (NE) in hybrid ASR systems without compromising overall word error rate performance.
10, TITLE: Multi-scale Grouped Dense Network for VVC Intra Coding
http://arxiv.org/abs/2005.07896
AUTHORS: Xin Li ; Simeng Sun ; Zhizheng Zhang ; Zhibo Chen
HIGHLIGHT: In this paper, we design the multi-scale grouped dense network (MSGDN) to further reduce the compression artifacts by combining the multi-scale and grouped dense block, which are integrated as the post-process network of VVC intra coding.
11, TITLE: Glottal Source Estimation using an Automatic Chirp Decomposition
http://arxiv.org/abs/2005.07897
AUTHORS: Thomas Drugman ; Baris Bozkurt ; Thierry Dutoit
HIGHLIGHT: A method is proposed for determining automatically this contour by inspecting the root distribution.
12, TITLE: Improving Named Entity Recognition in Tor Darknet with Local Distance Neighbor Feature
http://arxiv.org/abs/2005.08746
AUTHORS: Mhd Wesam Al-Nabki ; Francisco Jañez-Martino ; Roberto A. Vasco-Carofilis ; Eduardo Fidalgo ; Javier Velasco-Mata
COMMENTS: 2 pages, 1 figure, to be published in conference JNIC 2020
HIGHLIGHT: This paper adopts and improves the approach of Aguilar et al. by presenting a novel feature, called Local Distance Neighbor, which substitutes gazetteers.
13, TITLE: Learning Spatial-Spectral Prior for Super-Resolution of Hyperspectral Imagery
http://arxiv.org/abs/2005.08752
AUTHORS: Junjun Jiang ; He Sun ; Xianming Liu ; Jiayi Ma
COMMENTS: Accepted for publication at IEEE Transactions on Computational Imaging
HIGHLIGHT: In this paper, we make a step forward by investigating how to adapt state-of-the-art residual learning based single gray/RGB image super-resolution approaches for computationally efficient single hyperspectral image super-resolution, referred as SSPSR.
14, TITLE: Evaluating Performance of an Adult Pornography Classifier for Child Sexual Abuse Detection
http://arxiv.org/abs/2005.08766
AUTHORS: Mhd Wesam Al-Nabki ; Eduardo Fidalgo ; Roberto A. Vasco-Carofilis ; Francisco Jañez-Martino ; Javier Velasco-Mata
COMMENTS: 4 pages, 8 figures, to be published in conference JNIC 2020
HIGHLIGHT: In this paper, we identify which are the hardware and software requirements that may affect the performance of a forensic tool.
15, TITLE: Adapting JPEG XS gains and priorities to tasks and contents
http://arxiv.org/abs/2005.08768
AUTHORS: Benoit Brummer ; Christophe De Vleeschouwer
COMMENTS: CLIC at CVPR 2020
HIGHLIGHT: In this work we show that JPEG XS compression can be adapted to a specific given task and content, such as preserving visual quality on desktop content or maintaining high accuracy in neural network segmentation tasks, by optimizing its gain and priority parameters using the covariance matrix adaptation evolution strategy.
16, TITLE: A Statistical Story of Visual Illusions
http://arxiv.org/abs/2005.08772
AUTHORS: Elad Hirsch ; Ayellet Tal
HIGHLIGHT: Given this tool, we present an approach that manages to support the paradigm and explain visual illusions in a unified manner.
17, TITLE: Corpus of Chinese Dynastic Histories: Gender Analysis over Two Millennia
http://arxiv.org/abs/2005.08793
AUTHORS: Sergey Zinin ; Yang Xu
COMMENTS: 12th Conference on Language Resources and Evaluation (LREC 2020), 9 pages, 7 tables
HIGHLIGHT: This project introduces a new open-source corpus of twenty-four dynastic histories covered by Creative Commons license.
18, TITLE: Causal Feature Learning for Utility-Maximizing Agents
http://arxiv.org/abs/2005.08792
AUTHORS: David Kinney ; David Watson
HIGHLIGHT: We propose a new technique, pragmatic causal feature learning (PCFL), which extends the original CFL algorithm in useful and intuitive ways.
19, TITLE: TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
http://arxiv.org/abs/2005.08314
AUTHORS: Pengcheng Yin ; Graham Neubig ; Wen-tau Yih ; Sebastian Riedel
COMMENTS: To Appear at ACL 2020
HIGHLIGHT: In this paper we present TaBERT, a pretrained LM that jointly learns representations for NL sentences and (semi-)structured tables.
20, TITLE: AC-VRNN: Attentive Conditional-VRNN for Multi-Future Trajectory Prediction
http://arxiv.org/abs/2005.08307
AUTHORS: Alessia Bertugli ; Simone Calderara ; Pasquale Coscia ; Lamberto Ballan ; Rita Cucchiara
HIGHLIGHT: To this end, we propose a new generative model for multi-future trajectory prediction based on Conditional Variational Recurrent Neural Networks (C-VRNNs).
21, TITLE: Context-Based Quotation Recommendation
http://arxiv.org/abs/2005.08319
AUTHORS: Ansel MacLaughlin ; Tao Chen ; Burcu Karagol Ayan ; Dan Roth
COMMENTS: 11 pages, 2 figures
HIGHLIGHT: We therefore propose a novel context-aware quote recommendation system which utilizes the content an author has already written to generate a ranked list of quotable paragraphs and spans of tokens from a given source document.
22, TITLE: A Survey on Unknown Presentation Attack Detection for Fingerprint
http://arxiv.org/abs/2005.08337
AUTHORS: Jag Mohan Singh ; Ahmed Madhun ; Guoqiang Li ; Raghavendra Ramachandra
COMMENTS: Submitted to 3rd International Conference on Intelligent Technologies and Applications INTAP 2020
HIGHLIGHT: In this survey paper, we present a comprehensive survey on existing PAD algorithms for fingerprint recognition systems, specifically from the standpoint of detecting unknown PAD.
23, TITLE: Subject Identification Across Large Expression Variations Using 3D Facial Landmarks
http://arxiv.org/abs/2005.08339
AUTHORS: Sk Rahatul Jannat ; Diego Fabiano ; Shaun Canavan ; Tempestt Neal
HIGHLIGHT: Considering this, we propose to use 3D facial landmarks for the task of subject identification, over a range of expressed emotion.
24, TITLE: Cross-Lingual Word Embeddings for Turkic Languages
http://arxiv.org/abs/2005.08340
AUTHORS: Elmurod Kuriyozov ; Yerai Doval ; Carlos Gómez-Rodríguez
COMMENTS: Final version, published in the proceedings of LREC 2020
HIGHLIGHT: In this paper, we present the first viability study of established techniques to align monolingual embedding spaces for Turkish, Uzbek, Azeri, Kazakh and Kyrgyz, members of the Turkic family which is heavily affected by the low-resource constraint.
25, TITLE: Impact of multiple modalities on emotion recognition: investigation into 3d facial landmarks, action units, and physiological data
http://arxiv.org/abs/2005.08341
AUTHORS: Diego Fabiano ; Manikandan Jaishanker ; Shaun Canavan
HIGHLIGHT: Considering this, we present an analysis of 3D facial data, action units, and physiological data as it relates to their impact on emotion recognition.
26, TITLE: Detecting Forged Facial Videos using convolutional neural network
http://arxiv.org/abs/2005.08344
AUTHORS: Neilesh Sambhu ; Shaun Canavan
HIGHLIGHT: In this paper, we propose to detect forged videos, of faces, in online videos.
27, TITLE: Facial Action Unit Detection using 3D Facial Landmarks
http://arxiv.org/abs/2005.08343
AUTHORS: Saurabh Hinduja ; Shaun Canavan
HIGHLIGHT: In this paper, we propose to detect facial action units (AU) using 3D facial landmarks.
28, TITLE: Wake Word Detection with Alignment-Free Lattice-Free MMI
http://arxiv.org/abs/2005.08347
AUTHORS: Yiming Wang ; Hang Lv ; Daniel Povey ; Lei Xie ; Sanjeev Khudanpur
COMMENTS: Submitted to INTERSPEECH 2020
HIGHLIGHT: We present novel methods to train a hybrid DNN/HMM wake word detection system from partially labeled training data, and to use it in on-line applications: (i) we remove the prerequisite of frame-level alignments in the LF-MMI training algorithm, permitting the use of un-transcribed training examples that are annotated only for the presence/absence of the wake word; (ii) we show that the classical keyword/filler model must be supplemented with an explicit non-speech (silence) model for good performance; (iii) we present an FST-based decoder to perform online detection.
29, TITLE: Forecasting Solar Activity with Two Computational Intelligence Models (A Comparative Study)
http://arxiv.org/abs/2005.08350
AUTHORS: M. Parsapoor ; U. Bilstrup ; B. Svensson
HIGHLIGHT: Recently, we have proposed BELFIS (Brain Emotional Learning-based Fuzzy Inference System) as a tool for the forecasting of chaotic systems.
30, TITLE: MixingBoard: a Knowledgeable Stylized Integrated Text Generation Platform
http://arxiv.org/abs/2005.08365
AUTHORS: Xiang Gao ; Michel Galley ; Bill Dolan
COMMENTS: accepted at ACL 2020
HIGHLIGHT: We present MixingBoard, a platform for quickly building demos with a focus on knowledge grounded stylized text generation.
31, TITLE: Multi-Objective level generator generation with Marahel
http://arxiv.org/abs/2005.08368
AUTHORS: Ahmed Khalifa ; Julian Togelius
COMMENTS: Submitted to PCGWorkshop 2020, 8pages, 7 figures
HIGHLIGHT: This paper introduces a new system to design constructive level generators by searching the space of constructive level generators defined by Marahel language.
32, TITLE: A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
http://arxiv.org/abs/2005.08271
AUTHORS: Vladimir Iashin ; Esa Rahtu
COMMENTS: Project page is available on https://v-iashin.github.io/bmt
HIGHLIGHT: In this paper, we introduce Bi-modal Transformer which generalizes the Transformer architecture for a bi-modal input.
33, TITLE: Support-BERT: Predicting Quality of Question-Answer Pairs in MSDN using Deep Bidirectional Transformer
http://arxiv.org/abs/2005.08294
AUTHORS: Bhaskar Sen ; Nikhil Gopal ; Xinwei Xue
HIGHLIGHT: In this brief paper, we tackle the quality Q&A modeling problems from the community support websites using a recently developed deep learning model using bidirectional transformers.
34, TITLE: Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions
http://arxiv.org/abs/2005.07809
AUTHORS: Zhuohao Chen ; Nikolaos Flemotomos ; Victor Ardulov ; Torrey A. Creed ; Zac E. Imel ; David C. Atkins ; Shrikanth Narayanan
COMMENTS: Submitted to Interspeech 2020
HIGHLIGHT: In this paper, we develop an end-to-end pipeline that converts speech audio to diarized and transcribed text and extracts linguistic features to code the CBT sessions automatically.
35, TITLE: KEIS@JUST at SemEval-2020 Task 12: Identifying Multilingual Offensive Tweets Using Weighted Ensemble and Fine-Tuned BERT
http://arxiv.org/abs/2005.07820
AUTHORS: Saja Khaled Tawalbeh ; Mahmoud Hammad ; Mohammad AL-Smadi
COMMENTS: 8 pages without references, 4 figures, SemEval 2020 conference
HIGHLIGHT: This research presents our team KEIS@JUST participation at SemEval-2020 Task 12 which represents shared task on multilingual offensive language.
36, TITLE: Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification
http://arxiv.org/abs/2005.07817
AUTHORS: Yanpei Shi ; Qiang Huang ; Thomas Hain
COMMENTS: Submitted to Interspeech2020
HIGHLIGHT: In this paper, a hierarchical attention network is proposed to solve a weakly labelled speaker identification problem.
37, TITLE: Speaker Re-identification with Speaker Dependent Speech Enhancement
http://arxiv.org/abs/2005.07818
AUTHORS: Yanpei Shi ; Qiang Huang ; Thomas Hain
COMMENTS: Submitted to Interspeech2020
HIGHLIGHT: This paper introduces a novel approach that cascades speech enhancement and speaker recognition.
38, TITLE: Multi-step-ahead Prediction from Short-term Data by Delay-embedding-based Forecast Machine
http://arxiv.org/abs/2005.07842
AUTHORS: Hao Peng ; Pei Chen ; Rui Liu
COMMENTS: 18 pages, 6 figures
HIGHLIGHT: In this work, we proposed a novel framework, Delay-Embedding-based Forecast Machine (DEFM), to predict the future values of a target variable in an accurate and multi-step-ahead manner based on the high-dimensional short-term measurements.
39, TITLE: Generalizing The Davenport-Mahler-Mignotte Bound -- The Weighted Case
http://arxiv.org/abs/2005.07843
AUTHORS: Vikram Sharma
HIGHLIGHT: In this paper, we generalize these results by allowing arbitrary positive integer weights on the edges of the graph, i.e., for a weight function $w: E \rightarrow \mathbb{Z}_{>0}$, we derive an amortized lower bound on $\prod_{(\alpha,\beta) \in E}|\alpha-\beta|^{w(\alpha,\beta)}$.
40, TITLE: Joint Progressive Knowledge Distillation and Unsupervised Domain Adaptation
http://arxiv.org/abs/2005.07839
AUTHORS: Le Thanh Nguyen-Meidine ; Eric Granger ; Madhu Kiran ; Jose Dolz ; Louis-Antoine Blais-Morin
COMMENTS: Accepted to WCCI/IJCNN 2020
HIGHLIGHT: In this paper, we propose an unexplored direction -- the joint optimization of CNNs to provide a compressed model that is adapted to perform well for a given target domain.
41, TITLE: AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition
http://arxiv.org/abs/2005.07973
AUTHORS: Afroz Ahamad ; Ankit Anand ; Pranesh Bhargava
COMMENTS: Proceedings of the 12th Language Resources and Evaluation Conference - LREC, 2020
HIGHLIGHT: Thus, our work aims to aid ASR systems at every stage of development with a database for training, classification models for feature augmentation, and neutralization systems for acoustic transformations of non-native accents of English.
42, TITLE: Critical Impact of Social Networks Infodemic on Defeating Coronavirus COVID-19 Pandemic: Twitter-Based Study and Research Directions
http://arxiv.org/abs/2005.08820
AUTHORS: Azzam Mourad ; Ali Srour ; Haidar Harmanani ; Cathia Jenainatiy ; Mohamad Arafeh
COMMENTS: 11 pages, 10 figures, Journal Article
HIGHLIGHT: This paper presents a large-scale study based on data mined from Twitter.
43, TITLE: Machine learning on Big Data from Twitter to understand public reactions to COVID-19
http://arxiv.org/abs/2005.08817
AUTHORS: Jia Xue ; Junxiang Chen ; Chen Chen ; ChengDa Zheng ; Tingshao Zhu
HIGHLIGHT: The study aims to understand Twitter users' discussions and reactions about the COVID-19.
44, TITLE: Multi-level Feature Fusion-based CNN for Local Climate Zone Classification from Sentinel-2 Images: Benchmark Results on the So2Sat LCZ42 Dataset
http://arxiv.org/abs/2005.07983
AUTHORS: Chunping Qiu ; Xiaochong Tong ; Michael Schmitt ; Benjamin Bechtel ; Xiao Xiang Zhu
HIGHLIGHT: Using this base network, we propose fusing multi-level features using the extended Sen2LCZ-Net-MF.
45, TITLE: Unsupervised Embedding-based Detection of Lexical Semantic Changes
http://arxiv.org/abs/2005.07979
AUTHORS: Ehsaneddin Asgari ; Christoph Ringlstetter ; Hinrich Schütze
HIGHLIGHT: This paper describes EmbLexChange, a system introduced by the "Life-Language" team for SemEval-2020 Task 1, on unsupervised detection of lexical-semantic changes.
46, TITLE: Inflecting when there's no majority: Limitations of encoder-decoder neural networks as cognitive models for German plurals
http://arxiv.org/abs/2005.08826
AUTHORS: Kate McCurdy ; Sharon Goldwater ; Adam Lopez
COMMENTS: To appear at ACL 2020
HIGHLIGHT: Encoder-decoder models do generalize the most frequently produced plural class, but do not show human-like variability or 'regular' extension of these other plural markers. To investigate this question, we first collect a new dataset from German speakers (production and ratings of plural forms for novel nouns) that is designed to avoid sources of information unavailable to the ED model.
47, TITLE: Visual Memorability for Robotic Interestingness Prediction via Unsupervised Online Learning
http://arxiv.org/abs/2005.08829
AUTHORS: Chen Wang ; Wenshan Wang ; Yuheng Qiu ; Yafei Hu ; Sebastian Scherer
HIGHLIGHT: In this paper, we aim to solve the problem of interesting scene prediction for mobile robots.
48, TITLE: Non-Linearities Improve OrigiNet based on Active Imaging for Micro Expression Recognition
http://arxiv.org/abs/2005.07991
AUTHORS: Monu Verma ; Santosh Kumar Vipparthi ; Girdhari Singh
HIGHLIGHT: In this paper, we propose a new refined rectified linear unit (RReLU), which overcome the problem of vanishing gradient and dying ReLU.
49, TITLE: Revisiting Agglomerative Clustering
http://arxiv.org/abs/2005.07995
AUTHORS: Eric K. Tokuda ; Cesar H. Comin ; Luciano da F. Costa
HIGHLIGHT: More importantly, we adopt a generic model of clusters involving a higher density core surrounded by a transition zone, followed by a sparser set of outliers.
50, TITLE: A Text Reassembling Approach to NaturalLanguage Generation
http://arxiv.org/abs/2005.07988
AUTHORS: Xiao Li ; Kees van Deemter ; Chenghua Lin
HIGHLIGHT: Focussing on some of the key NLG tasks (namely Content Selection, Lexical Choice, and Linguistic Realisation), we propose a novel approach, called the Text Reassembling approach to NLG (TRG), which approaches the ideal of a purely statistical approach very closely, and which is at the same time highly transparent.
51, TITLE: MMFashion: An Open-Source Toolbox for Visual Fashion Analysis
http://arxiv.org/abs/2005.08847
AUTHORS: Xin Liu ; Jiancheng Li ; Jiaqi Wang ; Ziwei Liu
HIGHLIGHT: We welcome all contributions to this still-growing efforts towards open science: https://github.com/open-mmlab/mmfashion.
52, TITLE: Grammatical gender associations outweigh topical gender bias in crosslinguistic word embeddings
http://arxiv.org/abs/2005.08864
AUTHORS: Katherine McCurdy ; Oguz Serbetci
COMMENTS: Extended abstract presented at the WiNLP workshop, ACL 2017
HIGHLIGHT: Recent research has demonstrated that vector space models of semantics can reflect undesirable biases in human culture.
53, TITLE: Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
http://arxiv.org/abs/2005.08866
AUTHORS: Sam Coope ; Tyler Farghly ; Daniela Gerz ; Ivan Vulić ; Matthew Henderson
COMMENTS: ACL 2020
HIGHLIGHT: We introduce Span-ConveRT, a light-weight model for dialog slot-filling which frames the task as a turn-based span extraction task.
54, TITLE: Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps
http://arxiv.org/abs/2005.08874
AUTHORS: Tobias Huber ; Katharina Weitz ; Elisabeth André ; Ofra Amir
HIGHLIGHT: In this paper, we combine global and local explanation methods, and evaluate their joint and separate contributions, providing (to the best of our knowledge) the first user study of combined local and global explanations for RL agents.
55, TITLE: Deep Implicit Volume Compression
http://arxiv.org/abs/2005.08877
AUTHORS: Danhang Tang ; Saurabh Singh ; Philip A. Chou ; Christian Haene ; Mingsong Dou ; Sean Fanello ; Jonathan Taylor ; Philip Davidson ; Onur G. Guleryuz ; Yinda Zhang ; Shahram Izadi ; Andrea Tagliasacchi ; Sofien Bouaziz ; Cem Keskin
COMMENTS: Danhang Tang and Saurabh Singh have equal contribution
HIGHLIGHT: We describe a novel approach for compressing truncated signed distance fields (TSDF) stored in 3D voxel grids, and their corresponding textures.
56, TITLE: Content analysis of Persian/Farsi Tweets during COVID-19 pandemic in Iran using NLP
http://arxiv.org/abs/2005.08400
AUTHORS: Pedram Hosseini ; Poorya Hosseini ; David A. Broniatowski
HIGHLIGHT: In this study, using more than 530,000 original tweets in Persian/Farsi on COVID-19, we analyzed the topics discussed among users, who are mainly Iranians, to gauge and track the response to the pandemic and how it evolved over time.
57, TITLE: The Weifeiler-Leman Algorithm and Recognition of Graph Properties
http://arxiv.org/abs/2005.08887
AUTHORS: Frank Fuhlbrück ; Johannes Köbler ; Ilia Ponomarenko ; Oleg Verbitsky
COMMENTS: 24 pages, 2 figures. This paper supersedes Section 5 in the first version of arXiv:2002.04590
HIGHLIGHT: We address the applicability of $k$-WL to recognition of graph properties.
58, TITLE: Deep Snow: Synthesizing Remote Sensing Imagery with Generative Adversarial Nets
http://arxiv.org/abs/2005.08892
AUTHORS: Christopher X. Ren ; Amanda Ziemann ; James Theiler ; Alice M. S. Durieux
HIGHLIGHT: In this work we demonstrate that generative adversarial networks (GANs) can be used to generate realistic pervasive changes in remote sensing imagery, even in an unpaired training setting.
59, TITLE: Generative Tweening: Long-term Inbetweening of 3D Human Motions
http://arxiv.org/abs/2005.08891
AUTHORS: Yi Zhou ; Jingwan Lu ; Connelly Barnes ; Jimei Yang ; Sitao Xiang ; Hao li
HIGHLIGHT: To this end, we introduce the problem of long-term inbetweening, which involves automatically synthesizing complex motions over a long time interval given very sparse keyframes by users.
60, TITLE: Single-sample writers -- "Document Filter" and their impacts on writer identification
http://arxiv.org/abs/2005.08424
AUTHORS: Fabio Pinhelli ; Alceu S. Britto Jr ; Luiz S. Oliveira ; Yandre M. G. Costa ; Diego Bertolini
HIGHLIGHT: In this work, perform a detailed study in which we dissect whether or not the use of a database with only a single sample taken from some writers may skew the results obtained in the experimental protocol.
61, TITLE: Syntax-guided Controlled Generation of Paraphrases
http://arxiv.org/abs/2005.08417
AUTHORS: Ashutosh Kumar ; Kabir Ahuja ; Raghuram Vadapalli ; Partha Talukdar
COMMENTS: 16 pages, 3 figures, Accepted to TACL 2020
HIGHLIGHT: We address this limitation in the paper and propose Syntax Guided Controlled Paraphraser (SGCP), an end-to-end framework for syntactic paraphrase generation.
62, TITLE: Deep Learning and Bayesian Deep Learning Based Gender Prediction in Multi-Scale Brain Functional Connectivity
http://arxiv.org/abs/2005.08431
AUTHORS: Gengyan Zhao ; Gyujoon Hwang ; Cole J. Cook ; Fang Liu ; Mary E. Meyerand ; Rasmus M. Birn
COMMENTS: 40 pages, 10 figures
HIGHLIGHT: Hence, in this study we propose to predict gender from multiple scales of brain FC with deep learning, which can extract full FC patterns as features.
63, TITLE: The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge
http://arxiv.org/abs/2005.08433
AUTHORS: Tien-Hong Lo ; Fu-An Chao ; Shi-Yan Weng ; Berlin Chen
COMMENTS: Submitted to Interspeech 2020 Special Session: Shared Task on Automatic Speech Recognition for Non-Native Children's Speech
HIGHLIGHT: This paper describes the NTNU ASR system participating in the Interspeech 2020 Non-Native Children's Speech ASR Challenge supported by the SIG-CHILD group of ISCA.
64, TITLE: An Effective End-to-End Modeling Approach for Mispronunciation Detection
http://arxiv.org/abs/2005.08440
AUTHORS: Tien-Hong Lo ; Shi-Yan Weng ; Hsiu-Jui Chang ; Berlin Chen
COMMENTS: Submitted to Interspeech 2020
HIGHLIGHT: Despite the widespread adoption of E2E modeling frameworks on ASR, there still is a dearth of work on investigating the E2E frameworks for use in computer-assisted pronunciation learning (CAPT), particularly for Mispronunciation detection (MD).
65, TITLE: Cross-Task Transfer for Multimodal Aerial Scene Recognition
http://arxiv.org/abs/2005.08449
AUTHORS: Di Hu ; Xuhong Li ; Lichao Mou ; Pu Jin ; Dong Chen ; Liping Jing ; Xiaoxiang Zhu ; Dejing Dou
HIGHLIGHT: In this paper, for improving the performance on the aerial scene recognition, we explore a novel audiovisual aerial scene recognition task using both images and sounds as input. For this purpose, we have constructed a new dataset named AuDio Visual Aerial sceNe reCognition datasEt (ADVANCE).
66, TITLE: Deep Convolutional Sparse Coding Networks for Image Fusion
http://arxiv.org/abs/2005.08448
AUTHORS: Shuang Xu ; Zixiang Zhao ; Yicheng Wang ; Chunxia Zhang ; Junmin Liu ; Jiangshe Zhang
HIGHLIGHT: This paper presents three deep convolutional sparse coding (CSC) networks for three kinds of image fusion tasks (i.e., infrared and visible image fusion, multi-exposure image fusion, and multi-modal image fusion).
67, TITLE: Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels
http://arxiv.org/abs/2005.08455
AUTHORS: Junran Peng ; Xingyuan Bu ; Ming Sun ; Zhaoxiang Zhang ; Tieniu Tan ; Junjie Yan
COMMENTS: CVPR2020 oral. The first two authors contribute equally
HIGHLIGHT: In this work, we quantitatively analyze these label problems and provide a simple but effective solution.
68, TITLE: Bayesian convolutional neural network based MRI brain extraction on nonhuman primates
http://arxiv.org/abs/2005.08460
AUTHORS: Gengyan Zhao ; Fang Liu ; Jonathan A. Oler ; Mary E. Meyerand ; Ned H. Kalin ; Rasmus M. Birn
COMMENTS: 37 pages, 14 figures
HIGHLIGHT: To overcome the challenges of brain extraction in nonhuman primates, we propose a fully-automated brain extraction pipeline combining deep Bayesian convolutional neural network (CNN) and fully connected three-dimensional (3D) conditional random field (CRF).
69, TITLE: Feature Transformation Ensemble Model with Batch Spectral Regularization for Cross-Domain Few-Shot Classification
http://arxiv.org/abs/2005.08463
AUTHORS: Bingyu Liu ; Zhen Zhao ; Zhenpeng Li ; Jianan Jiang ; Yuhong Guo ; Haifeng Shen ; Jieping Ye
HIGHLIGHT: In this paper, we propose a feature transformation ensemble model with batch spectral regularization and label propagation for the CD-FSL challenge.
70, TITLE: Context-aware and Scale-insensitive Temporal Repetition Counting
http://arxiv.org/abs/2005.08465
AUTHORS: Huaidong Zhang ; Xuemiao Xu ; Guoqiang Han ; Shengfeng He
COMMENTS: Accepted by CVPR2020
HIGHLIGHT: In this paper, we tailor a context-aware and scale-insensitive framework, to tackle the challenges in repetition counting caused by the unknown and diverse cycle-lengths. To benefit the training and evaluation of temporal repetition counting area, we construct a new and largest benchmark, which contains 526 videos with diverse repetitive actions.
71, TITLE: Text Classification with Few Examples using Controlled Generalization
http://arxiv.org/abs/2005.08469
AUTHORS: Abhijit Mahabal ; Jason Baldridge ; Burcu Karagol Ayan ; Vincent Perot ; Dan Roth
HIGHLIGHT: This produces task-specific semantic vectors; here, we show that a feed-forward network over these vectors is especially effective in low-data scenarios, compared to existing state-of-the-art methods.
72, TITLE: Extreme Low-Light Imaging with Multi-granulation Cooperative Networks
http://arxiv.org/abs/2005.08001
AUTHORS: Keqi Wang ; Peng Gao ; Steven Hoi ; Qian Guo ; Yuhua Qian
HIGHLIGHT: In this paper, we propose a novel method of multi-granulation cooperative networks (MCN) with bidirectional information flow to enhance extreme low-light images, and design an illumination map estimation function (IMEF) to preserve high dynamic range (HDR).
73, TITLE: Deep Lighting Environment Map Estimation from Spherical Panoramas
http://arxiv.org/abs/2005.08000
AUTHORS: Vasileios Gkitsas ; Nikolaos Zioulis ; Federico Alvarez ; Dimitrios Zarpalas ; Petros Daras
COMMENTS: Code and models available at https://vcl3d.github.io/DeepPanoramaLighting
HIGHLIGHT: In this work we present a data-driven model that estimates an HDR lighting environment map from a single LDR monocular spherical panorama.
74, TITLE: Towards in-store multi-person tracking using head detection and track heatmaps
http://arxiv.org/abs/2005.08009
AUTHORS: Aibek Musaev ; Jiangping Wang ; Liang Zhu ; Cheng Li ; Yi Chen ; Jialin Liu ; Wanqi Zhang ; Juan Mei ; De Wang
HIGHLIGHT: In this paper, we study the problem of computer vision based customer tracking in retail industry. To this end, we introduce a dataset collected from a camera in an office environment where participants mimic various behaviors of customers in a supermarket.
75, TITLE: Attention-based Transducer for Online Speech Recognition
http://arxiv.org/abs/2005.08497
AUTHORS: Bin Wang ; Yan Yin ; Hui Lin
COMMENTS: submitted to Interspeech 2020
HIGHLIGHT: We propose attention-based transducer with modification over RNN-T in two aspects.
76, TITLE: Fixed Point Semantics for Stream Reasoning
http://arxiv.org/abs/2005.08384
AUTHORS: Christian Antić
HIGHLIGHT: This paper fixes all of the aforementioned shortcomings of LARS.
77, TITLE: Vector-Quantized Autoregressive Predictive Coding
http://arxiv.org/abs/2005.08392
AUTHORS: Yu-An Chung ; Hao Tang ; James Glass
HIGHLIGHT: In this work, we propose Vector-Quantized Autoregressive Predictive Coding (VQ-APC), a novel model that produces quantized representations, allowing us to explicitly control the amount of information encoded in the representations.
78, TITLE: A tutorial introduction to quantum circuit programming in dependently typed Proto-Quipper
http://arxiv.org/abs/2005.08396
AUTHORS: Peng Fu ; Kohei Kishida ; Neil J. Ross ; Peter Selinger
COMMENTS: To appear in Proceedings of the 12th International Conference on Reversible Computation (RC 2020), Oslo, Norway, 2020
HIGHLIGHT: We introduce dependently typed Proto-Quipper, or Proto-Quipper-D for short, an experimental quantum circuit programming language with linear dependent types.
79, TITLE: T-VSE: Transformer-Based Visual Semantic Embedding
http://arxiv.org/abs/2005.08399
AUTHORS: Muhammet Bastan ; Arnau Ramisa ; Mehmet Tek
COMMENTS: To appear: CVPR 2020 Workshop on Computer Vision for Fashion, Art and Design (CVFAD 2020)
HIGHLIGHT: In this paper, we show that dataset scale and training strategy are critical and demonstrate that transformer-based cross-modal embeddings outperform word average and RNN-based embeddings by a large margin, when trained on a large dataset of e-commerce product image-title pairs.
80, TITLE: Oscillating Statistical Moments for Speech Polarity Detection
http://arxiv.org/abs/2005.07901
AUTHORS: Thomas Drugman ; Thierry Dutoit
HIGHLIGHT: This paper proposes a new approach of polarity detection relying on oscillating statistical moments.
81, TITLE: The Power of Triply Complementary Priors for Image Compressive Sensing
http://arxiv.org/abs/2005.07902
AUTHORS: Zhiyuan Zha ; Xin Yuan ; Joey Tianyi Zhou ; Jiantao Zhou ; Bihan Wen ; Ce Zhu
HIGHLIGHT: In this paper, we propose a joint low-rank and deep (LRD) image model, which contains a pair of triply complementary priors, namely \textit{external} and \textit{internal}, \textit{deep} and \textit{shallow}, and \textit{local} and \textit{non-local} priors.
82, TITLE: Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
http://arxiv.org/abs/2005.07903
AUTHORS: Zhengkun Tian ; Jiangyan Yi ; Jianhua Tao ; Ye Bai ; Shuai Zhang ; Zhengqi Wen
COMMENTS: 5 pages
HIGHLIGHT: To address this problem and improve the inference speed, we propose a spike-triggered non-autoregressive transformer model for end-to-end speech recognition, which introduces a CTC module to predict the length of the target sequence and accelerate the convergence.
83, TITLE: A Dichotomy for Real Boolean Holant Problems
http://arxiv.org/abs/2005.07906
AUTHORS: Shuai Shao ; Jin-Yi Cai
COMMENTS: 91 pages, 4 figures
HIGHLIGHT: We prove a complexity dichotomy for Holant problems on the boolean domain with arbitrary sets of real-valued constraint functions.
84, TITLE: Reducing Spelling Inconsistencies in Code-Switching ASR using Contextualized CTC Loss
http://arxiv.org/abs/2005.07920
AUTHORS: Burin Naowarat ; Thananchai Kongthaworn ; Korrawe Karunratanakul ; Sheng Hui Wu ; Ekapol Chuangsuwanich
COMMENTS: 7 pages, 5 figures, submitted to INTERSPEECH 2020
HIGHLIGHT: We propose Contextualized Connectionist Temporal Classification (CCTC) loss to encourage spelling consistencies of a character-based non-autoregressive ASR which allows for faster inference.
85, TITLE: Deep-learning of Parametric Partial Differential Equations from Sparse and Noisy Data
http://arxiv.org/abs/2005.07916
AUTHORS: Hao Xu ; Dongxiao Zhang ; Junsheng Zeng
COMMENTS: 30 pages, 6 figures, and 7 tables
HIGHLIGHT: In this work, a new framework, which combines neural network, genetic algorithm and adaptive methods, is put forward to address all of these challenges simultaneously.
86, TITLE: Deep feature fusion for self-supervised monocular depth prediction
http://arxiv.org/abs/2005.07922
AUTHORS: Vinay Kaushik ; Brejesh Lall
COMMENTS: 4 pages, 2 Tables, 2 Figures
HIGHLIGHT: We propose a deep feature fusion method utilising features at multiple scales for learning self-supervised depth from scratch.
87, TITLE: Sequential Sentence Matching Network for Multi-turn Response Selection in Retrieval-based Chatbots
http://arxiv.org/abs/2005.07923
AUTHORS: Chao Xiong ; Che Liu ; Zijun Xu ; Junfeng Jiang ; Jieping Ye
COMMENTS: 10 pages, 4 figures
HIGHLIGHT: In this work, we propose a matching network, called sequential sentence matching network (S2M), to use the sentence-level semantic information to address the problem.
88, TITLE: Artificial Intelligence Assisted Collaborative Edge Caching in Small Cell Networks
http://arxiv.org/abs/2005.07941
AUTHORS: Md Ferdous Pervej ; Le Thanh Tan ; Rose Qingyang Hu
COMMENTS: Submitted for possible publication
HIGHLIGHT: Thanks to artificial intelligence (AI), based on the methodologies of the conventional particle swarm optimization (PSO), we propose a modified PSO (M-PSO) to efficiently solve the complex constraint problem in a reasonable time.
89, TITLE: ApplicaAI at SemEval-2020 Task 11: On RoBERTa-CRF, Span CLS and Whether Self-Training Helps Them
http://arxiv.org/abs/2005.07934
AUTHORS: Dawid Jurkiewicz ; Łukasz Borchmann ; Izabela Kosmala ; Filip Graliński
HIGHLIGHT: An ensemble of RoBERTa-based models was proposed for the TC task, with one of them making use of Span CLS layers we introduce in the present paper.
90, TITLE: Logical Inferences with Comparatives and Generalized Quantifiers
http://arxiv.org/abs/2005.07954
AUTHORS: Izumi Haruta ; Koji Mineshima ; Daisuke Bekki
COMMENTS: To appear in the Proceedings of the Association for Computational Linguistics: Student Research Workshop (ACL-SRW 2020)
HIGHLIGHT: In this paper, we present a compositional semantics that maps various comparative constructions in English to semantic representations via Combinatory Categorial Grammar (CCG) parsers and combine it with an inference system based on automated theorem proving.
91, TITLE: Polynomial-time approximation algorithms for the antiferromagnetic Ising model on line graphs
http://arxiv.org/abs/2005.07944
AUTHORS: Martin Dyer ; Marc Heinrich ; Mark Jerrum ; Haiko Müller
COMMENTS: 17 pages
HIGHLIGHT: We present a polynomial-time Markov chain Monte Carlo algorithm for estimating the partition function of the antiferromagnetic Ising model on any line graph.
92, TITLE: Data Driven Aircraft Trajectory Prediction with Deep Imitation Learning
http://arxiv.org/abs/2005.07960
AUTHORS: Alevizos Bastas ; Theocharis Kravaris ; George A. Vouros
HIGHLIGHT: In this paper we approach the data-driven trajectory prediction problem as an imitation learning task, where we aim to imitate experts "shaping" the trajectory.
93, TITLE: Hierarchical and Efficient Learning for Person Re-Identification
http://arxiv.org/abs/2005.08812
AUTHORS: Jiangning Zhang ; Liang Liu ; Chao Xu ; Yong Liu
HIGHLIGHT: In this paper, we propose a novel Hierarchical and Efficient Network (HENet) that learns hierarchical global, partial, and recovery features ensemble under the supervision of multiple loss combinations.
94, TITLE: Interaction Matching for Long-Tail Multi-Label Classification
http://arxiv.org/abs/2005.08805
AUTHORS: Sean MacAvaney ; Franck Dernoncourt ; Walter Chang ; Nazli Goharian ; Ophir Frieder
HIGHLIGHT: We present an elegant and effective approach for addressing limitations in existing multi-label classification models by incorporating interaction matching, a concept shown to be useful for ad-hoc search result ranking.
95, TITLE: Niose-Sampling Cross Entropy Loss: Improving Disparity Regression Via Cost Volume Aware Regularizer
http://arxiv.org/abs/2005.08806
AUTHORS: Yang Chen ; Zongqing Lu ; Xuechen Zhang ; Lei Chen ; Qinming Liao
COMMENTS: Accepted by IEEE ICIP 2020
HIGHLIGHT: In this paper, inspired by previous canonical definition of cost volume, we propose the noise-sampling cross entropy loss function to regularize the cost volume produced by deep neural networks to be unimodal and coherent.
96, TITLE: VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization
http://arxiv.org/abs/2005.08501
AUTHORS: Cheng Gong ; Yao Chen ; Ye Lu ; Tao Li ; Cong Hao ; Deming Chen
COMMENTS: 14 pages, 9 figures, Journal
HIGHLIGHT: In this paper, we propose a novel metric called Vector Loss.
97, TITLE: Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
http://arxiv.org/abs/2005.08514
AUTHORS: Cunjun Yu ; Xiao Ma ; Jiawei Ren ; Haiyu Zhao ; Shuai Yi
COMMENTS: 19 pages, 8 figures, 2 tables
HIGHLIGHT: In this paper, we present STAR, a Spatio-Temporal grAph tRansformer framework, which tackles trajectory prediction by only attention mechanisms.
98, TITLE: Robust Training of Vector Quantized Bottleneck Models
http://arxiv.org/abs/2005.08520
AUTHORS: Adrian Łańcucki ; Jan Chorowski ; Guillaume Sanchez ; Ricard Marxer ; Nanxin Chen ; Hans J. G. A. Dolfing ; Sameer Khurana ; Tanel Alumäe ; Antoine Laurent
COMMENTS: Published at IJCNN 2020
HIGHLIGHT: In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs).
99, TITLE: Automatic Knowledge Acquisition for Object-Oriented Expert Systems
http://arxiv.org/abs/2005.08517
AUTHORS: Joël Colloc ; Danielle Boulanger
HIGHLIGHT: We describe an Object Oriented Model for building Expert Systems.
100, TITLE: Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
http://arxiv.org/abs/2005.08516
AUTHORS: Swaroop Mishra ; Arindam Mitra ; Neeraj Varshney ; Bhavdeep Sachdeva ; Chitta Baral
COMMENTS: 10 pages
HIGHLIGHT: In pursuit of this goal, we introduce NUMBERGAME, a multifaceted benchmark to evaluate model performance across numerical reasoning tasks of eight diverse formats.
101, TITLE: SemEval-2020 Task 5: Detecting Counterfactuals by Disambiguation
http://arxiv.org/abs/2005.08519
AUTHORS: Hanna Abi Akl ; Dominique Mariko ; Estelle Labidurie
HIGHLIGHT: In this paper, we explore strategies to detect and evaluate counterfactual sentences.
102, TITLE: An Algebraic Model For Quorum Systems
http://arxiv.org/abs/2005.08536
AUTHORS: Alex Pellegrini ; Luca Zanolini
COMMENTS: 15 pages, 3 algorithms
HIGHLIGHT: In this paper we give a new interpretation of quorum systems, starting with classical majority-based quorum systems and extending this to Byzantine quorum systems.
103, TITLE: Omni-supervised Facial Expression Recognition: A Simple Baseline
http://arxiv.org/abs/2005.08551
AUTHORS: Ping Liu ; Yunchao Wei ; Zibo Meng ; Weihong Deng ; Joey Tianyi Zhou ; Yi Yang
HIGHLIGHT: In this paper, we target on advancing the performance in facial expression recognition (FER) by exploiting omni-supervised learning.
104, TITLE: Learning to Model and Calibrate Optics via a Differentiable Wave Optics Simulator
http://arxiv.org/abs/2005.08562
AUTHORS: Josue Page ; Paolo Favaro
COMMENTS: 6 pages, 3 figures, for source code see https://github.com/pvjosue/WaveBlocks, to be published in IEEE 2020 International Conference on Image Processing (ICIP 2020)
HIGHLIGHT: We present a novel learning-based method to build a differentiable computational model of a real fluorescence microscope.
105, TITLE: Audio-visual Multi-channel Recognition of Overlapped Speech
http://arxiv.org/abs/2005.08571
AUTHORS: Jianwei Yu ; Bo Wu ; Rongzhi Gu Shi-Xiong Zhang Lianwu Chen Yong Xu Meng Yu ; Dan Su ; Dong Yu ; Xunying Liu ; Helen Meng
COMMENTS: submitted to Interspeech 2020
HIGHLIGHT: Motivated by the invariance of visual modality to acoustic signal corruption, this paper presents an audio-visual multi-channel overlapped speech recognition system featuring tightly integrated separation front-end and recognition back-end.
106, TITLE: Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
http://arxiv.org/abs/2005.08575
AUTHORS: Po-Han Chi ; Pei-Hung Chung ; Tsung-Han Wu ; Chun-Cheng Hsieh ; Shang-Wen Li ; Hung-yi Lee
COMMENTS: 5 pages, 6 figures
HIGHLIGHT: In this paper, we propose Audio ALBERT, a lite version of the self-supervised speech representation model.
107, TITLE: Single-Stage Semantic Segmentation from Image Labels
http://arxiv.org/abs/2005.08104
AUTHORS: Nikita Araslanov ; Stefan Roth
COMMENTS: To appear at CVPR 2020; minor corrections in Eq. (9). Code: https://github.com/visinf/1-stage-wseg
HIGHLIGHT: We show that despite its simplicity, our method achieves results that are competitive with significantly more complex pipelines, substantially outperforming earlier single-stage methods.
108, TITLE: Learning Probabilistic Sentence Representations from Paraphrases
http://arxiv.org/abs/2005.08105
AUTHORS: Mingda Chen ; Kevin Gimpel
COMMENTS: Repl4NLP at ACL 2020, short paper
HIGHLIGHT: In this paper we define probabilistic models that produce distributions for sentences.
109, TITLE: Analytic Signal Phase in $N-D$ by Linear Symmetry Tensor--fingerprint modeling
http://arxiv.org/abs/2005.08108
AUTHORS: Josef Bigun ; Fernando Alonso-Fernandez
HIGHLIGHT: We reveal that the Analytic Signal phase, and its gradient have a hitherto unstudied discontinuity in $2-D $ and higher dimensions.
110, TITLE: Efficient Wait-k Models for Simultaneous Machine Translation
http://arxiv.org/abs/2005.08595
AUTHORS: Maha Elbayad ; Laurent Besacier ; Jakob Verbeek
HIGHLIGHT: Wait-k decoders offer a simple but efficient approach for this problem.
111, TITLE: RPD: A Distance Function Between Word Embeddings
http://arxiv.org/abs/2005.08113
AUTHORS: Xuhui Zhou ; Zaixiang Zheng ; Shujian Huang
COMMENTS: ACL Student Research Workshop 2020
HIGHLIGHT: In this paper, we propose a novel metric called Relative pairwise inner Product Distance (RPD) to quantify the distance between different sets of word embeddings.
112, TITLE: Mutual Information Maximization for Robust Plannable Representations
http://arxiv.org/abs/2005.08114
AUTHORS: Yiming Ding ; Ignasi Clavera ; Pieter Abbeel
COMMENTS: Accepted at NeurIPS 2019 Workshop on Robot Learning: Control and Interaction in the Real World
HIGHLIGHT: In this work, we present MIRO, an information theoretic representational learning algorithm for model-based reinforcement learning.
113, TITLE: From Boundaries to Bumps: when closed (extremal) contours are critical
http://arxiv.org/abs/2005.08116
AUTHORS: Benjamin Kunsberg ; Steven W. Zucker
HIGHLIGHT: From Boundaries to Bumps: when closed (extremal) contours are critical
114, TITLE: That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages
http://arxiv.org/abs/2005.08118
AUTHORS: Piotr Żelasko ; Laureano Moro-Velázquez ; Mark Hasegawa-Johnson ; Odette Scharenborg ; Najim Dehak
COMMENTS: Submitted to Interspeech 2020. For some reason, the ArXiv Latex engine rendered it in more than 4 pages
HIGHLIGHT: In this work, we focus on gaining a deeper understanding of how general these representations might be, and how individual phones are getting improved in a multilingual setting.
115, TITLE: Neural Collaborative Reasoning
http://arxiv.org/abs/2005.08129
AUTHORS: Hanxiong Chen ; Shaoyun Shi ; Yunqi Li ; Yongfeng Zhang
COMMENTS: 10 pages, 5 figures
HIGHLIGHT: Inspired by recent progress on neural-symbolic machine learning, we propose a framework to integrate the power of embedding learning and logical reasoning, where the embeddings capture similarity patterns in data from perceptual perspectives, and the logic facilitates cognitive reasoning for informed decision making.
116, TITLE: VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change
http://arxiv.org/abs/2005.08135
AUTHORS: Mubariz Zaffar ; Shoaib Ehsan ; Michael Milford ; David Flynn ; Klaus McDonald-Maier
COMMENTS: Currently under-review, 25 pages, 16 figures
HIGHLIGHT: In this paper we address these key challenges through a new comprehensive open-source evaluation framework, dubbed 'VPR-Bench'.
117, TITLE: Train in Germany, Test in The USA: Making 3D Object Detectors Generalize
http://arxiv.org/abs/2005.08139
AUTHORS: Yan Wang ; Xiangyu Chen ; Yurong You ; Li Erran ; Bharath Hariharan ; Mark Campbell ; Kilian Q. Weinberger ; Wei-Lun Chao
COMMENTS: Accepted to 2020 Conference on Computer Vision and Pattern Recognition (CVPR 2020)
HIGHLIGHT: In this paper we consider the task of adapting 3D object detectors from one dataset to another.
118, TITLE: IntelliCode Compose: Code Generation Using Transformer
http://arxiv.org/abs/2005.08025
AUTHORS: Alexey Svyatkovskiy ; Shao Kun Deng ; Shengyu Fu ; Neel Sundaresan
COMMENTS: 15 pages, 6 figures
HIGHLIGHT: In this paper, we introduce IntelliCode Compose $-$ a general-purpose multilingual code completion tool which is capable of predicting sequences of code tokens of arbitrary types, generating up to entire lines of syntactically correct code.
119, TITLE: Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
http://arxiv.org/abs/2005.08024
AUTHORS: Tao Tu ; Yuan-Jui Chen ; Alexander H. Liu ; Hung-yi Lee
COMMENTS: Submitted to Interspeech 2020
HIGHLIGHT: In this work, we propose a semi-supervised learning approach for multi-speaker TTS.
120, TITLE: Various Total Variation for Snapshot Video Compressive Imaging
http://arxiv.org/abs/2005.08028
AUTHORS: Xin Yuan
COMMENTS: 5 pages, 4 figures
HIGHLIGHT: This paper aims to answer the question of which TV penalty (anisotropic TV, isotropic TV and vectorized TV) works best for video SCI reconstruction?
121, TITLE: Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory
http://arxiv.org/abs/2005.08042
AUTHORS: Chunyang Wu ; Yongqiang Wang ; Yangyang Shi ; Ching-Feng Yeh ; Frank Zhang
COMMENTS: submitted to Interspeech 2020
HIGHLIGHT: In this work, we proposed a novel augmentedmemory self-attention, which attends on a short segment of theinput sequence and a bank of memories.
122, TITLE: Visual Relationship Detection using Scene Graphs: A Survey
http://arxiv.org/abs/2005.08045
AUTHORS: Aniket Agarwal ; Ayush Mangal ; Vipul
HIGHLIGHT: In this paper, we present a detailed survey on the various techniques for scene graph generation, their efficacy to represent visual relationships and how it has been used to solve various downstream tasks.
123, TITLE: Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models
http://arxiv.org/abs/2005.08053
AUTHORS: Qiang Huang ; Thomas Hain
COMMENTS: Submitted to InterSpeech 2020
HIGHLIGHT: In this paper, a novel model for audio quality assessment is proposed by jointly using bidirectional long short-term memory and an attention mechanism.
124, TITLE: Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehensio
http://arxiv.org/abs/2005.08056
AUTHORS: Hongyu Gong ; Yelong Shen ; Dian Yu ; Jianshu Chen ; Dong Yu
HIGHLIGHT: In this paper, we study machine reading comprehension (MRC) on long texts, where a model takes as inputs a lengthy document and a question and then extracts a text span from the document as an answer.
125, TITLE: Distributed Bounded Model Checking
http://arxiv.org/abs/2005.08063
AUTHORS: Prantik Chatterjee ; Subhajit Roy ; Bui Phi Diep ; Akash Lal
HIGHLIGHT: We present an algorithm that dynamically unfolds the call graph of the program and frequently splits it to create sub-tasks that can be solved in parallel.
126, TITLE: Model-Augmented Actor-Critic: Backpropagating through Paths
http://arxiv.org/abs/2005.08068
AUTHORS: Ignasi Clavera ; Violet Fu ; Pieter Abbeel
COMMENTS: Accepted paper at ICLR 2020
HIGHLIGHT: In this paper, we show how to make more effective use of the model by exploiting its differentiability.
127, TITLE: Ontology and Cognitive Outcomes
http://arxiv.org/abs/2005.08078
AUTHORS: David Limbaugh ; David Kasmier ; Ronald Rudnicki ; James Llinas ; Barry Smith
COMMENTS: 15 pages, 3 figures
HIGHLIGHT: Herein we describe an approach to utilizing outcomes-based learning (OBL) to support these efforts that is based on an ontology of the cognitive processes performed by intelligence analysts.
128, TITLE: A Robust Experimental Evaluation of Automated Multi-Label Classification Methods
http://arxiv.org/abs/2005.08083
AUTHORS: Alex G. C. de Sá ; Cristiano G. Pimenta ; Gisele L. Pappa ; Alex A. Freitas
COMMENTS: GECCO'2020 paper: Submitted and accepted
HIGHLIGHT: In this work, we provide a general comparison of five automated multi-label classification methods -- two evolutionary methods, one Bayesian optimization method, one random search and one greedy search -- on 14 datasets and three designed search spaces.
129, TITLE: Universal Adversarial Perturbations: A Survey
http://arxiv.org/abs/2005.08087
AUTHORS: Ashutosh Chaubey ; Nikhil Agrawal ; Kavya Barnwal ; Keerat K. Guliani ; Pramod Mehta
COMMENTS: 20 pages, 17 figures
HIGHLIGHT: In this paper, we attempt to provide a detailed discussion on the various data-driven and data-independent methods for generating universal perturbations, along with measures to defend against such perturbations.
130, TITLE: Layer-Wise Cross-View Decoding for Sequence-to-Sequence Learning
http://arxiv.org/abs/2005.08081
AUTHORS: Fenglin Liu ; Xuancheng Ren ; Guangxiang Zhao ; Xu Sun
COMMENTS: Achieve state-of-the-art BLEU scores on WMT14 EN-DE, EN-FR, and IWSLT DE-EN datasets
HIGHLIGHT: In this work, we explore to reuse the representations from different encoder layers for layer-wise cross-view decoding, that is, different views of the source sequences are presented to different decoder layers.
131, TITLE: Improving Robustness using Joint Attention Network For Detecting Retinal Degeneration From Optical Coherence Tomography Images
http://arxiv.org/abs/2005.08094
AUTHORS: Sharif Amit Kamran ; Alireza Tavakkoli ; Stewart Lee Zuckerbrod
COMMENTS: \c{opyright} 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
HIGHLIGHT: In this paper we propose the use of disease-specific feature representation as a novel architecture comprised of two joint networks -- one for supervised encoding of disease model and the other for producing attention maps in an unsupervised manner to retain disease specific spatial information.
132, TITLE: Reducibility and Statistical-Computational Gaps from Secret Leakage
http://arxiv.org/abs/2005.08099
AUTHORS: Matthew Brennan ; Guy Bresler
COMMENTS: 175 pages; subsumes preliminary draft arXiv:1908.06130
HIGHLIGHT: The insight in this work is that a slight generalization of the planted clique conjecture -- secret leakage planted clique -- gives rise to a variety of new average-case reduction techniques, yielding a web of reductions among problems with very different structure.
133, TITLE: FiberStars: Visual Comparison of Diffusion Tractography Data between Multiple Subjects
http://arxiv.org/abs/2005.08090
AUTHORS: Loraine Franke ; Daniel Karl I. Weidele ; Fan Zhang ; Suheyla Cetin-Karayumak ; Steve Pieper ; Lauren J. O'Donnell ; Yogesh Rathi ; Daniel Haehn
COMMENTS: 10 pages, 9 figures
HIGHLIGHT: In this paper, we present the design and implementation of FiberStars, a visual analysis tool for tractography data that allows the interactive and scalable visualization of brain fiber clusters in 2D and 3D.
134, TITLE: Imposing Regulation on Advanced Algorithms
http://arxiv.org/abs/2005.08092
AUTHORS: Fotios Fitsilis
COMMENTS: XXI, 82 pages, 5 figures. Cham: Springer
HIGHLIGHT: Imposing Regulation on Advanced Algorithms
135, TITLE: Approximation Algorithms and Hardness for Strong Unique Games
http://arxiv.org/abs/2005.08918
AUTHORS: Suprovat Ghoshal ; Anand Louis
COMMENTS: 67 Pages
HIGHLIGHT: In this paper, we give new algorithmic and hardness results for STRONG UNIQUE GAMES.
136, TITLE: Joint Multi-Dimension Pruning
http://arxiv.org/abs/2005.08931
AUTHORS: Zechun Liu ; Xiangyu Zhang ; Zhiqiang Shen ; Zhe Li ; Yichen Wei ; Kwang-Ting Cheng ; Jian Sun
HIGHLIGHT: We present joint multi-dimension pruning (named as JointPruning), a new perspective of pruning a network on three crucial aspects: spatial, depth and channel simultaneously.
137, TITLE: Reconstructing Maps from Text
http://arxiv.org/abs/2005.08932
AUTHORS: Johnathan E. Avery ; Robert L. Goldstone ; Michael N. Jones
HIGHLIGHT: In this paper we investigate the statistical sources required in language to infer maps, and resulting constraints placed on mechanisms of semantic representation.
138, TITLE: Portrait Shadow Manipulation
http://arxiv.org/abs/2005.08925
AUTHORS: Xuaner Cecilia Zhang ; J onathan T. Barron ; Yun-Ta Tsai ; Rohit Pandey ; Xiuming Zhang ; Ren Ng ; David E. Jacobs
COMMENTS: SIGGRAPH 2020;Project webpage: https://people.eecs.berkeley.edu/~cecilia77/project-pages/portrait Video: https://youtu.be/M_qYTXhzyac
HIGHLIGHT: In this paper, we present a computational approach that gives casual photographers some of this control, thereby allowing poorly-lit portraits to be relit post-capture in a realistic and easily-controllable way. To train our first network we construct a dataset of real-world portraits wherein synthetic foreign shadows are rendered onto the face, and we show that our network learns to remove those unwanted shadows.
139, TITLE: Uncovering Gender Bias in Media Coverage of Politicians with Machine Learning
http://arxiv.org/abs/2005.07734
AUTHORS: Susan Leavy
COMMENTS: 24 pages, 1 figures, 14 tables, Digital Scholarship in Humanities Journal
HIGHLIGHT: This paper presents research uncovering systematic gender bias in the representation of political leaders in the media, using artificial intelligence.
140, TITLE: Semantic Photo Manipulation with a Generative Image Prior
http://arxiv.org/abs/2005.07727
AUTHORS: David Bau ; Hendrik Strobelt ; William Peebles ; Jonas ; Bolei Zhou ; Jun-Yan Zhu ; Antonio Torralba
COMMENTS: SIGGRAPH 2019
HIGHLIGHT: In this paper, we address these issues by adapting the image prior learned by GANs to image statistics of an individual image.
141, TITLE: Disentangling in Latent Space by Harnessing a Pretrained Generator
http://arxiv.org/abs/2005.07728
AUTHORS: Yotam Nitzan ; Amit Bermano ; Yangyan Li ; Daniel Cohen-Or
COMMENTS: 17 pages, 10 figures
HIGHLIGHT: In this paper, we present a method that learn show to represent data in a disentangled way, with minimal supervision, manifested solely using available pre-trained networks.
142, TITLE: In Layman's Terms: Semi-Open Relation Extraction from Scientific Texts
http://arxiv.org/abs/2005.07751
AUTHORS: Ruben Kruiper ; Julian F. V. Vincent ; Jessica Chen-Burger ; Marc P. Y. Desmulliez ; Ioannis Konstas
COMMENTS: To be published in ACL 2020 conference proceedings
HIGHLIGHT: In this work we combine the output of both types of systems to achieve Semi-Open Relation Extraction, a new task that we explore in the Biology domain.
143, TITLE: A Scientific Information Extraction Dataset for Nature Inspired Engineering
http://arxiv.org/abs/2005.07753
AUTHORS: Ruben Kruiper ; Julian F. V. Vincent ; Jessica Chen-Burger ; Marc P. Y. Desmulliez ; Ioannis Konstas
COMMENTS: Published in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)
HIGHLIGHT: This paper describes a dataset of 1,500 manually-annotated sentences that express domain-independent relations between central concepts in a scientific biology text, such as trade-offs and correlations.
144, TITLE: Design Choices for X-vector Based Speaker Anonymization
http://arxiv.org/abs/2005.08601
AUTHORS: Brij Mohan Lal Srivastava ; Natalia Tomashenko ; Xin Wang ; Emmanuel Vincent ; Junichi Yamagishi ; Mohamed Maouche ; Aurélien Bellet ; Marc Tommasi
HIGHLIGHT: In this paper, we present a flexible pseudo-speaker selection technique as a baseline for the first VoicePrivacy Challenge.
145, TITLE: Brain-inspired Distributed Cognitive Architecture
http://arxiv.org/abs/2005.08603
AUTHORS: Leendert A Remmelzwaal ; Amit K Mishra ; George F R Ellis
HIGHLIGHT: In this paper we present a brain-inspired cognitive architecture that incorporates sensory processing, classification, contextual prediction, and emotional tagging.
146, TITLE: The presence of occupational structure in online texts based on word embedding NLP models
http://arxiv.org/abs/2005.08612
AUTHORS: Zoltán Kmetty ; Julia Koltai ; Tamás Rudas
COMMENTS: 34 pages, 2 figures, 4 tables. Paper presented at IC2S2 2019 and RC28 summer meeting 2019 (Columbia University)
HIGHLIGHT: This research focuses on the positions of occupations in the semantic space represented by large amounts of textual data.
147, TITLE: DDD20 End-to-End Event Camera Driving Dataset: Fusing Frames and Events with Deep Learning for Improved Steering Prediction
http://arxiv.org/abs/2005.08605
AUTHORS: Yuhuang Hu ; Jonathan Binas ; Daniel Neil ; Shih-Chii Liu ; Tobi Delbruck
COMMENTS: Accepted in The 23rd IEEE International Conference on Intelligent Transportation Systems (Special Session: Beyond Traditional Sensing for Intelligent Transportation)
HIGHLIGHT: To enable studies of using event cameras in automobile driving applications, this paper reports a new end-to-end driving dataset called DDD20.
148, TITLE: Decoder Modulation for Indoor Depth Completion
http://arxiv.org/abs/2005.08607
AUTHORS: Dmitry Senushkin ; Ilia Belikov ; Anton Konushin
HIGHLIGHT: The main contributions of our work are two-fold.
149, TITLE: End-to-End Lip Synchronisation
http://arxiv.org/abs/2005.08606
AUTHORS: You Jin Kim ; Hee Soo Heo ; Soo-Whan Chung ; Bong-Jin Lee
COMMENTS: interspeech 2020 submit
HIGHLIGHT: The goal of this work is to synchronise audio and video of a talking face using deep neural network models.
150, TITLE: On the Hardness of Red-Blue Pebble Games
http://arxiv.org/abs/2005.08609
AUTHORS: Pál András Papp ; Roger Wattenhofer
HIGHLIGHT: We present various hardness results in different red-blue pebbling variants, with a focus on the oneshot model.
151, TITLE: C3VQG: Category Consistent Cyclic Visual Question Generation
http://arxiv.org/abs/2005.07771
AUTHORS: Shagun Uppal ; Anish Madan ; Sarthak Bhagat ; Yi Yu ; Rajiv Ratn Shah
HIGHLIGHT: In this paper, we try to exploit the different visual cues and concepts in an image to generate questions using a variational autoencoder without the need for ground-truth answers.
152, TITLE: Evolving Antennas for Ultra-High Energy Neutrino Detection
http://arxiv.org/abs/2005.07772
AUTHORS: Julie Rolla ; Amy Connolly ; Kai Staats ; Stephanie Wissel ; Dean Arakaki ; Ian Best ; Adam Blenk ; Brian Clark ; Maximillian Clowdus ; Suren Gourapura ; Corey Harris ; Hannah Hasan ; Luke Letwin ; David Liu ; Carl Pfendner ; Jordan Potter ; Cade Sbrocco ; Tom Sinha ; Jacob Trevithick
COMMENTS: 8 pages including references, 6 figures, presented at 36th International Cosmic Ray Conference (ICRC 2019)
HIGHLIGHT: Evolving Antennas for Ultra-High Energy Neutrino Detection
153, TITLE: Learn Class Hierarchy using Convolutional Neural Networks
http://arxiv.org/abs/2005.08622
AUTHORS: Riccardo La Grassa ; Ignazio Gallo ; Nicola Landro
COMMENTS: 7 pages
HIGHLIGHT: In this paper, we propose a new architecture for hierarchical classification of images, introducing a stack of deep linear layers with cross-entropy loss functions and center loss combined.
154, TITLE: Universalization of any adversarial attack using very few test examples
http://arxiv.org/abs/2005.08632
AUTHORS: Sandesh Kamath ; Amit Deshpande ; K V Subrahmanyam
HIGHLIGHT: In this paper, we propose a simple universalization technique to take any input-dependent adversarial attack and construct a universal attack by only looking at very few adversarial test examples.
155, TITLE: A Learning-from-noise Dilated Wide Activation Network for denoising Arterial Spin Labeling (ASL) Perfusion Images
http://arxiv.org/abs/2005.07784
AUTHORS: Danfeng Xie ; Yiran Li ; Hanlu Yang ; Li Bai ; Lei Zhang ; Ze Wang
HIGHLIGHT: In this study, we proposed a new ASLDN to test whether similar or even better ASL CBF image quality can be achieved in the case of highly noisy training reference.
156, TITLE: A flexible, extensible software framework for model compression based on the LC algorithm
http://arxiv.org/abs/2005.07786
AUTHORS: Yerlan Idelbayev ; Miguel Á. Carreira-Perpiñán
COMMENTS: 15 pages, 4 figures, 2 tables
HIGHLIGHT: We propose a software framework based on the ideas of the Learning-Compression (LC) algorithm, that allows a user to compress a neural network or other machine learning model using different compression schemes with minimal effort.
157, TITLE: WW-Nets: Dual Neural Networks for Object Detection
http://arxiv.org/abs/2005.07787
AUTHORS: Mohammad K. Ebrahimpour ; J. Ben Falandays ; Samuel Spevack ; Ming-Hsuan Yang ; David C. Noelle
COMMENTS: 8 pages, 3 figures
HIGHLIGHT: We propose a new deep convolutional neural network framework that uses object location knowledge implicit in network connection weights to guide selective attention in object detection tasks.
158, TITLE: A Novel Column Generation Heuristic for Airline Crew Pairing Optimization with Large-scale Complex Flight Networks
http://arxiv.org/abs/2005.08636
AUTHORS: Divyam Aggarwal ; Dhish Kumar Saxena ; Thomas Bäck ; Michael Emmerich
COMMENTS: 22 pages, 6 figures, Manuscript to be submitted to a refereed journal
HIGHLIGHT: To bridge the research-gap, this paper proposes a novel CG heuristic, which has enabled in-house development of an Airline Crew Pairing Optimizer (AirCROP ).
159, TITLE: Transformation Based Deep Anomaly Detection in Astronomical Images
http://arxiv.org/abs/2005.07779
AUTHORS: Esteban Reyes ; Pablo A. Estévez
COMMENTS: 8 pages, 6 figures, 4 tables. Accepted for publication in proceedings of the IEEE World Congress on Computational Intelligence (IEEE WCCI), Glasgow, UK, 19-24 July, 2020
HIGHLIGHT: In this work, we propose several enhancements to a geometric transformation based model for anomaly detection in images (GeoTranform).
160, TITLE: FuSSI-Net: Fusion of Spatio-temporal Skeletons for Intention Prediction Network
http://arxiv.org/abs/2005.07796
AUTHORS: Francesco Piccoli ; Rajarathnam Balakrishnan ; Maria Jesus Perez ; Moraldeepsingh Sachdeo ; Carlos Nunez ; Matthew Tang ; Kajsa Andreasson ; Kalle Bjurek ; Ria Dass Raj ; Ebba Davidsson ; Colin Eriksson ; Victor Hagman ; Jonas Sjoberg ; Ying Li ; L. Srikar Muppirisetty ; Sohini Roychowdhury
COMMENTS: 5 pages, 6 figures, 5 tables, IEEE Asilomar SSC
HIGHLIGHT: In this work, we develop an end-to-end pedestrian intention framework that performs well on day- and night- time scenarios.
161, TITLE: JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment
http://arxiv.org/abs/2005.07799
AUTHORS: Dan Lim ; Won Jang ; Gyeonghwan O ; Hyeyeong Park ; Bongwan Kim ; Jesam Yoon
COMMENTS: submitted to INTERSPEECH 2020
HIGHLIGHT: We propose Jointly trained Duration Informed Transformer (JDI-T), a feed-forward Transformer with a duration predictor jointly trained without explicit alignments in order to generate an acoustic feature sequence from an input text.
162, TITLE: Building BROOK: A Multi-modal and Facial Video Database for Human-Vehicle Interaction Research
http://arxiv.org/abs/2005.08637
AUTHORS: Xiangjun Peng ; Zhentao Huang ; Xu Sun
COMMENTS: Conference: ACM CHI Conference on Human Factors in Computing Systems Workshops (CHI'20 Workshops)At: Honolulu, Hawaii, USA URL:https://emergentdatatrails.com
HIGHLIGHT: In this paper, we present our work-in-progress BROOK, a public multi-modal database with facial video records, which could be used to characterize drivers' affective states and driving styles.
163, TITLE: Conversational Search -- A Report from Dagstuhl Seminar 19461
http://arxiv.org/abs/2005.08658
AUTHORS: Avishek Anand ; Lawrence Cavedon ; Matthias Hagen ; Hideo Joho ; Mark Sanderson ; Benno Stein
COMMENTS: contains arXiv:2001.06910, arXiv:2001.02912
HIGHLIGHT: The ideas and findings presented in this report should serve as one of the main sources for diverse research programs on Conversational Search.
164, TITLE: An Overview of Privacy in Machine Learning
http://arxiv.org/abs/2005.08679
AUTHORS: Emiliano De Cristofaro
HIGHLIGHT: In this document, we set to review privacy challenges in this space, providing a systematic review of the relevant research literature, also exploring possible countermeasures.
165, TITLE: Building a Hebrew Semantic Role Labeling Lexical Resource from Parallel Movie Subtitles
http://arxiv.org/abs/2005.08206
AUTHORS: Ben Eyal ; Michael Elhadad
COMMENTS: 9 pages, 7 figures, accepted to LREC 2020
HIGHLIGHT: We present a semantic role labeling resource for Hebrew built semi-automatically through annotation projection from English.
166, TITLE: Quantifying the Impact on Software Complexity of Composable Inductive Programming using Zoea
http://arxiv.org/abs/2005.08211
AUTHORS: Edward McDaid ; Sarah McDaid
COMMENTS: 8 pages, 8 figures
HIGHLIGHT: This paper presents the results of a quantitative comparison of the software complexity of equivalent code implemented in Zoea and also in a conventional programming language.
167, TITLE: Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
http://arxiv.org/abs/2005.08213
AUTHORS: Won Ik Cho ; Donghyun Kwak ; Jiwon Yoon ; Nam Soo Kim
COMMENTS: Preprint; 5 pages, 1 figure, 4 tables
HIGHLIGHT: We demonstrate the validity of our proposal upon the performance on the Fluent Speech Command dataset.
168, TITLE: Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
http://arxiv.org/abs/2005.08209
AUTHORS: K R Prajwal ; Rudrabha Mukhopadhyay ; Vinay Namboodiri ; C V Jawahar
COMMENTS: 10 pages (including references), 5 figures, Accepted in CVPR, 2020
HIGHLIGHT: In this work, we explore the task of lip to speech synthesis, i.e., learning to generate natural speech given only the lip movements of a speaker. To this end, we collect and release a large-scale benchmark dataset, the first of its kind, specifically to train and evaluate the single-speaker lip to speech task in natural settings.
169, TITLE: LiSSS: A toy corpus of Literary Spanish Sentences Sentiment for Emotions Detection
http://arxiv.org/abs/2005.08223
AUTHORS: Juan-Manuel Torres-Moreno ; Luis-Gil Moreno-Jiménez
COMMENTS: 8 pages, 3 tables
HIGHLIGHT: In this work we present a new and small corpus in the area of Computational Creativity (CC), the Literary Sentiment Sentence Spanish Corpus (LISSS).
170, TITLE: Deep Learning for Community Detection: Progress, Challenges and Opportunities
http://arxiv.org/abs/2005.08225
AUTHORS: Fanzhen Liu ; Shan Xue ; Jia Wu ; Chuan Zhou ; Wenbin Hu ; Cecile Paris ; Surya Nepal ; Jian Yang ; Philip S. Yu
COMMENTS: Accepted Paper in the 29th International Joint Conference on Artificial Intelligence (IJCAI 20), Survey Track
HIGHLIGHT: Structured into three broad research streams in this domain - deep neural networks, deep graph embedding, and graph neural networks, this article summarizes the contributions of the various frameworks, models, and algorithms in each stream along with the current challenges that remain unsolved and the future research opportunities yet to be explored.
171, TITLE: #Coronavirus or #Chinesevirus?!: Understanding the negative sentiment reflected in Tweets with racist hashtags across the development of COVID-19
http://arxiv.org/abs/2005.08224
AUTHORS: Xin Pei ; Deval Mehta
HIGHLIGHT: Especially, we propose a stage-based approach to capture how the negative sentiment changes along with the three development stages of COVID-19, under which it transformed from a domestic epidemic into an international public health emergency and later, into the global pandemic.
172, TITLE: Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation
http://arxiv.org/abs/2005.08230
AUTHORS: Boris Knyazev ; Harm de Vries ; Cătălina Cangea ; Graham W. Taylor ; Aaron Courville ; Eugene Belilovsky
COMMENTS: 17 pages, the code is available at https://github.com/bknyaz/sgg
HIGHLIGHT: In this paper, we identify two key issues that limit such generalization.
173, TITLE: Studying the Transfer of Biases from Programmers to Programs
http://arxiv.org/abs/2005.08231
AUTHORS: Tore Pedersen ; Christian Johansen ; Johanna Johansen
COMMENTS: 40 pages of which 7 pages of Appendix, 26 Figures, 2 Tables
HIGHLIGHT: It is generally agreed that one origin of machine bias is resulting from characteristics within the dataset on which the algorithms are trained, i.e., the data does not warrant a generalized inference.
174, TITLE: FuCiTNet: Improving the generalization of deep learning networks by the fusion of learned class-inherent transformations
http://arxiv.org/abs/2005.08235
AUTHORS: Manuel Rey-Area ; Emilio Guirado ; Siham Tabik ; Javier Ruiz-Hidalgo
HIGHLIGHT: This work presents a new approach, independent but complementary to the previous mentioned techniques, for improving the generalization of DNNs on very small datasets in which the involved classes share many visual features.
175, TITLE: Dual Learning: Theoretical Study and an Algorithmic Extension
http://arxiv.org/abs/2005.08238
AUTHORS: Zhibing Zhao ; Yingce Xia ; Tao Qin ; Lirong Xia ; Tie-Yan Liu
COMMENTS: 11 pages, 2 figures
HIGHLIGHT: In this paper, we aim at understanding why and when dual learning works.
176, TITLE: Dampen the Stop-and-Go Traffic with Connected and Automated Vehicles -- A Deep Reinforcement Learning Approach
http://arxiv.org/abs/2005.08245
AUTHORS: Liming Jiang ; Yuanchang Xie ; Danjue Chen ; Tienan Li ; Nicholas G. Evans
HIGHLIGHT: Instead of using analytical model, this study adopts reinforcement learning to control the behavior of CAV and put a single CAV at the 2nd position of a vehicle fleet with the purpose to dampen the speed oscillation from the fleet leader and help following human drivers adopt more smooth driving behavior.
177, TITLE: On the Combined Use of Extrinsic Semantic Resources for Medical Information Search
http://arxiv.org/abs/2005.08259
AUTHORS: Mohammed Maree ; Israa Noor ; Khaled Rabayah ; Mohammed Belkhatir ; Saadat M. Alhashmi
HIGHLIGHT: In this article, we explore the combination of multiple extrinsic semantic resources in the development of a full-fledged medical information search framework to: i) highlight and expand head medical concepts in verbose medical queries (i.e. concepts among query terms that significantly contribute to the informativeness and intent of a given query), ii) build semantically enhanced inverted index documents, iii) contribute to a heuristical weighting technique in the query document matching process.
178, TITLE: High-dimensional Convolutional Networks for Geometric Pattern Recognition
http://arxiv.org/abs/2005.08144
AUTHORS: Christopher Choy ; Junha Lee ; Rene Ranftl ; Jaesik Park ; Vladlen Koltun