简单整理下人脸方向ICCV2019相关的论文。
Oral 论文
11.Photo-Realistic Facial Details Synthesis from Single Image
作者:Anpei Chen, Zhang Chen, Guli Zhang, Ziheng Zhang, Kenny Mitchell, Jingyi Yu
论文链接:https://arxiv.org/abs/1903.10873
GitHub链接:https://github.com/apchenstu/Facial_Details_Synthesis
论文解读:ICCV 2019 Oral | 三维”ZAO”脸: 单张图片估计人脸几何,效果堪比真实皮肤
10.Learnable Triangulation of Human Pose
作者:Karim Iskakov, Egor Burkov, Victor Lempitsky, Yury Malkov
论文链接:https://arxiv.org/abs/1905.05754
Github链接:https://github.com/karfly/learnable-triangulation-pytorch
项目链接:https://saic-violet.github.io/learnable-triangulation/
9.Learning Implicit Generative Models by Matching Perceptual Features
作者:Cicero Nogueira dos Santos, Youssef Mroueh, Inkit Padhi, Pierre Dognin
论文链接:https://arxiv.org/abs/1904.02762v1
8.COCO-GAN: Generation by Parts via Conditional Coordinating
作者:Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen
论文链接:https://arxiv.org/abs/1904.00284
Github链接:https://github.com/hubert0527/COCO-GAN
项目链接:https://hubert0527.github.io/COCO-GAN/
7.SlowFast Networks for Video Recognition
作者:Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik, and Kaiming He
论文链接:https://arxiv.org/abs/1812.03982
6.Exploring Randomly Wired Neural Networks for Image Recognition
作者:Saining Xie, Alexander Kirillov, Ross Girshick, and Kaiming He
论文链接:https://arxiv.org/abs/1904.01569
5.Can GCNs Go as Deep as CNNs?
作者:Guohao Li, Matthias Müller, Ali Thabet, Bernard Ghanem
论文链接:https://arxiv.org/abs/1904.03751
Github链接:https://github.com/lightaime/deep_gcns
4.Deep SR-ITM: Joint Learning of Super-resolution and Inverse Tone-Mapping for 4K UHD HDR Applications
作者:Soo Ye Kim, Jihyong Oh, Munchurl Kim
论文链接:https://arxiv.org/abs/1904.11176
3.Meta-Sim Learning to Generate Synthetic Datasets
作者:Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler
论文链接:https://arxiv.org/abs/1904.11621
项目链接:https://nv-tlabs.github.io/meta-sim/
2.Deep HoughVoting for 3D Object Detection in Point Clouds
作者:Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas
论文链接:https://arxiv.org/abs/1904.09664
1.Variational Adversarial Active Learning
作者:Samarth Sinha, Sayna Ebrahimi, Trevor Darrell
论文链接:https://arxiv.org/abs/1904.00370
目标检测
1、ThunderNet: Towards Real-time Generic Object Detection
ThunderNet:走向实时通用目标检测
作者:Zheng Qin, Zeming Li, Zhaoning Zhang, Yiping Bao, Gang Yu, Yuxing Peng, Jian Sun
论文链接:https://arxiv.org/abs/1903.11752
论文解读:http://bbs.cvmart.net/articles/361
2、MemorizingNormality to Detect Anomaly: Memory-augmented Deep Autoencoder (MemAE) forUnsupervised Anomaly Detection
MemorizingNormality检测异常:内存增强深度自动编码器(MemAE)用于非监督异常检测
作者:Dong Gong, Lingqiao Liu, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, Anton van den Hengel
项目链接:https://donggong1.github.io/anomdec-memae.html
论文链接:https://arxiv.org/abs/1904.02639
GitHub:https://github.com/donggong1/memae-anomaly-detection
3、Deep Hough Voting for 3D Object Detection in Point Clouds(Oral)
深入投票进行点云中的三维目标检测
作者:Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas
论文链接:https://arxiv.org/abs/1904.09664
4、Multi-adversarial Faster-RCNN for Unrestricted Object Detection
用于无限制目标检测的多对抗性更快RCNN
作者:Zhenwei He, Lei Zhang
论文链接:https://arxiv.org/abs/1907.10343
5、FCOS: Fully Convolutional One-Stage Object Detection
FCOS:完全卷积一级目标检测
作者:Zhi Tian, Chunhua Shen, Hao Chen, Tong He
论文链接:https://arxiv.org/abs/1904.01355
Github链接:https://github.com/tianzhi0549/FCOS/
论文解读: https://mp.weixin.qq.com/s/N93TrVnUuvAgfcoHXevTHw
6、Simultaneous multi-view instance detection with learned geometric soft-constraints
使用学习的几何软约束同时进行多视图实例检测
作者:Ahmed Samy Nassar, Sebastien Lefevre, Jan D. Wegner
论文链接:https://arxiv.org/abs/1907.10892
7、Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection
Cap2Det:学习放大目标检测的弱字幕监控
作者:Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li, Danfeng Qin, Jesse Berent
论文链接:https://arxiv.org/abs/1907.10164
8、Towards Adversarially Robust Object Detection
对抗强大的目标检测
作者:Haichao Zhang, Jianyu Wang
论文链接:https://arxiv.org/abs/1907.10310
9、Few-shot Object Detection via Feature Reweighting
通过特征重新加权的快速物体检测
作者:Bingyi Kang, Zhuang Liu, Xin Wang, Fisher Yu, Jiashi Feng, Trevor Darrell
论文链接:https://arxiv.org/pdf/1812.01866.pdf
10、Optimizing the F-measure for Threshold-free Salient Object Detection
优化无阈值显着物体检测的F-测量
作者:Kai Zhao, Shanghua Gao, Wenguan Wang, Ming-ming Cheng
论文链接:http://data.kaizhao.net/publications/iccv2019fmeasure.pdf
Github链接:https://github.com/zeakey/iccv2019-fmeasure
项目链接:http://kaizhao.net/fmeasure
11、Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection
深度诱导多尺度重复注意网络用于显著性检测
作者:Yongri Piao, Wei Ji, Jingjing Li, Miao Zhang, Huchuan Lu
Github链接:https://github.com/jiwei0921/DMRA_RGBD-SOD
10、Learning Lightweight Lane Detection CNNs by Self Attention Distillation
通过自注意蒸馏学习轻量级车道检测神经网络
作者:Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy
论文链接:https://arxiv.org/abs/1908.00821
Github链接:https://github.com/cardwing/Codes-for-Lane-Detection
11、Towards High-Resolution Salient Object Detection
实现高分辨率突出目标检测
作者:Yi Zeng, Pingping Zhang, Jianming Zhang, Zhe Lin, Huchuan Lu
论文链接:https://arxiv.org/abs/1908.07274
Github链接:https://github.com/yi94code/HRSOD
12、Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection
教师指导学生如何从部分标记的图像中学习识别面部地标
作者:Xuanyi Dong, Yi Yang
论文链接:https://arxiv.org/abs/1908.02116
Github链接:https://github.com/D-X-Y/landmark-detection
13、Temporally-Aggregating Spatial Encoder-Decoder for Video Saliency Detection
用于视频显著性检测的时间聚合空间编解码器
Github链接:https://github.com/kylemin/TASED-Net
14、SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects
SCRDet:对小的,杂乱的和旋转的物体进行更加稳健的检测
作者:Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun Fu
论文链接:https://arxiv.org/abs/1811.07126
Github链接:https://github.com/DetectionTeamUCAS
15、Clustered Object detection in aerial images
航拍图像中的聚类物体检测
作者:Fan Yang, Heng Fan, Peng Chu, Erik Blasch, Haibin Ling
论文链接:https://arxiv.org/pdf/1904.08008
16、Relation Distillation Networks for Video Object Detection
用于视频对象检测的关联蒸馏网络
作者: Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, Tao Mei
论文链接:https://arxiv.org/abs/1908.09511
17、Scaling Object Detection by Transferring Classification Weights
通过转移分类权值来缩放目标检测
作者:Jason Kuen, Federico Perazzi, Zhe Lin, Jianming Zhang, Yap-Peng Tan
论文链接:https://arxiv.org/abs/1909.06804
Github链接:https://github.com/xternalz/AE-WTN
18、WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection
WSOD^2:学习自底向上和自顶向下的对象精馏,用于弱监督对象检测
作者:Zhaoyang Zeng, Bei Liu, Jianlong Fu, Hongyang Chao, Lei Zhang
论文链接:https://arxiv.org/abs/1909.04972
图像分割
1、Incremental Class Discovery for Semantic Segmentation with RGBD Sensing
用RGBD传感进行语义分割的增量类发现
作者:Yoshikatsu Nakajima, Byeongkeun Kang, Hideo Saito, Kris Kitani
论文链接:https://arxiv.org/abs/1907.10008
2、TensorMask: A Foundation for Dense Object Segmentation
TensorMask:密集对象分割的基础
作者:Xinlei Chen, Ross Girshick, Kaiming He, and Piotr Dollár
论文链接:https://arxiv.org/abs/1903.12174
3、Orientation-aware Semantic Segmentation on Icosahedron Spheres
二十面体球面上的方向感知语义分割
作者:Chao Zhang, Stephan Liwicki, William Smith, Roberto Cipolla
论文链接:https://arxiv.org/abs/1907.12849
4、Expectation-Maximization Attention Networks for Semantic Segmentation (Oral)
语义分割的期望最大化注意网络
作者: Xia Li, Zhisheng Zhong, Jianlong Wu, Yibo Yang, Zhouchen Lin, Hong Liu
论文链接:https://arxiv.org/abs/1907.13426
5、ACE: Adapting to Changing Environments for Semantic Segmentation
ACE:适应不断变化的环境进行语义分割
作者:Zuxuan Wu, Xin Wang, Joseph E. Gonzalez, Tom Goldstein, Larry S. Davis
论文链接:https://arxiv.org/pdf/1904.06268.pdf
6、CCNet: Criss-Cross Attention for Semantic Segmentation
CCNet:交叉关注语义分割
作者:Zilong Huang, Xinggang Wang, Lichao Huang, Chang Huang, Yunchao Wei, Wenyu Liu
论文链接:https://arxiv.org/abs/1811.11721
Github链接:https://github.com/speedinghzl/CCNet
7、SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
SemanticKITTI:一个用于激光雷达序列语义场景理解的数据集
作者:J. Behley, M. Garbade, A. Milioto, J. Quenzel, S. Behnke, C. Stachniss, and J. Gall
论文链接:https://arxiv.org/abs/1904.01416
8、DADA: Depth-Aware Domain Adaptation in Semantic Segmentation
作者:Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez
论文链接:https://arxiv.org/abs/1904.01886
9、Weakly Supervised Energy-Based Learning for Action Segmentation(Oral)
弱监督能源行动学习分割
Github链接:https://github.com/JunLi-Galios/CDFL
10、Explicit Shape Encoding for Real-Time Instance Segmentation
实时实例分割的显式形状编码
作者:Wenqiang Xu, Haiyang Wang, Fubo Qi, Cewu Lu
论文链接:https://arxiv.org/abs/1908.04067
11、ACFNet: Attentional Class Feature Network for Semantic Segmentation
用于语义分割的注意类特征网络
作者:Fan Zhang, Yanqin Chen, Zhihang Li, Zhibin Hong, Jingtuo Liu, Feifei Ma, Junyu Han, Errui Ding
论文链接:https://arxiv.org/abs/1909.09408
12、Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation
层次式点-边交互网络用于点云语义分割
作者:Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia
论文链接:https://arxiv.org/abs/1909.10469
13、SSAP: Single-Shot Instance Segmentation With Affinity Pyramid
SSAP:带有亲缘金字塔的单点实例分割
作者:Naiyu Gao, Yanhu Shan, Yupei Wang, Xin Zhao, Yinan Yu, Ming Yang, Kaiqi Huang
论文链接:https://arxiv.org/abs/1909.01616
姿态估计
1、Ego-Pose Estimation and Forecasting as Real-Time PD Control
Ego-Pose估计和预测作为实时PD控制
作者:Ye Yuan, Kris Kitani
论文链接:https://arxiv.org/abs/1906.03173
2、xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera
xR-EgoPose:HMD相机的以自我为中心的3D人体姿态
作者:Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino
论文链接:https://arxiv.org/abs/1907.10045
3、Selectivity or Invariance: Boundary-aware Salient Object Detection
选择性或不变性:边界感知的突出物体检测
作者:Jinming Su, Jia Li1, Yu Zhang, Changqun Xia and Yonghong Tian
论文链接:https://arxiv.org/pdf/1812.10066.pdf
4、Learnable Triangulation of Human Pose(Oral)
人体姿态的可验证三角测量
作者:Karim Iskakov, Egor Burkov, Victor Lempitsky, Yury Malkov
论文链接:https://arxiv.org/abs/1905.05754
Github链接:https://github.com/karfly/learnable-triangulation-pytorch
项目链接:https://saic-violet.github.io/learnable-triangulation/
5、Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image
来自单个RGB图像的3D多人姿态估计的摄像距离感知自上而下方法
作者:Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee
论文链接:https://arxiv.org/abs/1907.11346
Github链接:https://github.com/mks0601/3DMPPE_ROOTNET_RELEASE
6、Pose-aware Dynamic Attention for Human Object Interaction Detection
姿态感知动态注意用于人体目标交互检测
Github链接:https://github.com/bobwan1995/Pose-aware-Dynamic-Attention-for-Human-Object-Interaction-Detection
7、SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation with Semi-supervised Learning
基于半监督学习的三维手部姿态估计自组织网络
Github链接:https://github.com/TerenceCYJ/SO-HandNet
8、Dynamic Kernel Distillation for Efficient Pose Estimation in Videos
动态核蒸馏在视频中的有效姿态估计
作者:Xuecheng Nie, Yuncheng Li, Linjie Luo, Ning Zhang, Jiashi Feng
论文链接:https://arxiv.org/abs/1908.09216
9、Single-Stage Multi-Person Pose Machines
单级多人姿势机器
作者:Xuecheng Nie, Jianfeng Zhang, Shuicheng Yan, Jiashi Feng
论文链接:https://arxiv.org/abs/1908.09220
10、Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images
利用多视图图像进行人体姿态和形状重建
作者:Junbang Liang, Ming C. Lin
论文链接:https://arxiv.org/abs/1908.09464
11、Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
整体++场景理解:单视图三维整体场景解析和人-物交互和物理常识下的人体姿态估计
作者:Yixin Chen, Siyuan Huang, Tao Yuan, Siyuan Qi, Yixin Zhu, Song-Chun Zhu
论文链接:https://arxiv.org/abs/1909.01507
12、Imitation Learning for Human Pose Prediction
模仿学习用于人体姿态预测
作者:Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles
论文链接:https://arxiv.org/abs/1909.03449
目标跟踪
1、Joint Monocular 3D Detection and Tracking
联合单目3D检测和跟踪
作者:Hou-Ning Hu, Qizhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, Fisher Yu
论文链接:https://arxiv.org/abs/1811.10742
项目链接:https://eborboihuc.github.io/Mono-3DT/?fbclid=IwAR1maTNHE5z-vEwAJKIcNEpbMWwBcjWJQ0gEHOwHB-u51w5dfeiZNCh0y-U
GitHub:https://github.com/ucbdrive/3d-vehicle-tracking
2、Deep Meta Learning for Real-Time Target-Aware Visual Tracking
用于实时目标感知视觉跟踪的深度元学习
作者:Janghoon Choi, Junseok Kwon, and Kyoung Mu Lee
论文链接:https://arxiv.org/pdf/1712.09153.pdf
3、Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking
学习畸变抑制相关滤波器用于无人机实时跟踪
作者:Ziyuan Huang, Changhong Fu, Yiming Li, Fuling Lin, Peng Lu
论文链接:https://arxiv.org/abs/1908.02231
4、Robust Multi-Modality Multi-Object Tracking
鲁棒多模态多目标跟踪
作者:Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy
论文链接:https://arxiv.org/abs/1909.03850
Github链接:https://github.com/ZwwWayne/mmMOT
人脸
1、Video Face Clustering with Unknown Number of Clusters
视频人脸聚类,聚类个数未知
作者:M. Tapaswi, M. T. Law, and S. Fidler
Github链接:https://github.com/makarandtapaswi/BallClustering_ICCV2019
2、Probabilistic Face Embeddings
作者:Yichun Shi, Anil K. Jain
论文链接:https://arxiv.org/abs/1904.09658
Github链接:https://github.com/seasonSH/Probabilistic-Face-Embeddings
3、Photo-Realistic Facial Details Synthesis from Single Image(Oral)
从单张图像合成的真实面部细节
作者:Anpei Chen, Zhang Chen, Guli Zhang, Ziheng Zhang, Kenny Mitchell, Jingyi Yu
论文链接:https://arxiv.org/abs/1903.10873
Github链接: https://github.com/apchenstu/Facial_Details_Synthesis
ReID
1、One Shot Domain Adaptation for Person Re-Identification(Oral )
作者:Yang Fu, Yunchao Wei, Guanshuo Wang, Jiwei Li, Xi Zhou, Honghui Shi, Thomas Huang
论文链接:https://arxiv.org/abs/1811.10144
Github链接:https://github.com/OasisYang/SSG
2、ABD-Net: Attentive but Diverse Person Re-Identification
ABD-Net:专注而多元的人重新识别
作者:Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, Zhangyang Wang
论文链接:https://arxiv.org/abs/1908.01114
Github链接:https://github.com/TAMU-VITA/ABD-Net
3、A Novel Unsupervised Camera-aware Domain Adaptation Framework for Person Re-identification
一种新的无监督摄像机感知域适应框架,用于人员重新识别
作者:Lei Qi, Lei Wang, Jing Huo, Luping Zhou, Yinghuan Shi, Yang Gao
论文链接:https://arxiv.org/abs/1904.03425
4、advPattern: Physical-World Attacks on Deep Person Re-Identification via Adversarially Transformable Patterns
advPattern:物理世界通过可逆变换模式对人重新识别的攻击
作者:Zhibo Wangy, Siyan Zhengy, Mengkai Songy, Qian Wangy, Alireza Rahimpourz, Hairong Qi
论文链接:https://arxiv.org/abs/1908.09327
5、Re-ID Driven Localization Refinement for Person Search
reid驱动的人员搜索本地化细化
作者:Chuchu Han, Jiacheng Ye, Yunshan Zhong, Xin Tan, Chi Zhang, Changxin Gao, Nong Sang
论文链接:https://arxiv.org/abs/1909.08580
6、Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation
跨数据集人员重新识别通过无监督的姿态解缠和适应
作者:Yu-Jhe Li, Ci-Siang Lin, Yan-Bo Lin, Yu-Chiang Frank Wang
论文链接:https://arxiv.org/abs/1909.09675
OCR
1、GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition
GA-DAN:用于场景文本检测和识别的几何感知域适应网络
作者:Fangneng Zhan, Chuhui Xue, Shijian Lu
论文链接:https://arxiv.org/abs/1907.09653
2、Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion(Oral )
用于全分辨率3D语义场景完成的嵌入式上下文金字塔
作者:Pingping Zhang, Wei Liu, Yinjie Lei, Huchuan Lu, Xiaoyun Yang
论文链接:https://arxiv.org/abs/1908.00382
3、Symmetry-constrained Rectification Network for Scene Text Recognition
用于场景文本识别的对称约束校正网络
作者:MingKun Yang, Yushuo Guan, Minghui Liao, Xin He, Kaigui Bian, Song Bai, Cong Yao, Xiang Bai
论文链接:https://arxiv.org/abs/1908.01957
4、Towards Unconstrained End-to-End Text Spotting(Oral )
Towards无约束的端到端文本定位
作者:Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao
论文链接:https://arxiv.org/abs/1908.09231
5、SPGNet: Semantic Prediction Guidance for Scene Parsing
SPGNet:场景分析的语义预测指南
作者:Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong, Thomas Huang, Wen-Mei Hwu, Honghui Shi
论文链接:https://arxiv.org/abs/1908.09798
6、CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
CAMP:用于文本-图像检索的跨模式自适应消息传递
作者:Zihao Wang, Xihui Liu, Hongsheng Li, Lu Sheng, Junjie Yan, Xiaogang Wang, Jing Shao
论文链接:https://arxiv.org/abs/1909.05506
7、Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning
中文街景文本:部分监督学习的大型中文文本阅读
作者: Yipeng Sun, Jiaming Liu, Wei Liu, Junyu Han, Errui Ding, Jingtuo Liu
论文链接:https://arxiv.org/abs/1909.07808
8、Large-scale Tag-based Font Retrieval with Generative Feature Learning
基于标签的大规模字体检索与生成特征学习
作者:Tianlang Chen, Zhaowen Wang, Ning Xu, Hailin Jin, Jiebo Luo
论文链接:https://arxiv.org/abs/1909.02072
9、Visual Semantic Reasoning for Image-Text Matching(Oral)
图像-文本匹配的视觉语义推理
作者:Kunpeng Li, Yulun Zhang, Kai Li, Yuanyuan Li, Yun Fu
论文链接:https://arxiv.org/abs/1909.02701
Github链接:https://github.com/KunpengLi1994/VSRN
10、Dynamic Context Correspondence Network for Semantic Alignment
用于语义对齐的动态上下文通信网络
作者:Shuaiyi Huang, Qiuyue Wang, Songyang Zhang, Shipeng Yan, Xuming He
论文链接:https://arxiv.org/abs/1909.03444
视频
1、HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
HowTo100M:通过观看数以亿计的视频剪辑来学习文本视频嵌入
作者:Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, Josef Sivic
论文链接:https://arxiv.org/abs/1906.03327
2、VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
用于视频和语言研究的大规模,高质量多语言数据集
作者:Xin Wang, Jiawei Wu, Junkun Chen, Lei Li, Yuan-Fang Wang, William Yang Wang
论文链接:https://arxiv.org/abs/1904.03493
项目链接:http://vatex.org/main/index.html
论文解读:https://mp.weixin.qq.com/s/bOpKXshitpQ1YKE53WUPEw
3、BMN: Boundary-Matching Network for Temporal Action Proposal Generation
BMN:用于生成时间行动提案的边界匹配网络
作者:Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen
论文链接:https://arxiv.org/abs/1907.09702
4、Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN
使用3D门控卷积和时间PatchGAN进行自由视频修复
作者:Ya-Liang Chang, Zhe Yu Liu, Kuan-Ying Lee, Winston Hsu
论文链接:https://arxiv.org/abs/1904.10247
Github链接:https://github.com/amjltc295/Free-Form-Video-Inpainting
5、SlowFast Networks for Video Recognition(Oral)
用于视频识别的SlowFast网络
作者:Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik, and Kaiming He
论文链接:https://arxiv.org/abs/1812.03982
6、Point-to-Point Video Generation
点对点视频生成
作者:Tsun-Hsuan Wang, Yen-Chi Cheng, Chieh Hubert Lin, Hwann-Tzong Chen, Min Sun
论文链接:https://arxiv.org/abs/1904.02912
项目链接:https://zswang666.github.io/P2PVG-Project-Page/?fbclid=IwAR1Cr-T54keo5zzaWLQuYNQMcPoKzXGr6-YrTDoauW6Hb5bOSwgluZQ3fIE
7、Disentangling Propagation and Generation for Video Prediction
作者:Hang Gao, Huazhe Xu, Qi-Zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell
论文链接:https://arxiv.org/pdf/1812.00452.pdf
8、Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition (Oral)
作者:Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Shilei Wen
论文链接:https://arxiv.org/abs/1907.13369
9、VideoBERT: A Joint Model for Video and Language Representation Learning ( Oral )
VideoBERT:视频和语言表征学习的联合模型
作者:Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid
论文链接:https://arxiv.org/abs/1904.01766
10、TSM: Temporal Shift Module for Efficient Video Understanding
时间转移模块,用于高效的视频理解
作者:Ji Lin, Chuang Gan, Song Han
论文链接:https://arxiv.org/abs/1811.08383
Github链接:https://github.com/mit-han-lab/temporal-shift-module
11、Exploiting temporal consistency for real-time video depth estimation
利用时间一致性进行实时视频深度估计
Github链接:https://github.com/hkzhang91/ST-CLSTM
12、EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
EPIC-Fusion:以自我为中心的动作识别的视听时间绑定
作者:Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen
Github链接:https://github.com/ekazakos/temporal-binding-network
13、Remote Heart Rate Measurement from Highly Compressed Facial Videos: an End-to-end Deep Learning Solution with Video Enhancement
远程压缩面部视频的心率测量:具有视频增强功能的端到端深度学习解决方案
作者:Zitong Yu, Wei Peng, Xiaobai Li, Xiaopeng Hong, Guoying Zhao
论文链接:https://arxiv.org/abs/1907.11921
14、Onion-Peel Networks for Deep Video Completion
用于深度视频完成的Onion-Peel网络
作者:Seoung Wug Oh, Sungho Lee, Joon-Young Lee, Seon Joo Kim
论文链接:https://arxiv.org/abs/1908.08718
15、An Internal Learning Approach to Video Inpainting
一种内部学习方法的视频Inpainting
作者:Haotian Zhang, Long Mai, Ning Xu, Zhaowen Wang, John Collomosse, Hailin Jin
论文链接:https://arxiv.org/abs/1909.07957
16、Graph Convolutional Networks for Temporal Action Localization
时间动作定位的图卷积网络
作者:Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan
论文链接:https://arxiv.org/abs/1909.03252
超分辨率
1、Deep SR-ITM: Joint Learning of Super-resolution and Inverse Tone-Mapping for 4K UHD HDR Applications(Oral)
Deep SR-ITM:4K UHD HDR应用的超分辨率和反色调映射联合学习
作者:Soo Ye Kim, Jihyong Oh, Munchurl Kim
论文链接:https://arxiv.org/abs/1904.11176
2、Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model
面向现实世界的单幅图像超分辨率:一个新的基准和一个新的模型
论文链接:https://csjcai.github.io/papers/RealSR.pdf
Github链接:https://github.com/csjcai/RealSR
自动驾驶
1、Exploring the Limitations of Behavior Cloning for Autonomous Driving
探讨自动驾驶行为克隆的局限性
作者:Felipe Codevilla, Eder Santana, Antonio M. López, Adrien Gaidon
论文链接:https://arxiv.org/abs/1904.08980
Github链接:https://github.com/felipecode/coiltraine/blob/master/docs/exploring_limitations.md
2、Scalable Place Recognition Under Appearance Change for Autonomous Driving(Oral )
可伸缩位置识别下外观变化自主驾驶
作者:Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Thanh-Toan Do, Ian Reid
论文链接:https://arxiv.org/abs/1908.00178
3D、点云
1、Deep Hough Voting for 3D Object Detection in Point Clouds(Oral)
DHV:点云中的三维物体检测
作者:Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas
论文链接:https://arxiv.org/abs/1904.09664
2、3D Point Cloud Learning for Large-scale Environment Analysis and Place Recognition
3D点云学习用于大规模环境分析和场所识别
作者:Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Yingtian Liu, Hesheng Wang, Yun-Hui Liu
论文链接:https://arxiv.org/abs/1812.07050
3、Learning to Reconstruct 3D Manhattan Wireframes from a Single Image ( Oral )
学习从单个图像重建3D曼哈顿线框
作者:Yichao Zhou, Haozhi Qi, Yuexiang Zhai, Qi Sun, Zhili Chen, Li-Yi Wei, Yi Ma
论文链接:https://arxiv.org/abs/1905.07482
4、GarNet: A Two-stream Network for Fast and Accurate 3D Cloth Draping
GarNet:一个快速准确的3D布料覆盖双流网络
作者:Erhan Gundogdu, Victor Constantin, Amrollah Seifoddini, Minh Dang, Mathieu Salzmann, Pascal Fua
论文链接:https://arxiv.org/abs/1811.10983v2
项目链接:https://cvlab.epfl.ch/research/garment-simulation/garnet/
5、3D-RelNet: Joint Object and Relational Network for 3D Prediction
3D-RelNet:用于3D预测的联合对象和关系网络
作者:Nilesh Kulkarni, Ishan Misra, Shubham Tulsiani, Abhinav Gupta
论文链接:https://arxiv.org/pdf/1906.02729.pdf
6、PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows(Oral)
PointFlow:使用连续正常化流程生成3D点云
作者:Guandao Yang, Xun Huang, Zekun Hao, Ming-Yu Liu, Serge Belongie, Bharath Hariharan
论文链接:https://arxiv.org/abs/1906.12320
Github链接:https://github.com/stevenygd/PointFlow
项目链接:https://www.guandaoyang.com/PointFlow/
7、Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds from Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction
多角度点云-VAE:通过联合自我重建和半对半预测从多角度的三维点云进行无监督特征学习
作者:Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker
论文链接:https://arxiv.org/abs/1907.12704
8、SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation
SceneGraphNet:神经消息传递3D室内场景增强
作者:Yang Zhou, Zachary While, Evangelos Kalogerakis
论文链接:https://arxiv.org/abs/1907.11308
9、HoloGAN: Unsupervised learning of 3D representations from natural images
HoloGAN:从自然图像中无监督地学习3D表示
作者:Thu Nguyen-Phuoc, Chuan Li, Lucas Theis, Christian Richardt, Yong-Liang Yang
论文链接:https://arxiv.org/abs/1904.01326
项目链接:https://www.monkeyoverflow.com/#/hologan-unsupervised-learning-of-3d-representations-from-natural-images/
10、FrameNet: Learning Local Canonical Frames of 3D Surfaces from a Single RGB Image
FrameNet:从单个RGB图像学习3D表面的局部规范框架
作者:Jingwei Huang, Yichao Zhou, Thomas Funkhouser, Leonidas Guibas
论文链接:https://arxiv.org/pdf/1903.12305.pdf
11、Face De-occlusion using 3D Morphable Model and Generative Adversarial
使用3D可变模型和生成对抗性进行面部去遮挡
作者:Xaiowei Yuan and In Kyu Park
论文链接:http://image.inha.ac.kr/paper/ICCV2019_Xaiowei.pdf
12、Multi-layer Depth and Epipolar Feature Transformers for 3D Scene Reconstruction
用于三维场景重建的多层深度和极线特征变换器
作者:Daeyun Shin, Zhile Ren, Erik B. Sudderth, Charless C. Fowlkes
论文链接:https://arxiv.org/abs/1902.06729
13、Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images
Pix2Vox:上下文感知的三维重建从单一和多视图图像
作者:Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang
论文链接:https://arxiv.org/abs/1901.11153
Github链接:https://github.com/hzxie/Pix2Vox
14、MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation
单目三维行人定位与不确定性估计
作者:Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi
论文链接:https://arxiv.org/abs/1906.06059
Github链接:https://github.com/vita-epfl/monoloco
15、Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images
塑造人类:从单个图像中估计非参数三维人体形状
作者:Valentin Gabeur, Jean-Sebastien Franco, Xavier Martin, Cordelia Schmid, Gregory Rogez
论文链接:https://arxiv.org/abs/1908.00439
16、Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation
Pixel2Mesh++:通过变形生成多视图3D网格
作者:Chao Wen, Yinda Zhang, Zhuwen Li, Yanwei Fu
论文链接:https://arxiv.org/abs/1908.01491
17、View N-gram Network for 3D Object Retrieval
查看N-gram网络三维对象检索
作者:Xinwei He, Tengteng Huang, Song Bai, Xiang Bai
论文链接:https://arxiv.org/abs/1908.01958
18、GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild
GP2C:野外关节三维位姿和焦距估计的几何投影参数一致性
作者:Alexander Grabner, Peter M. Roth, Vincent Lepetit
论文链接:https://arxiv.org/abs/1908.02809
19、Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation
用于视频显著性检测的时间聚合空间编解码器
作者:Giorgos Bouritsas, Sergiy Bokhnyak, Stylianos Ploumpis, Michael Bronstein, Stefanos Zafeiriou
论文链接:https://arxiv.org/abs/1905.02876
Github链接:https://github.com/gbouritsas/Neural3DMM
20、DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense
DUP-Net:用于3D对抗点云防御的Denoiser和Upsampler网络
作者: Hang Zhou, Kejiang Chen, Weiming Zhang, Han Fang, Wenbo Zhou, Nenghai Yu
论文链接:https://arxiv.org/abs/1812.11017
21、Interpolated Convolutional Networks for 3D Point Cloud Understanding(Oral )
用于3D点云理解的插值卷积网络
作者: Jiageng Mao, Xiaogang Wang, Hongsheng Li
论文链接:https://arxiv.org/abs/1908.04512
22、Efficient Learning on Point Clouds with Basis Point Sets
基于点集的点云有效学习
作者:Sergey Prokudin, Christoph Lassner, Javier Romero
论文链接:https://arxiv.org/abs/1908.09186
23、Few-Shot Generalization for Single-Image 3D Reconstruction via Priors
基于先验的单幅图像三维重建的小镜头综合
作者:Bram Wallace, Bharath Hariharan
论文链接:https://arxiv.org/abs/1909.01205
24、DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing
DensePoint:学习密集的上下文表示,以实现高效的点云处理
作者:Yongcheng Liu, Bin Fan, Gaofeng Meng, Jiwen Lu, Shiming Xiang, Chunhong Pan
论文链接:https://arxiv.org/abs/1909.03669
GCN
1、Can GCNs Go as Deep as CNNs?
作者:Guohao Li, Matthias Müller, Ali Thabet, Bernard Ghanem
论文链接:https://arxiv.org/abs/1904.03751
GitHub:https://github.com/lightaime/deep_gcns
GAN
1、Controllable Artistic Text Style Transfer via Shape-Matching GAN(Oral )
通过形匹配GAN实现可控的艺术文本风格转换
作者:Shuai Yang, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu, Zongming Guo
论文链接:https://arxiv.org/abs/1905.01354
Github链接:https://github.com/TAMU-VITA/ShapeMatchingGAN
项目链接:https://williamyang1991.github.io/projects/ICCV2019/SMGAN.html
2、Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks
利用生成对抗性网络,实现逼真的单目凝视重定向
作者:Zhe He, Adrian Spurr, Xucong Zhang, Otmar Hilliges
论文链接:https://arxiv.org/abs/1903.12530
Github链接:https://github.com/HzDmS/gaze_redirection
3、AutoGAN: Neural Architecture Search for Generative Adversarial Networks
神经结构搜索生成对抗性网络
Github链接:https://github.com/TAMU-VITA/AutoGAN
4、ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
ARGAN:用于阴影检测和去除的细心的反复生成的对抗性网络
作者:Bin Ding, Chengjiang Long, Ling Zhang, Chunxia Xiao
论文链接:https://arxiv.org/abs/1908.01323
其他
1、Meta-Sim Learning to Generate Synthetic Datasets (Oral)
Meta-Sim学习生成合成数据集
作者:Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler
项目链接:HTTPS://nv-tlabs.github.io/meta-sim/
论文链接:HTTPS://arxiv.org/abs/1904.11621
2、nocaps: novel object captioning at scale
nocaps:大规模的新颖物体字幕
作者:Harsh Agrawal, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter
项目链接:https://nocaps.org
论文链接:https://arxiv.org/abs/1812.08658
3、Scene GraphPrediction with Limited Labels
作者:Vincent S. Chen, Paroma Varma, Ranjay Krishna, Michael Bernstein, Christopher Re, Li Fei-Fei
论文链接:https://arxiv.org/abs/1904.11622
4、Variational Adversarial Active Learning(Oral)
变异对抗主动学习
作者:Samarth Sinha, Sayna Ebrahimi, Trevor Darrell
论文链接:https://arxiv.org/abs/1904.00370
5、The Trajectron: Probabilistic Multi-Agent Trajectory Modeling with Dynamic Spatiotemporal Graphs
Trajectron:使用动态时空图的概率多智能体轨迹建模
作者:Boris Ivanovic, Marco Pavone
论文链接:https://arxiv.org/abs/1810.05993
6、End-to-End Learning of Representations for Asynchronous Event-BasedData
异步事件数据表示的端到端学习
作者:Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza
论文链接:https://arxiv.org/abs/1904.08245
7、End-to-End Wireframe Parsing
端到端线框解析
作者:Yichao Zhou, Haozhi Qi, Yi Ma
论文链接:https://arxiv.org/abs/1905.03246
Github链接:https://github.com/zhou13/lcnn
8、Correlation Congruence for Knowledge Distillation
知识蒸馏的相关同余
作者:Baoyun Peng, Xiao Jin, Jiaheng Liu, Shunfeng Zhou, Yichao Wu, Yu Liu, Dongsheng Li, Zhaoning Zhang
论文链接:https://arxiv.org/abs/1904.018029
9、Equivariant Multi-View Networks (Oral)
Equivariant多视图网络
作者:Carlos Esteves, Yinshuang Xu, Christine Allen-Blanchette, Kostas Daniilidis
论文链接:https://arxiv.org/abs/1904.00993
Github链接:https://github.com/daniilidis-group/emvn
10、Episodic Training for Domain Generalization
领域泛化的模式训练
作者:Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, Timothy M. Hospedales
论文链接:https://arxiv.org/abs/1902.00113
11、Few-shot Unsupervised Image-to-Image Translation
作者:Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz
论文链接:https://arxiv.org/abs/1905.01723
Github链接:https://github.com/nvlabs/FUNIT/
项目链接:http://www.cs.cornell.edu/~xhuang/publication/funit/
12、Tex2Shape: Detailed Full Human Body Geometry from a Single Image
Tex2Shape:单个图像的人体详细几何特征
作者:Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor
论文链接:https://arxiv.org/abs/1904.08645
Github链接:https://github.com/thmoa/tex2shape
13、Semi-supervised Domain Adaptation via Minimax Entropy
通过Minimax熵进行半监督的域适应
作者:Kuniaki Saito, Donghyun Kim, Stan Sclaroff, Trevor Darrell, Kate Saenko
论文链接:https://arxiv.org/abs/1904.06487
14、Canonical Surface Mapping via Geometric Cycle Consistency
通过几何周期一致性的经典表面映射
作者:Nilesh Kulkarni, Abhinav Gupta, Shubham Tulsiani
论文链接:https://arxiv.org/abs/1907.10043
项目链接:https://nileshkulkarni.github.io/csm/
15、U4D: Unsupervised 4D Dynamic Scene Understanding
U4D:无监督的4D动态场景理解
作者:Armin Mustafa, Chris Russell, Adrian Hilton
论文链接:https://arxiv.org/abs/1907.09905
16、Scoot: A Perceptual Metric for Facial Sketches
Scoot:面部草图的感知度量
作者:Deng-Ping Fan, ShengChuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng, Bo Ren, Paul L Rosin, Rongrong Ji
论文链接:http://dpfan.net/wp-content/uploads/FaceSketch.pdf
Github链接:http://dpfan.net/wp-content/uploads/Scoot.zip
项目链接:http://dpfan.net/Scoot/
17、Similarity-Preserving Knowledge Distillation
相似性 – 保持知识蒸馏
作者:Frederick Tung, Greg Mori
论文链接:https://arxiv.org/abs/1907.09682
18、Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction
Tell,Draw和Repeat:基于连续的语言指令生成和修改图像
作者:Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor
论文链接:https://arxiv.org/pdf/1811.09845.pdf
19、Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers
语义对抗性攻击:欺骗深度分类器的参数化变换
作者:Ameya Joshi, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde
论文链接:https://arxiv.org/abs/1904.08489
20、What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
用滚动展开LSTM和模态注意预测自我中心行为
作者:Antonino Furnari, Giovanni Maria Farinella
项目链接:https://iplab.dmi.unict.it/rulstm/
论文链接:https://arxiv.org/pdf/1905.09035.pdf
GitHub:https://github.com/antoninofurnari/rulstm
21、Improving Adversarial Robustness via Guided Complement Entropy
通过引导补语熵提高对抗鲁棒性
作者:Hao-Yun Chen, Jhao-Hong Liang, Shih-Chieh Chang, Jia-Yu Pan, Yu-Ting Chen, Wei Wei, Da-Cheng Juan.
论文链接:https://arxiv.org/abs/1903.09799
Github链接:https://github.com/henry8527/GCE
22、6-DOF GraspNet: Variational Grasp Generation for Object Manipulation
6-DOF GraspNet:对象操作的变分抓取生成
作者:Arsalan Mousavian, Clemens Eppner, Dieter Fox
论文链接:https://arxiv.org/abs/1905.10520
23、Analyzing the Variety Loss in the Context of Probabilistic Trajectory Prediction
概率弹道预测语境中的变种损失分析
作者:Luca Anthony Thiede, Pratik Prabhanjan Brahma
论文链接:https://arxiv.org/abs/1907.10178
24、DAFL: Data-Free Learning of Student Networks
DAFL:学生网络的无数据学习
作者:Hanting Chen, Yunhe Wang, Chang Xu, Zhaohui Yang, Chuanjian Liu, Boxin Shi, Chunjing Xu, Chao Xu, Qi Tian
论文链接:https://arxiv.org/abs/1904.01186
25、Boosting Few-Shot Visual Learning with Self-Supervision
自我监督推动的少样本视觉学习
作者:Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord
论文链接:https://arxiv.org/abs/1906.05186
26、A Quaternion-based Certifiably Optimal Solution to the Wahba Problem with Outliers
Wahba问题异常值的基于四元数的可证明最优解
作者:Heng Yang, Luca Carlone
论文链接:https://arxiv.org/abs/1905.12536
27、Embodied Visual Recognition
人体视觉识别
作者:Jianwei Yang,Zhile Ren,Mingze Xu,Xinlei Chen,David Crandall,Devi Parikh,Dhruv Batra
项目链接:https://www.cc.gatech.edu/~jyang375/evr.html
28、Learning Implicit Generative Models by Matching Perceptual Features(Oral)
作者:Cicero Nogueira dos Santos, Youssef Mroueh, Inkit Padhi, Pierre Dognin
论文链接:https://arxiv.org/abs/1904.02762v1
29、Rethinking ImageNet Pre-training
作者:Kaiming He, Ross Girshick, and Piotr Dollár
论文链接:https://arxiv.org/abs/1811.08883
30、COCO-GAN: Generation by Parts via Conditional Coordinating(Oral)
COCO-GAN:通过条件协调按部件生成
作者:Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen
论文链接:https://arxiv.org/abs/1904.00284
Github链接:https://github.com/hubert0527/COCO-GAN
项目链接:https://hubert0527.github.io/COCO-GAN/
31、Model Vulnerability to Distributional Shifts over Image Transformation Sets
作者:Riccardo Volpi, Vittorio Murino
论文链接:https://arxiv.org/abs/1903.11900
Github链接:https://github.com/ricvolpi/domain-shift-robustness
32、Exploring Randomly Wired Neural Networks for Image Recognition(Oral)
探索随机有线神经网络进行图像识别
作者:Saining Xie, Alexander Kirillov, Ross Girshick, and Kaiming He
论文链接:https://arxiv.org/abs/1904.01569
33、Temporal Attentive Alignment for Large-Scale Video Domain Adaptation
作者:Min-Hung Chen, Zsolt Kira, Ghassan AlRegib, Jaekwon Woo, Ruxin Chen, Jian Zheng
论文链接:https://arxiv.org/abs/1907.12743
Github链接:http://github.com/cmhungsteve/TA3N
34、Creativity Inspired Zero-Shot Learning
启发式的零样本学习
作者:Mohamed Elhoseiny, Mohamed Elfeki
论文链接:https://arxiv.org/abs/1904.01109
35、Model Vulnerability to Distributional Shifts over Image Transformation Sets
作者:Riccardo Volpi, Vittorio Murino
论文链接:https://arxiv.org/abs/1903.11900
Github链接:https://github.com/ricvolpi/domain-shift-robustness
36、Coherent Semantic Attention for Image Inpainting
图像修复的连贯语义注意力机制
作者:Hongyu Liu, Bin Jiang, Yi Xiao, Chao Yang
论文链接:https://arxiv.org/abs/1905.12384
37、Learning to Paint with Model-based Deep Reinforcement Learning
基于模型的深层强化学习学习绘画
作者:Zhewei Huang, Wen Heng, Shuchang Zhou
论文链接:https://arxiv.org/abs/1903.04411
Github链接:https://github.com/hzwer/ICCV2019-LearningToPaint
38、LayoutVAE: Stochastic Scene Layout Generation from a Label Set
LayoutVAE:从标签集生成随机场景布局
作者:Akash Abdu Jyothi, Thibaut Durand, Jiawei He, Leonid Sigal, Greg Mori
论文链接:https://arxiv.org/abs/1907.10719
39、Co-Evolutionary Compression for Unpaired Image Translation
非成对图像翻译的共同进化压缩
作者:Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu
论文链接:https://arxiv.org/abs/1907.10804
40、Enhancing Adversarial Example Transferability with an Intermediate Level Attack
通过中级攻击增强对抗性示例可转移性
作者:Qian Huang, Isay Katsman, Horace He, Zeqi Gu, Serge Belongie, Ser-Nam Lim
论文链接:https://arxiv.org/abs/1907.10823
41、Gated2Depth: Real-time Dense Lidar from Gated Images
Gated2Depth:来自门控图像的实时密集激光雷达
作者:Tobias Gruber, Frank Julca-Aguilar,Mario Bijelic,Werner Ritter,Klaus Dietmayer,Felix Heide
论文链接:https://www.cs.princeton.edu/~fheide/papers/Gated2Depth_preprint.pdf
42、Counting with Focus for Free
作者:Zenglin Shi, Pascal Mettes, Cees G. M. Snoek
论文链接:https://arxiv.org/abs/1903.12206
Github链接:https://github.com/shizenglin/Counting-with-Focus-for-Free
43、PU-GAN: a Point Cloud Upsampling Adversarial Network
作者:Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
论文链接:https://arxiv.org/abs/1907.10844
44、Moment Matching for Multi-Source Domain Adaptation (Oral)
多源域适应的配套匹配
作者:Xingchao Peng, Qinxun Bai, Xide Xia, Zijun Huang, Kate Saenko, Bo Wang
论文链接:https://arxiv.org/abs/1812.01754
45、EMPNet: Neural Localisation and Mapping using Embedded Memory Points
EMPNet:使用嵌入式存储点的神经定位和映射
作者:Gil Avraham, Yan Zuo, Thanuja Dharmasiri, Tom Drummond
论文链接:https://arxiv.org/abs/1907.13268
46、Learning Compositional Representations for Few-Shot Recognition
少样本识别的构图表示学习
作者:Pavel Tokmakov, Yuxiong Wang, Martial Hebert
论文链接:https://sites.google.com/view/comprepr/home
47、Digging Into Self-Supervised Monocular Depth Estimation
自我监督单眼深度估计的研究
作者:Clement Godard, Oisin Mac Aodha, Michael Firman, Gabriel Brostow
论文链接:https://arxiv.org/pdf/1806.01260.pdf
48、Deep Interpretable Non-Rigid Structure from Motion
运动的深层可解释的非刚性结构
作者:Chen Kong, Simon Lucey
论文链接:https://arxiv.org/pdf/1902.10840.pdf
49、PRECOG: PREdiction Conditioned On Goals in Visual Multi-Agent Settings
PRECOG:视觉多代理设置中的目标条件
作者:Nicholas Rhinehart, Rowan McAllister, Kris Kitani, Sergey Levine
论文链接:https://arxiv.org/pdf/1905.01296.pdf
项目链接:https://sites.google.com/view/precog
50、Lifelong GAN: Continual Learning for Conditional Image Generation
终身GAN:条件图像生成的持续学习
作者:Mengyao Zhai, Lei Chen, Fred Tung, Jiawei He, Megha Nawhal, Greg Mori
论文链接:https://arxiv.org/abs/1907.10107
52、An Empirical Study of Spatial Attention Mechanisms in Deep Networks
深度网络空间注意机制的实证研究
作者:Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin, Jifeng Dai
论文链接:https://arxiv.org/pdf/1904.05873.pdf
53、Fashion++: Minimal Edits for Outfit Improvement
Fashion ++:改进装备的最小编辑
作者:Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman
论文链接:https://arxiv.org/pdf/1904.09261.pdf
54、Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Align2Ground:由图像标题对齐引导的弱监督短语接地
作者:Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran
论文链接:https://arxiv.org/pdf/1903.11649.pdf
55、Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded
做一个提示:利用解释使视觉和语言模型更加扎实
作者:Ramprasaath R. Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Dhruv Batra, Devi Parikh
论文链接:https://arxiv.org/pdf/1902.03751.pdf
56、SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation
SplitNet:用于体验视觉导航的Sim2Sim和Task2Task转移
作者:Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman, Dhruv Batra
论文链接:https://arxiv.org/pdf/1905.07512.pdf
57、Habitat: A Platform for Embodied AI Research ( Oral )
Habitat:体验人工智能研究的平台
作者:Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra
论文链接:https://arxiv.org/abs/1904.01201
58、EM-Fusion: Dynamic Object-Level SLAM with Probabilistic Data Association
EM-Fusion:具有概率数据关联的动态对象级SLAM
作者:Michael Strecke, Jörg Stückler
论文链接:https://arxiv.org/abs/1904.11781
59、Texture Fields: Learning Texture Representations in Function Space
纹理字段:在函数空间中学习纹理表示
作者:Michael Oechsle, Lars Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger
论文链接:https://arxiv.org/abs/1905.07259
60、AMASS: Archive of Motion Capture as Surface Shapes
AMASS:将运动捕捉存档为表面形状
作者:Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje, Gerard Pons-Moll, Michael J. Black
论文链接:https://arxiv.org/abs/1904.03278
61、End-to-end Learning for Graph Decomposition
图形分解的端到端学习
作者:Jie Song, Bjoern Andres, Michael Black, Otmar Hilliges, Siyu Tang
论文链接:https://arxiv.org/pdf/1812.09737.pdf
62、Towards Multi-pose Guided Virtual Try-on Network
Towards多姿态引导虚拟试穿网络
作者:Haoye Dong, Xiaodan Liang, Bochao Wang, Hanjiang Lai, Jia Zhu, Jian Yin
论文链接:https://arxiv.org/abs/1902.11026
63、On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method
利用无梯度优化和算子分裂方法设计黑盒对抗实例
作者:Pu Zhao, Sijia Liu, Pin-Yu Chen, Nghia Hoang, Kaidi Xu, Bhavya Kailkhura, Xue Lin
论文链接:https://arxiv.org/abs/1907.11684
64、Goal-Driven Sequential Data Abstraction
目标驱动的顺序数据抽象
作者:Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
论文链接:https://arxiv.org/abs/1907.12336
65、Recursive Cascaded Networks for Unsupervised Medical Image Registration
用于无监督医学图像配准的递归级联网络
作者: Shengyu Zhao, Yue Dong, Eric I-Chao Chang, Yan Xu
论文链接:https://arxiv.org/abs/1907.12353
66、Learn to Scale: Generating Multipolar Normalized Density Map for Crowd Counting
学习规模:为人群计数生成多极归一化密度图
作者:Chenfeng Xu, Kai Qiu, Jianlong Fu, Song Bai, Yongchao Xu, Xiang Bai
论文链接:https://arxiv.org/abs/1907.12428
67、MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning
MetaPruning:自动神经网络通道修剪的元学习
作者:Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Tim Kwang-Ting Cheng, Jian Sun
论文链接:https://arxiv.org/abs/1903.10258
68、Switchable Whitening for Deep Representation Learning
作者:Xingang Pan, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo
论文链接:https://arxiv.org/abs/1904.09739
69、Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
作者:Yunpeng Chen, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng
论文链接:https://arxiv.org/abs/1904.05049
70、Task2Vec: Task Embedding for Meta-Learning
Task2Vec:元学习的任务嵌入
作者:Alessandro Achille, Michael Lam, Rahul Tewari, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Stefano Soatto, Pietro Perona
论文链接:https://arxiv.org/abs/1902.03545
71、CARAFE: Content-Aware ReAssembly of FEatures ( Oral )
CARAFE:内容意识重新组装特征
作者:Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin
论文链接:https://arxiv.org/pdf/1905.02188.pdf
72、Domain Intersection and Domain Difference
域交和域差
Github链接:https://github.com/sagiebenaim/DomainIntersectionDifference
73、A Closed-form Solution to Universal Style Transfer
一种通用样式转换的封闭式解决方案
作者:Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen, Feng Xu, Li Zhang
论文链接:https://arxiv.org/abs/1906.00668
Github链接:https://github.com/lu-m13/OptimalStyleTransfer
74、Sampling-free Epistemic Uncertainty Estimation Using Approximated Variance Propagation
基于近似方差传播的无样本认知不确定性估计
Github链接:https://github.com/janisgp/Sampling-free-Epistemic-Uncertainty
75、On the Over-Smoothing Problem of CNN Based Disparity Estimation
基于CNN的视差估计的过平滑问题
Github链接:https://github.com/chenchr/otosp
76、Metric Learning with HORDE: High-Order Regularizer for Deep Embeddings
HORDE度量学习:用于深度嵌入的高阶正则化器
论文链接:https://arxiv.org/abs/1908.02735
Github链接:https://github.com/pierre-jacob/ICCV2019-Horde
77、Mask-ShadowGAN: Learning to Remove Shadows from Unpaired Data
面具阴影甘:学习从未配对的数据中去除阴影
作者:Xiaowei Hu, Yitong Jiang, Chi-Wing Fu, and Pheng-Ann Heng
Github链接:https://github.com/xw-hu/Mask-ShadowGAN
78、Universally Slimmable Networks and Improved Training Techniques
普遍精简的网络和改进的培训技术
作者:Jiahui Yu, Thomas Huang
论文链接:https://arxiv.org/abs/1903.05134
Github链接:https://github.com/JiahuiYu/slimmable_networks
79、Domain Adaptation for Structured Output via Discriminative Patch Representations (Oral)
通过有区别的Patch表示对结构化输出进行域适应
作者:Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker
论文链接:https://arxiv.org/abs/1901.05427
80、Deep Non-Rigid Structure from Motion(Oral)
.深非刚性的结构与运动
作者:Chen Kong, Simon Lucey
论文链接:https://arxiv.org/abs/1908.00052
81、Learning the Model Update for Siamese Trackers
学习暹罗语追踪器的模型更新
作者:Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer, Martin Danelljan, Fahad Shahbaz Khan
论文链接:https://arxiv.org/abs/1908.00855
82、Distilling Knowledge From a Deep Pose Regressor Network
从深层位姿回归网络中提取知识
作者:Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Yasin Almalioglu, Andrew Markham, Niki Trigoni
论文链接:https://arxiv.org/abs/1908.00858
83、Permutation-invariant Feature Restructuring for Correlation-aware Image Set-based Recognition
基于相关感知的图像集识别的置换不变特征重构
作者:Xiaofeng Liu, Zhenhua Guo, Site Li, Lingsheng Kong, Ping Jia, Jane You, B. V. K. Kumar
论文链接:https://arxiv.org/abs/1908.01174
84、Restoration of Non-rigidly Distorted Underwater Images using a Combination of Compressive Sensing and Local Polynomial Image Representations(Oral )
恢复非刚性的扭曲的水下图像使用压缩传感和图像局部多项式表示的组合
作者: Jerin Geo James, Pranay Agrawal, Ajit Rajwade
论文链接:https://arxiv.org/abs/1908.01940
85、Semi-supervised Skin Detection by Network with Mutual Guidance
基于相互指导的网络半监督皮肤检测
作者:Yi He, Jiayuan Shi, Chuan Wang, Haibin Huang, Jiaming Liu, Guanbin Li, Risheng Liu, Jue Wang
论文链接:https://arxiv.org/abs/1908.01977
86、Consensus Maximization Tree Search Revisited(Oral)
共识最大化树搜索重新审视(口语)
作者:Zhipeng Cai, Tat-Jun Chin, Vladlen Koltun
论文链接:https://arxiv.org/abs/1908.02021
87、Deep Self-Learning From Noisy Labels
从嘈杂的标签中进行深度的自我学习
作者:Jiangfan Han, Ping Luo, Xiaogang Wang
论文链接:https://arxiv.org/abs/1908.02160
88、Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning
用于无监督图表示学习的对称图卷积自编码器
作者:Jiwoong Park, Minsik Lee, Hyung Jin Chang, Kyuewang Lee, Jin Young Choi
论文链接:https://arxiv.org/abs/1908.02441
89、Expert Sample Consensus Applied to Camera Re-Localization
将专家样本一致性应用于相机再定位
作者:Eric Brachmann, Carsten Rother
论文链接:https://arxiv.org/abs/1908.02484
90、SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition
空间感:一个反向众包的空间关系识别基准
作者:Kaiyu Yang, Olga Russakovsky, Jia Deng
论文链接:https://arxiv.org/abs/1908.02660
91、Bidirectional One-Shot Unsupervised Domain Mapping
双向一次无监督域映射
Github链接:https://github.com/tomercohen11/BiOST
92、CompenNet++: End-to-end Full Projector Compensation
CompenNet++:端到端全投影仪补偿
Github链接:https://github.com/BingyaoHuang/CompenNet-plusplus
93、Perspective-Guided Convolution Networks for Crowd Counting
用于人群计数的透视引导卷积网络
Github链接:https://github.com/Zhaoyi-Yan/PGCNet
94、Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation
Larger范数更可转移:无监督域自适应的自适应特征范数方法
作者:Ruijia Xu, Guanbin Li, Jihan Yang, Liang Lin
论文链接:https://arxiv.org/abs/1811.07456
95、Closed-Form Optimal Two-View Triangulation Based on Angular Errors
基于角度误差的闭式最优双视三角剖分
作者:Seong Hun Lee, Javier Civera
论文链接:https://arxiv.org/abs/1903.09115
96、Overcoming Catastrophic Forgetting with Unlabeled Data in the Wild
在荒野中克服了未标记数据的灾难性遗忘
作者:Kibok Lee, Kimin Lee, Jinwoo Shin, Honglak Lee
论文链接:https://arxiv.org/abs/1903.12648
Github链接:https://github.com/kibok90/iccv2019-inc
97、Learning Combinatorial Embedding Networks for Deep Graph Matching
用于深度图匹配的学习组合嵌入网络
作者:Runzhong Wang, Junchi Yan, Xiaokang Yang
论文链接:https://arxiv.org/abs/1904.00597
98、PR Product: A Substitute for Inner Product in Neural Networks(Oral )
PR产品:神经网络内部产品替代品(口服)
作者:Zhennan Wang, Wenbin Zou, Chen Xu
论文链接:https://arxiv.org/abs/1904.13148
Github链接:https://github.com/wzn0828/PR_Product
99、STM: SpatioTemporal and Motion Encoding for Action Recognition
STM:行动识别的SpatioTmporal和运动编码
作者:Boyuan Jiang, Mengmeng Wang, Weihao Gan, Wei Wu, Junjie Yan
论文链接:https://arxiv.org/abs/1908.02486
100、Memory-Based Neighbourhood Embedding for Visual Recognition(Oral )
基于记忆的邻域嵌入视觉识别
作者:Suichan Li, Dapeng Chen, Bin Liu, Nenghai Yu, Rui Zhao
论文链接:https://arxiv.org/abs/1908.04992
101、Few-Shot Learning with Global Class Representations
全球班级代表的快速学习
作者:Tiange Luo, Aoxue Li, Tao Xiang, Weiran Huang, Liwei Wang
论文链接:https://arxiv.org/abs/1908.05257
102、Learning Trajectory Dependencies for Human Motion PredictionOral
学习人体运动预测的轨迹依赖性
作者:Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li
论文链接:https://arxiv.org/abs/1908.05436
Github链接:https://github.com/wei-mao-2019/LearnTrajDep
103、Symmetric Cross Entropy for Robust Learning with Noisy Labels
具有噪声标签的鲁棒学习的对称交叉熵
作者:Yisen Wang, Xingjun Ma, Zaiyi Chen, Yuan Luo, Jinfeng Yi, James Bailey
论文链接:https://arxiv.org/abs/1908.06112
104、From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer
从打开设置到封闭设置:按空间划分和计数计算对象
作者:Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Zhiguo Cao, Chunhua Shen
论文链接:https://arxiv.org/abs/1908.06473
Github链接:https://github. com/xhp-hust-2018-2011/S-DCNet
105、Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation
通过骷髅解剖表示从人工图像中恢复人体网格
作者:Sun Yu, Ye Yun, Liu Wu, Gao Wenpeng, Fu YiLi, Mei Tao
论文链接:https://arxiv.org/abs/1908.07172
106、ViCo: Word Embeddings from Visual Co-occurrences
ViCo:来自视觉共现的词嵌入
作者:Tanmay Gupta, Alexander Schwing, Derek Hoiem
论文链接:https://arxiv.org/abs/1908.08527
项目链接:http://tanmaygupta.info/vico/
107、Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
在多样性图像标题期间建模意图的顺序潜在空间
作者:Jyoti Aneja, Harsh Agrawal, Dhruv Batra, Alexander Schwing
论文链接:https://arxiv.org/abs/1908.08529
108、Learning Similarity Conditions Without Explicit Supervision
在没有明确监督的情况下学习相似性条件
作者:Reuben Tan, Mariya I. Vasileva, Kate Saenko, Bryan A. Plummer
论文链接:https://arxiv.org/abs/1908.08589
109、Shadow Removal via Shadow Image Decomposition
通过阴影图像分解去除阴影
作者:Hieu Le, Dimitris Samaras
论文链接:https://arxiv.org/abs/1908.08628
110、Crowd Counting with Deep Structured Scale Integration Network
深度结构化规模集成网络的计算
作者:Lingbo Liu, Zhilin Qiu, Guanbin Li, Shufan Liu, Wanli Ouyang, Liang Lin
论文链接:https://arxiv.org/abs/1908.08692
111、Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry
自我监督的深度视觉测距的顺序对抗性学习
作者:Shunkai Li, Fei Xue, Xin Wang, Zike Yan, Hongbin Zha
论文链接:https://arxiv.org/abs/1908.08704
112、Learning Filter Basis for Convolutional Neural Network Compression
卷积神经网络压缩的学习滤波器基础
作者:Yawei Li, Shuhang Gu, Luc Van Gool, Radu Timofte
论文链接:https://arxiv.org/abs/1908.08932
Github链接:https://github.com/ofsoundof/learning_filter_basis
113、Where Is My Mirror?
作者:Xin Yang, Haiyang Mei, Ke Xu, Xiaopeng Wei, Baocai Yin, Rynson W. H. Lau
论文链接:https://arxiv.org/abs/1908.09101
项目链接:https://mhaiyang.github.io/ICCV2019_MirrorNet/index.html
114、Towards Unsupervised Image Captioning with Shared Multimodal Embeddings
使用共享多模式嵌入来保护无监督的图像标题
作者:Iro Laina, Christian Rupprecht, Nassir Navab
论文链接:https://arxiv.org/abs/1908.09317
115、Object-Driven Multi-Layer Scene Decomposition From a Single Image
来自单个图像的对象驱动的多层场景分解
作者:Helisa Dhamo, Nassir Navab, Federico Tombari
论文链接:https://arxiv.org/abs/1908.09521
116、Non-local Recurrent Neural Memory for Supervised Sequence Modeling(Oral)
用于监督序列建模的非局部递归神经记忆
作者:Canmiao Fu, Wenjie Pei, Qiong Cao, Chaopeng Zhang, Yong Zhao, Xiaoyong Shen, Yu-Wing Tai
论文链接:https://arxiv.org/abs/1908.09535
117、Embarrassingly Simple Binary Representation Learning
简单的二进制表示学习
作者:Yuming Shen, Jie Qin, Jiaxin Chen, Li Liu, Fan Zhu
论文链接:https://arxiv.org/abs/1908.09573
118、Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels(Oral )
简单的二进制表示学习
作者:Felix J. S. Bragman, Ryutaro Tanno, Sebastien Ourselin, Daniel C. Alexander, M. Jorge Cardoso
论文链接:https://arxiv.org/abs/1908.09597
119、Confidence Regularized Self-Training
肯定的自我训练
作者:Yang Zou, Zhiding Yu, Xiaofeng Liu, B. V. K. Vijaya Kumar, Jinsong Wang
论文链接:https://arxiv.org/abs/1908.09822
Github链接:https://github.com/yzou2/CRST
120、SoftTriple Loss: Deep Metric Learning Without Triplet Sampling
软三重损失:没有三重抽样的深度度量学习
作者:Qi Qian, Lei Shang, Baigui Sun, Juhua Hu, Hao Li, Rong Jin
论文链接:https://arxiv.org/abs/1909.05235
121、A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays
一种CNNs相机:面向像素处理器阵列上的嵌入式神经网络
作者:Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas
论文链接:https://arxiv.org/abs/1909.05647
122、DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch
深度修剪:通过可微PatchMatch学习有效的立体匹配
作者:Shivam Duggal, Shenlong Wang, Wei-Chiu Ma, Rui Hu, Raquel Urtasun
论文链接:https://arxiv.org/abs/1909.05845
123、Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective
反思零镜头学习:一个有条件的视觉分类视角
作者:Kai Li, Martin Renqiang Min, Yun Fu
论文链接:https://arxiv.org/abs/1909.05995
124、Learning Spatial Awareness to Improve Crowd Counting(Oral)
作者:Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander Hauptmann
论文链接:https://arxiv.org/abs/1909.07057
125、AdaptIS: Adaptive Instance Selection Network
AdaptIS:自适应实例选择网络
作者:Konstantin Sofiiuk, Olga Barinova, Anton Konushin
论文链接:https://arxiv.org/abs/1909.07829
Github链接:https://github.com/saic-vul/adaptis
126、Self-Supervised Monocular Depth Hints
自我监督单眼深度提示
作者:Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov
论文链接:https://arxiv.org/abs/1909.09051
127、Making the Invisible Visible: Action Recognition Through Walls and Occlusions
使不可见变为可见:通过墙壁和遮挡的动作识别
作者:Tianhong Li, Lijie Fan, Mingmin Zhao, Yingcheng Liu, Dina Katabi
论文链接:https://arxiv.org/abs/1909.09300
128、Adversarial Learning with Margin-based Triplet Embedding Regularization
基于边值的三重嵌入正则化的对抗性学习
作者: Yaoyao Zhong, Weihong Deng
论文链接:https://arxiv.org/abs/1909.09481
129、Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
交互式草图和填充:多级草图到图像的翻译
作者: Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros, Philip H. S. Torr, Eli Shechtman
论文链接:https://arxiv.org/abs/1909.11081
项目链接:https://arnabgho.github.io/iSketchNFill/
130、Anchor Loss: Modulating Loss Scale based on Prediction Difficulty(Oral )
锚点损失:基于预测难度的调整损失量表
作者:Serim Ryou, Seong-Gyun Jeong, Pietro Perona
论文链接:https://arxiv.org/abs/1909.11155
131、Learning Propagation for Arbitrarily-structured Data
任意结构数据的学习传播
作者:Sifei Liu, Xueting Li, Varun Jampani, Shalini De Mello, Jan Kautz
论文链接:https://arxiv.org/abs/1909.11237
132、MIC: Mining Interclass Characteristics for Improved Metric Learning
MIC:挖掘类间特征以改进度量学习
作者:Karsten Roth, Biagio Brattoli, Björn Ommer
论文链接:https://arxiv.org/abs/1909.11574
133、Compact Trilinear Interaction for Visual Question Answering
紧凑的三线性互动视觉问题回答
作者:Tuong Do, Thanh-Toan Do, Huy Tran, Erman Tjiputra, Quang D. Tran
论文链接:https://arxiv.org/abs/1909.11874
134、Convex Relaxations for Consensus and Non-Minimal Problems in 3D Vision
三维视觉中一致和非最小问题的凸松弛
作者:Thomas Probst, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool
论文链接:https://arxiv.org/abs/1909.12034
135、Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
可区分的学习到组的通道通过可分组的卷积神经网络
作者:Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang, Xiaogang Wang, Ping Luo
论文链接:https://arxiv.org/abs/1908.05867
Github链接:https://github.com/d-li14/dgconv.pytorch
136、HBONet: Harmonious Bottleneck on Two Orthogonal Dimensions
HBONet:两个正交维度上的和谐瓶颈
作者:Duo Li, Aojun Zhou, Anbang Yao
论文链接:https://arxiv.org/abs/1908.03888
Github链接:https://github.com/d-li14/HBONet
137、Dual Student: Breaking the Limits of the Teacher in Semi-supervised Learning
双元学生:打破教师在半监督学习中的限制
作者:Zhanghan Ke, Daoye Wang, Qiong Yan, Jimmy Ren, Rynson W. H. Lau
论文链接:https://arxiv.org/abs/1909.01804
138、
139、Program-Guided Image Manipulators
Program-Guided形象操纵者
作者:Jiayuan Mao, Xiuming Zhang, Yikai Li, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu
论文链接:https://arxiv.org/abs/1909.02116
项目链接:http://pgim.csail.mit.edu/
140、Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning
通过时空图推理来理解人类的目光交流
作者:Lifeng Fan, Wenguan Wang, Siyuan Huang, Xinyu Tang, Song-Chun Zhu
论文链接:https://arxiv.org/abs/1909.02144
141、Gravity as a Reference for Estimating a Person’s Height from Video
重力作为一个参考,从视频估计一个人的高度
作者:Didier Bieler, Semih Günel, Pascal Fua, Helge Rhodin
论文链接:https://arxiv.org/abs/1909.02211
142、Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement
贝叶斯-因素- vae:用于因素分解的层次贝叶斯深度自动编码器模型
作者:Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic
论文链接:https://arxiv.org/abs/1909.02820
143、Hierarchy Parsing for Image Captioning
用于图像字幕的层次结构解析
作者: Ting Yao, Yingwei Pan, Yehao Li, Tao Mei
论文链接:https://arxiv.org/abs/1909.03918
144、Learning Object-specific Distance from a Monocular Image
.从单眼图像学习物体特定的距离
作者:Jing Zhu, Yi Fang, Husam Abu-Haimed, Kuo-Chin Lien, Dongdong Fu, Junli Gu
论文链接:https://arxiv.org/abs/1909.04182
145、Bayesian Relational Memory for Semantic Visual Navigation
用于语义视觉导航的贝叶斯关系记忆
作者:Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian
论文链接:https://arxiv.org/abs/1909.04306
146、FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images
FreiHAND:用于从单个RGB图像捕获手部姿势和形状的无标记数据集
作者:Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max Argus, Thomas Brox
论文链接:https://arxiv.org/abs/1909.04349
项目链接:https://lmb.informatik.uni-freiburg.de/projects/freihand/
147、Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection
联合深度特征的结构化建模和突出目标检测的预测精化
作者: Yingyue Xu, Dan Xu, Xiaopeng Hong, Wanli Ouyang, Rongrong Ji, Min Xu, Guoying Zhao
论文链接:https://arxiv.org/abs/1909.04366
148、FDA: Feature Disruptive Attack
特性破坏性攻击
作者: Aditya Ganeshan, B. S. Vivek, R. Venkatesh Babu
论文链接:https://arxiv.org/abs/1909.04385
Github链接:https://github.com/BardOfCodes/fda
149、Cross-X Learning for Fine-Grained Visual Categorization
用于细粒度视觉分类的交叉x学习
作者:Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim
论文链接:https://arxiv.org/abs/1909.04412
Github链接:https://github.com/cswluo/CrossX
150、Reasoning About Human-Object Interactions Through Dual Attention Networks
通过双重注意力网络对人-物交互进行推理
作者:Tete Xiao, Quanfu Fan, Dan Gutfreund, Mathew Monfort, Aude Oliva, Bolei Zhou
论文链接:https://arxiv.org/abs/1909.04743
151、Variable Rate Deep Image Compression With a Conditional Autoencoder
可变速率深图像压缩与条件自动编码器
作者: Yoojin Choi, Mostafa El-Khamy, Jungwon Lee
论文链接:https://arxiv.org/abs/1909.04802
152、Deep Elastic Networks with Model Selection for Multi-Task Learning
具有多任务学习模型选择的深度弹性网络
作者:Chanho Ahn, Eunwoo Kim, Songhwai Oh
论文链接:https://arxiv.org/abs/1909.04860
153、Sparse and Imperceivable Adversarial Attacks
稀疏的和不可感知的对抗攻击
作者:Francesco Croce, Matthias Hein
论文链接:https://arxiv.org/abs/1909.05040