• A Scalable Photonic Computer Solving the Subset Sum Problem

    Authors:

    Xiao-Yun Xu,

    Xuan-Lun Huang,

    Zhan-Ming Li,

    Jun Gao,

    Zhi-Qiang Jiao,

    Yao Wang,

    Ruo-Jing Ren,

    H. P. Zhang,

    Xian-Min Jin

    Abstract:

    The subset sum problem is a typical NP-complete problem that is hard to solve efficiently in time due to the intrinsic superpolynomial-scaling property. Increasing the problem size results in a vast amount of time consuming in conventionally available computers. Photons possess the unique features of extremely high propagation speed, weak interaction with environment and low detectable energy leve…
    ▽ More

    Submitted 3 February, 2020;
    originally announced February 2020.

  • PCNN: Pattern-based Fine-Grained Regular Pruning towards Optimizing CNN Accelerators

    Authors:

    Zhanhong Tan,

    Jiebo Song,

    Xiaolong Ma,

    Sia-Huat Tan,

    Hongyang Chen,

    Yuanqing Miao,

    Yifu Wu,

    Shaokai Ye,

    Yanzhi Wang,

    Dehui Li,

    Kaisheng Ma

    Abstract:

    Weight pruning is a powerful technique to realize model compression. We propose PCNN, a fine-grained regular 1D pruning method. A novel index format called Sparsity Pattern Mask (SPM) is presented to encode the sparsity in PCNN. Leveraging SPM with limited pruning patterns and non-zero sequences with equal length, PCNN can be efficiently employed in hardware. Evaluated on VGG-16 and ResNet-18, our…
    ▽ More

    Submitted 11 February, 2020;
    originally announced February 2020.

  • Efficient Training of Deep Convolutional Neural Networks by Augmentation in Embedding Space

    Authors:

    Mohammad Saeed Abrishami,

    Amir Erfan Eshratifar,

    David Eigen,

    Yanzhi Wang,

    Shahin Nazarian,

    Massoud Pedram

    Abstract:

    Recent advances in the field of artificial intelligence have been made possible by deep neural networks. In applications where data are scarce, transfer learning and data augmentation techniques are commonly used to improve the generalization of deep learning models. However, fine-tuning a transfer model with data augmentation in the raw input space has a high computational cost to run the full ne…
    ▽ More

    Submitted 11 February, 2020;
    originally announced February 2020.

  • Progressive Object Transfer Detection

    Authors:

    Hao Chen,

    Yali Wang,

    Guoyou Wang,

    Xiang Bai,

    Yu Qiao

    Abstract:

    Recent development of object detection mainly depends on deep learning with large-scale benchmarks. However, collecting such fully-annotated data is often difficult or expensive for real-world applications, which restricts the power of deep neural networks in practice. Alternatively, humans can detect new objects with little annotation burden, since humans often use the prior knowledge to identify…
    ▽ More

    Submitted 11 February, 2020;
    originally announced February 2020.

  • Music2Dance: Music-driven Dance Generation using WaveNet

    Authors:

    Wenlin Zhuang,

    Congyi Wang,

    Siyu Xia,

    Jinxiang Chai,

    Yangang Wang

    Abstract:

    In this paper, we propose a novel system, named as Music2Dance, for addressing the problem of fully automatic music and choreography. Our key idea is to shift the WaveNet, which is originally designed for speech generation, to the human motion synthesis. To balance the big differences between these two tasks, we propose a novel network structure. Typically, being regarded as the local condition fo…
    ▽ More

    Submitted 2 February, 2020;
    originally announced February 2020.

  • PointHop++: A Lightweight Learning Model on Point Sets for 3D Classification

    Authors:

    Min Zhang,

    Yifan Wang,

    Pranav Kadam,

    Shan Liu,

    C. -C. Jay Kuo

    Abstract:

    The PointHop method was recently proposed by Zhang et al. for 3D point cloud classification with unsupervised feature extraction. It has an extremely low training complexity while achieving state-of-the-art classification performance. In this work, we improve the PointHop method furthermore in two aspects: 1) reducing its model complexity in terms of the model parameter number and 2) ordering disc…
    ▽ More

    Submitted 8 February, 2020;
    originally announced February 2020.

  • Romance in China: Mining and Visualizing 10 Million Alibaba Valentine Purchases

    Authors:

    Yongzhen Wang,

    Xiaozhong Liu,

    Yingnan Ju,

    Katy Börner,

    Jun Lin,

    Changlong Sun,

    Luo Si

    Abstract:

    Valentine Day February 14, is the day of love. The days ahead of Valentine’s Day are filled with extensive shopping activity for loved ones. Previous studies have investigated expressions of romantic love and gift-giving using surveys with 40-100 participants. In the era of big data, large datasets can be used to study social phenomena and explore and exploit evolving patterns, trends, and outlier…
    ▽ More

    Submitted 7 February, 2020;
    originally announced February 2020.

  • Long-Range Gesture Recognition Using Millimeter Wave Radar

    Authors:

    Yu Liu,

    Yuheng Wang,

    Haipeng Liu,

    Anfu Zhou,

    Jianhua Liu,

    Ning Yang

    Abstract:

    Millimeter wave (mmWave) based gesture recognition technology provides a good human computer interaction (HCI) experience. Prior works focus on the close-range gesture recognition, but fall short in range extension, i.e., they are unable to recognize gestures more than one meter away from considerable noise motions. In this paper, we design a long-range gesture recognition model which utilizes a n…
    ▽ More

    Submitted 6 February, 2020;
    originally announced February 2020.

  • Source separation with weakly labelled data: An approach to computational auditory scene analysis

    Authors:

    Qiuqiang Kong,

    Yuxuan Wang,

    Xuchen Song,

    Yin Cao,

    Wenwu Wang,

    Mark D. Plumbley

    Abstract:

    Source separation is the task to separate an audio recording into individual sound sources. Source separation is fundamental for computational auditory scene analysis. Previous work on source separation has focused on separating particular sound classes such as speech and music. Many of previous work require mixture and clean source pairs for training. In this work, we propose a source separation…
    ▽ More

    Submitted 5 February, 2020;
    originally announced February 2020.

  • Multi-Fusion Chinese WordNet (MCW) : Compound of Machine Learning and Manual Correction

    Authors:

    Mingchen Li,

    Zili Zhou,

    Yanna Wang

    Abstract:

    Princeton WordNet (PWN) is a lexicon-semantic network based on cognitive linguistics, which promotes the development of natural language processing. Based on PWN, five Chinese wordnets have been developed to solve the problems of syntax and semantics. They include: Northeastern University Chinese WordNet (NEW), Sinica Bilingual Ontological WordNet (BOW), Southeast University Chinese WordNet (SEW),…
    ▽ More

    Submitted 5 February, 2020;
    originally announced February 2020.

  • On Positive-Unlabeled Classification in GAN

    Authors:

    Tianyu Guo,

    Chang Xu,

    Jiajun Huang,

    Yunhe Wang,

    Boxin Shi,

    Chao Xu,

    Dacheng Tao

    Abstract:

    This paper defines a positive and unlabeled classification problem for standard GANs, which then leads to a novel technique to stabilize the training of the discriminator in GANs. Traditionally, real data are taken as positive while generated data are negative. This positive-negative classification criterion was kept fixed all through the learning process of the discriminator without considering t…
    ▽ More

    Submitted 4 February, 2020;
    originally announced February 2020.

  • Aesthetic Quality Assessment for Group photograph

    Authors:

    Yaoting Wang,

    Yongzhen Ke,

    Kai Wang,

    Cuijiao Zhang,

    Fan Qin

    Abstract:

    Image aesthetic quality assessment has got much attention in recent years, but not many works have been done on a specific genre of photos: Group photograph. In this work, we designed a set of high-level features based on the experience and principles of group photography: Opened-eye, Gaze, Smile, Occluded faces, Face Orientation, Facial blur, Character center. Then we combined them and 83 generic…
    ▽ More

    Submitted 3 February, 2020;
    originally announced February 2020.

  • Widening and Squeezing: Towards Accurate and Efficient QNNs

    Authors:

    Chuanjian Liu,

    Kai Han,

    Yunhe Wang,

    Hanting Chen,

    Qi Tian,

    Chunjing Xu

    Abstract:

    Quantization neural networks (QNNs) are very attractive to the industry because their extremely cheap calculation and storage overhead, but their performance is still worse than that of networks with full-precision parameters. Most of existing methods aim to enhance performance of QNNs especially binary neural networks by exploiting more effective training techniques. However, we find the represen…
    ▽ More

    Submitted 12 February, 2020; v1 submitted 2 February, 2020;
    originally announced February 2020.

  • Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network

    Authors:

    Jun Qi,

    Hu Hu,

    Yannan Wang,

    Chao-Han Huck Yang,

    Sabato Marco Siniscalchi,

    Chin-Hui Lee

    Abstract:

    We propose a tensor-to-vector regression approach to multi-channel speech enhancement in order to address the issue of input size explosion and hidden-layer size expansion. The key idea is to cast the conventional deep neural network (DNN) based vector-to-vector regression formulation under a tensor-train network (TTN) framework. TTN is a recently emerged solution for compact representation of dee…
    ▽ More

    Submitted 2 February, 2020;
    originally announced February 2020.

  • The Sylvester Graphical Lasso (SyGlasso)

    Authors:

    Yu Wang,

    Byoungwook Jang,

    Alfred Hero

    Abstract:

    This paper introduces the Sylvester graphical lasso (SyGlasso) that captures multiway dependencies present in tensor-valued data. The model is based on the Sylvester equation that defines a generative model. The proposed model complements the tensor graphical lasso (Greenewald et al., 2019) that imposes a Kronecker sum model for the inverse covariance matrix by providing an alternative Kronecker s…
    ▽ More

    Submitted 1 February, 2020;
    originally announced February 2020.

  • Few-Shot Scene Adaptive Crowd Counting Using Meta-Learning

    Authors:

    Mahesh Kumar Krishna Reddy,

    Mohammad Hossain,

    Mrigank Rochan,

    Yang Wang

    Abstract:

    We consider the problem of few-shot scene adaptive crowd counting. Given a target camera scene, our goal is to adapt a model to this specific scene with only a few labeled images of that scene. The solution to this problem has potential applications in numerous real-world scenarios, where we ideally like to deploy a crowd counting model specially adapted to a target camera. We accomplish this chal…
    ▽ More

    Submitted 1 February, 2020;
    originally announced February 2020.

  • Exact and Robust Reconstruction of Integer Vectors Based on Multidimensional Chinese Remainder Theorem (MD-CRT)

    Authors:

    Li Xiao,

    Xiang-Gen Xia,

    Yu-Ping Wang

    Abstract:

    The robust Chinese remainder theorem (CRT) has been recently proposed for robustly reconstructing a large nonnegative integer from erroneous remainders. It has found applications in signal processing, including phase unwrapping and frequency estimation under sub-Nyquist sampling. Motivated by the applications in multidimensional (MD) signal processing, in this paper we propose the MD-CRT and robus…
    ▽ More

    Submitted 31 January, 2020;
    originally announced February 2020.

  • A Generative Adversarial Network for AI-Aided Chair Design

    Authors:

    Zhibo Liu,

    Feng Gao,

    Yizhou Wang

    Abstract:

    We present a method for improving human design of chairs. The goal of the method is generating enormous chair candidates in order to facilitate human designer by creating sketches and 3d models accordingly based on the generated chair design. It consists of an image synthesis module, which learns the underlying distribution of training dataset, a super-resolution module, which improve quality of g…
    ▽ More

    Submitted 31 January, 2020;
    originally announced January 2020.

  • Edit Distance Embedding using Convolutional Neural Networks

    Authors:

    Xinyan Dai,

    Xiao Yan,

    Kaiwen Zhou,

    Yuxuan Wang,

    Han Yang,

    James Cheng

    Abstract:

    Edit-distance-based string similarity search has many applications such as spell correction, data de-duplication, and sequence alignment. However, computing edit distance is known to have high complexity, which makes string similarity search challenging for large datasets. In this paper, we propose a deep learning pipeline (called CNN-ED) that embeds edit distance into Euclidean distance for fast…
    ▽ More

    Submitted 31 January, 2020;
    originally announced January 2020.

  • Deep Learning Based Unsupervised and Semi-supervised Classification for Keratoconus

    Authors:

    Nicole Hallett,

    Kai Yi,

    Josef Dick,

    Christopher Hodge,

    Gerard Sutton,

    Yu Guang Wang,

    Jingjing You

    Abstract:

    The transparent cornea is the window of the eye, facilitating the entry of light rays and controlling focusing the movement of the light within the eye. The cornea is critical, contributing to 75% of the refractive power of the eye. Keratoconus is a progressive and multifactorial corneal degenerative disease affecting 1 in 2000 individuals worldwide. Currently, there is no cure for keratoconus oth…
    ▽ More

    Submitted 30 January, 2020;
    originally announced January 2020.

  • CosmoVAE: Variational Autoencoder for CMB Image Inpainting

    Authors:

    Kai Yi,

    Yi Guo,

    Yanan Fan,

    Jan Hamann,

    Yu Guang Wang

    Abstract:

    Cosmic microwave background radiation (CMB) is critical to the understanding of the early universe and precise estimation of cosmological constants. Due to the contamination of thermal dust noise in the galaxy, the CMB map that is an image on the two-dimensional sphere has missing observations, mainly concentrated on the equatorial region. The noise of the CMB map has a significant impact on the e…
    ▽ More

    Submitted 30 January, 2020;
    originally announced January 2020.

  • Dual Convolutional LSTM Network for Referring Image Segmentation

    Authors:

    Linwei Ye,

    Zhi Liu,

    Yang Wang

    Abstract:

    We consider referring image segmentation. It is a problem at the intersection of computer vision and natural language understanding. Given an input image and a referring expression in the form of a natural language sentence, the goal is to segment the object of interest in the image referred by the linguistic query. To this end, we propose a dual convolutional LSTM (ConvLSTM) network to tackle thi…
    ▽ More

    Submitted 30 January, 2020;
    originally announced January 2020.

  • ABSent: Cross-Lingual Sentence Representation Mapping with Bidirectional GANs

    Authors:

    Zuohui Fu,

    Yikun Xian,

    Shijie Geng,

    Yingqiang Ge,

    Yuting Wang,

    Xin Dong,

    Guang Wang,

    Gerard de Melo

    Abstract:

    A number of cross-lingual transfer learning approaches based on neural networks have been proposed for the case when large amounts of parallel text are at our disposal. However, in many real-world settings, the size of parallel annotated training data is restricted. Additionally, prior cross-lingual mapping research has mainly focused on the word level. This raises the question of whether such tec…
    ▽ More

    Submitted 29 January, 2020;
    originally announced January 2020.

  • Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning

    Authors:

    Ming Yin,

    Yu-Xiang Wang

    Abstract:

    We consider the problem of off-policy evaluation for reinforcement learning, where the goal is to estimate the expected reward of a target policy $π$ using offline data collected by running a logging policy $μ$. Standard importance-sampling based approaches for this problem suffer from a variance that scales exponentially with time horizon $H$, which motivates a splurge of recent interest in alter…
    ▽ More

    Submitted 29 January, 2020;
    originally announced January 2020.

  • MGCN: Descriptor Learning using Multiscale GCNs

    Authors:

    Yiqun Wang,

    Jing Ren,

    Dong-Ming Yan,

    Jianwei Guo,

    Xiaopeng Zhang,

    Peter Wonka

    Abstract:

    We propose a novel framework for computing descriptors for characterizing points on three-dimensional surfaces. First, we present a new non-learned feature that uses graph wavelets to decompose the Dirichlet energy on a surface. We call this new feature wavelet energy decomposition signature (WEDS). Second, we propose a new multiscale graph convolutional network (MGCN) to transform a non-learned f…
    ▽ More

    Submitted 28 January, 2020;
    originally announced January 2020.

  • COKE: Communication-Censored Kernel Learning for Decentralized Non-parametric Learning

    Authors:

    Ping Xu,

    Yue Wang,

    Xiang Chen,

    Tian Zhi

    Abstract:

    This paper studies the decentralized optimization and learning problem where multiple interconnected agents aim to learn an optimal decision function defined over a reproducing kernel Hilbert (RKH) space by jointly minimizing a global objective function, with access to locally observed data only. As a non-parametric approach, kernel learning faces a major challenge in distributed implementation: t…
    ▽ More

    Submitted 27 January, 2020;
    originally announced January 2020.

  • StageNet: Stage-Aware Neural Networks for Health Risk Prediction

    Authors:

    Junyi Gao,

    Cao Xiao,

    Yasha Wang,

    Wen Tang,

    Lucas M. Glass,

    Jimeng Sun

    Abstract:

    Deep learning has demonstrated success in health risk prediction especially for patients with chronic and progressing conditions. Most existing works focus on learning disease Network (StageNet) model to extract disease stage information from patient data and integrate it into risk prediction. StageNet is enabled by (1) a stage-aware long short-term memory (LSTM) module that extracts health stage…
    ▽ More

    Submitted 24 January, 2020;
    originally announced January 2020.

  • Cellular Decomposition for Non-repetitive Coverage Task with Minimum Discontinuities

    Authors:

    Tong Yang,

    Jaime Valls Miro,

    Qianen Lai,

    Yue Wang,

    Rong Xiong

    Abstract:

    A mechanism to derive non-repetitive coverage path solutions with a proven minimal number of discontinuities is proposed in this work, with the aim to avoid unnecessary, costly end effector lift-offs for manipulators. The problem is motivated by the automatic polishing of an object. Due to the non-bijective mapping between the workspace and the joint-space, a continuous coverage path in the worksp…
    ▽ More

    Submitted 26 January, 2020;
    originally announced January 2020.

  • An efficient algorithm for $1$-dimensional (persistent) path homology

    Authors:

    Tamal K. Dey,

    Tianqi Li,

    Yusu Wang

    Abstract:

    This paper focuses on developing an efficient algorithm for analyzing a directed network (graph) from a topological viewpoint. A prevalent technique for such topological analysis involves computation of homology groups and their persistence. These concepts are well suited for spaces that are not directed. As a result, one needs a concept of homology that accommodates orientations in input space. P…
    ▽ More

    Submitted 26 January, 2020;
    originally announced January 2020.

  • Deep Learning-based Image Compression with Trellis Coded Quantization

    Authors:

    Binglin Li,

    Mohammad Akbari,

    Jie Liang,

    Yang Wang

    Abstract:

    Recently many works attempt to develop image compression models based on deep learning architectures, where the uniform scalar quantizer (SQ) is commonly applied to the feature maps between the encoder and decoder. In this paper, we propose to incorporate trellis coded quantizer (TCQ) into a deep learning based image compression framework. A soft-to-hard strategy is applied to allow for back propa…
    ▽ More

    Submitted 26 January, 2020;
    originally announced January 2020.

  • Fast Dense Residual Network: Enhancing Global Dense Feature Flow for Text Recognition

    Authors:

    Zhao Zhang,

    Zemin Tang,

    Yang Wang,

    Jie Qin,

    Haijun Zhang,

    Shuicheng Yan

    Abstract:

    Deep Convolutional Neural Networks (CNNs), such as Dense Convolutional Networks (DenseNet), have achieved great success for image representation by discovering deep hierarchical information. However, most existing networks simply stacks the convolutional layers and hence failing to fully discover local and global feature information among layers. In this paper, we mainly explore how to enhance the…
    ▽ More

    Submitted 23 January, 2020;
    originally announced January 2020.

  • 6D Object Pose Regression via Supervised Learning on Point Clouds

    Authors:

    Ge Gao,

    Mikko Lauri,

    Yulong Wang,

    Xiaolin Hu,

    Jianwei Zhang,

    Simone Frintrop

    Abstract:

    This paper addresses the task of estimating the 6 degrees of freedom pose of a known 3D object from depth information represented by a point cloud. Deep features learned by convolutional neural networks from color information have been the dominant features to be used for inferring object poses, while depth information receives much less attention. However, depth information contains rich geometri…
    ▽ More

    Submitted 24 January, 2020;
    originally announced January 2020.

  • SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency

    Authors:

    Zhengang Li,

    Yifan Gong,

    Xiaolong Ma,

    Sijia Liu,

    Mengshu Sun,

    Zheng Zhan,

    Zhenglun Kong,

    Geng Yuan,

    Yanzhi Wang

    Abstract:

    Structured weight pruning is a representative model compression technique of DNNs for hardware efficiency and inference accelerations. Previous works in this area leave great space for improvement since sparse structures with combinations of different structured pruning schemes are not exploited fully and efficiently. To mitigate the limitations, we propose SS-Auto, a single-shot, automatic struct…
    ▽ More

    Submitted 23 January, 2020;
    originally announced January 2020.

  • Impact-aware humanoid robot motion generation with a quadratic optimization controller

    Authors:

    Yuquan Wang,

    Arnaud Tanguy,

    Pierre Gergondet,

    Abderrahmane Kheddar

    Abstract:

    Impact-aware tasks (i.e. on purpose impacts) are not handled in multi-objective whole body controllers of hu-manoid robots. This leads to the fact that a humanoid robot typically operates at near-zero velocity to interact with the external environment. We explicitly investigate the propagation of the impact-induced velocity and torque jumps along the structure linkage and propose a set of constrai…
    ▽ More

    Submitted 23 January, 2020;
    originally announced January 2020.

  • BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method

    Authors:

    Xiaolong Ma,

    Zhengang Li,

    Yifan Gong,

    Tianyun Zhang,

    Wei Niu,

    Zheng Zhan,

    Pu Zhao,

    Jian Tang,

    Xue Lin,

    Bin Ren,

    Yanzhi Wang

    Abstract:

    Accelerating DNN execution on various resource-limited computing platforms has been a long-standing problem. Prior works utilize l1-based group lasso or dynamic regularization such as ADMM to perform structured pruning on DNN models to leverage the parallel computing architectures. However, both of the pruning dimensions and pruning methods lack universality, which leads to degraded performance an…
    ▽ More

    Submitted 22 January, 2020;
    originally announced January 2020.

  • Active Perception with A Monocular Camera for Multiscopic Vision

    Authors:

    Weihao Yuan,

    Rui Fan,

    Michael Yu Wang,

    Qifeng Chen

    Abstract:

    We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation for robotic applications. Unlike multi-view stereo with images captured at unconstrained camera poses, the proposed system actively controls a robot arm with a mounted camera to capture a sequence of images in horizontally or vertically aligned positions with the same parallax.…
    ▽ More

    Submitted 22 January, 2020;
    originally announced January 2020.

  • Causality based Feature Fusion for Brain Neuro-Developmental Analysis

    Authors:

    Peyman Hosseinzadeh Kassani,

    Li Xiao,

    Gemeng Zhang,

    Julia M. Stephen,

    Tony W. Wilson,

    Vince D. Calhoun,

    Yu Ping Wang

    Abstract:

    Human brain development is a complex and dynamic process that is affected by several factors such as genetics, sex hormones, and environmental changes. A number of recent studies on brain development have examined functional connectivity (FC) defined by the temporal correlation between time series of different brain regions. We propose to add the directional flow of information during brain matura…
    ▽ More

    Submitted 22 January, 2020;
    originally announced January 2020.

  • VoiceCoach: Interactive Evidence-based Training for Voice Modulation Skills in Public Speaking

    Authors:

    Xingbo Wang,

    Haipeng Zeng,

    Yong Wang,

    Aoyu Wu,

    Zhida Sun,

    Xiaojuan Ma,

    Huamin Qu

    Abstract:

    The modulation of voice properties, such as pitch, volume, and speed, is crucial for delivering a successful public speech. However, it is challenging to master different voice modulation skills. Though many guidelines are available, they are often not practical enough to be applied in different public speaking situations, especially for novice speakers. We present VoiceCoach, an interactive evide…
    ▽ More

    Submitted 21 January, 2020;
    originally announced January 2020.

  • An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices

    Authors:

    Xiaolong Ma,

    Wei Niu,

    Tianyun Zhang,

    Sijia Liu,

    Fu-Ming Guo,

    Sheng Lin,

    Hongjia Li,

    Xiang Chen,

    Jian Tang,

    Kaisheng Ma,

    Bin Ren,

    Yanzhi Wang

    Abstract:

    Weight pruning has been widely acknowledged as a straightforward and effective method to eliminate redundancy in Deep Neural Networks (DNN), thereby achieving acceleration on various platforms. However, most of the pruning techniques are essentially trade-offs between model accuracy and regularity which lead to impaired inference accuracy and limited on-device acceleration performance. To solve th…
    ▽ More

    Submitted 20 January, 2020;
    originally announced January 2020.

  • BARNet: Bilinear Attention Network with Adaptive Receptive Field for Surgical Instrument Segmentation

    Authors:

    Zhen-Liang Ni,

    Gui-Bin Bian,

    Guan-An Wang,

    Xiao-Hu Zhou,

    Zeng-Guang Hou,

    Xiao-Liang Xie,

    Zhen Li,

    Yu-Han Wang

    Abstract:

    Surgical instrument segmentation is extremely important for computer-assisted surgery. Different from common object segmentation, it is more challenging due to the large illumination and scale variation caused by the special surgical scenes. In this paper, we propose a novel bilinear attention network with adaptive receptive field to solve these two challenges. For the illumination variation, the…
    ▽ More

    Submitted 20 January, 2020;
    originally announced January 2020.

  • Finding Optimal Points for Expensive Functions Using Adaptive RBF-Based Surrogate Model Via Uncertainty Quantification

    Authors:

    Ray-Bing Chen,

    Yuan Wang,

    C. F. Jeff Wu

    Abstract:

    Global optimization of expensive functions has important applications in physical and computer experiments. It is a challenging problem to develop efficient optimization scheme, because each function evaluation can be costly and the derivative information of the function is often not available. We propose a novel global optimization framework using adaptive Radial Basis Functions (RBF) based surro…
    ▽ More

    Submitted 19 January, 2020;
    originally announced January 2020.

  • SQLFlow: A Bridge between SQL and Machine Learning

    Authors:

    Yi Wang,

    Yang Yang,

    Weiguo Zhu,

    Yi Wu,

    Xu Yan,

    Yongfeng Liu,

    Yu Wang,

    Liang Xie,

    Ziyao Gao,

    Wenjing Zhu,

    Xiang Chen,

    Wei Yan,

    Mingjie Tang,

    Yuan Tang

    Abstract:

    Industrial AI systems are mostly end-to-end machine learning (ML) workflows. A typical recommendation or business intelligence system includes many online micro-services and offline jobs. We describe SQLFlow for developing such workflows efficiently in SQL. SQL enables developers to write short programs focusing on the purpose (what) and ignoring the procedure (how). Previous database systems exte…
    ▽ More

    Submitted 19 January, 2020;
    originally announced January 2020.

  • MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion

    Authors:

    Kaiyu Shan,

    Yongtao Wang,

    Zhuoying Wang,

    Tingting Liang,

    Zhi Tang,

    Ying Chen,

    Yangyan Li

    Abstract:

    To efficiently extract spatiotemporal features of video for action recognition, most state-of-the-art methods integrate 1D temporal convolution into a conventional 2D CNN backbone. However, they all exploit 1D temporal convolution of fixed kernel size (i.e., 3) in the network building block, thus have suboptimal temporal modeling capability to handle both long-term and short-term actions. To addre…
    ▽ More

    Submitted 24 January, 2020; v1 submitted 18 January, 2020;
    originally announced January 2020.

  • Unsupervised Learning of Camera Pose with Compositional Re-estimation

    Authors:

    Seyed Shahabeddin Nabavi,

    Mehrdad Hosseinzadeh,

    Ramin Fahimi,

    Yang Wang

    Abstract:

    We consider the problem of unsupervised camera pose estimation. Given an input video sequence, our goal is to estimate the camera pose (i.e. the camera motion) between consecutive frames. Traditionally, this problem is tackled by placing strict constraints on the transformation vector or by incorporating optical flow through a complex pipeline. We propose an alternative approach that utilizes a co…
    ▽ More

    Submitted 17 January, 2020;
    originally announced January 2020.

  • Plato Dialogue System: A Flexible Conversational AI Research Platform

    Authors:

    Alexandros Papangelis,

    Mahdi Namazifar,

    Chandra Khatri,

    Yi-Chia Wang,

    Piero Molino,

    Gokhan Tur

    Abstract:

    As the field of Spoken Dialogue Systems and Conversational AI grows, so does the need for tools and environments that abstract away implementation details in order to expedite the development process, lower the barrier of entry to the field, and offer a common test-bed for new ideas. In this paper, we present Plato, a flexible Conversational AI platform written in Python that supports any kind of…
    ▽ More

    Submitted 17 January, 2020;
    originally announced January 2020.

  • A Reliable Gravity Compensation Control Strategy for dVRK Robotic Arms With Nonlinear Disturbance Forces

    Authors:

    Hongbin Lin,

    C. W. Vincent Hui,

    Yan Wang,

    Anton Deguet,

    Peter Kazanzides,

    K. W. Samuel Au

    Abstract:

    External disturbance forces caused by nonlinear springy electrical cables in the Master Tool Manipulator (MTM) of the da Vinci Research Kit (dVRK) limits the usage of the existing gravity compensation methods. Significant motion drifts at the MTM tip are often observed when the MTM is located far from its identification trajectory, preventing the usage of these methods for the entire workspace rel…
    ▽ More

    Submitted 16 January, 2020;
    originally announced January 2020.

  • Understanding the Power of Persistence Pairing via Permutation Test

    Authors:

    Chen Cai,

    Yusu Wang

    Abstract:

    Recently many efforts have been made to incorporate persistence diagrams, one of the major tools in topological data analysis (TDA), into machine learning pipelines. To better understand the power and limitation of persistence diagrams, we carry out a range of experiments on both graph data and shape data, aiming to decouple and inspect the effects of different factors involved. To this end, we al…
    ▽ More

    Submitted 16 January, 2020;
    originally announced January 2020.

  • Outlier Detection Ensemble with Embedded Feature Selection

    Authors:

    Li Cheng,

    Yijie Wang,

    Xinwang Liu,

    Bin Li

    Abstract:

    Feature selection places an important role in improving the performance of outlier detection, especially for noisy data. Existing methods usually perform feature selection and outlier scoring separately, which would select feature subsets that may not optimally serve for outlier detection, leading to unsatisfying performance. In this paper, we propose an outlier detection ensemble framework with e…
    ▽ More

    Submitted 15 January, 2020;
    originally announced January 2020.

  • Improvement of an Approximated Self-Improving Sorter and Error Analysis of its Estimated Entropy

    Authors:

    Yujie Wang

    Abstract:

    The self-improving sorter proposed by Ailon et al. consists of two phases: a relatively long training phase and rapid operation phase. In this study, we have developed an efficient way to further improve this sorter by approximating its training phase to be faster but not sacrificing much performance in the operation phase. It is very necessary to ensure the accuracy of the estimated entropy when…
    ▽ More

    Submitted 15 January, 2020;
    originally announced January 2020.

  • Causal Discovery from Incomplete Data: A Deep Learning Approach

    Authors:

    Yuhao Wang,

    Vlado Menkovski,

    Hao Wang,

    Xin Du,

    Mykola Pechenizkiy

    Abstract:

    As systems are getting more autonomous with the development of artificial intelligence, it is important to discover the causal knowledge from observational sensory inputs. By encoding a series of cause-effect relations between events, causal networks can facilitate the prediction of effects from a given action and analyze their underlying data generation mechanism. However, missing data are ubiqui…
    ▽ More

    Submitted 15 January, 2020;
    originally announced January 2020.



  • Source link

    Write a comment:
    *

    Your email address will not be published.