2022 Vol. 48, No. 8

Display Method:
Volume 48 Issue82022
iconDownload (90388) 1550 iconPreview
Double drive adaptive super-resolution reconstruction method of remote sensing images for object detection
CHENG Keyang, RONG Lan, JIANG Senlin, ZHAN Yongzhao
2022, 48(8): 1343-1352. doi: 10.13700/j.bh.1001-5965.2021.0517
Abstract:

The existing optical remote sensing image super-resolution reconstruction method is mainly to generate visually satisfactory images, and does not take into account the particularity of the subsequent target detection task, so it cannot be effectively applied to target detection. Therefore, a double ...

Traffic signal timing method based on deep reinforcement learning and extended Kalman filter
WU Lan, WU Yuanming, KONG Fanshi, LI Binquan
2022, 48(8): 1353-1363. doi: 10.13700/j.bh.1001-5965.2021.0529
Abstract:

The deep Q-learning network (DQN) has become an effective method to solve the traffic signal timing problem because of its strong perception and decision-making ability. However, in the field of traffic signal timing systems, the problem of parameter uncertainty caused by external environment distur...

Single image dehazing method based on improved atmospheric scattering model
YANG Yong, QIU Genying, HUANG Shuying, WAN Weiguo, HU Wei
2022, 48(8): 1364-1375. doi: 10.13700/j.bh.1001-5965.2021.0532
Abstract:

Images obtained in foggy conditions often suffer from low contrast, color loss, and noise. At present, many traditional dehazing methods mainly focus on solving problems such as low contrast and color loss, but do not consider the hidden noise light scattered by dust particles in the air, resulting ...

Region-hierarchical predictive coding for quantized block compressive sensing
LIU Hao, ZHENG Haoran, HUANG Rong
2022, 48(8): 1376-1382. doi: 10.13700/j.bh.1001-5965.2021.0511
Abstract:

During the predictive coding of quantized block compressive sensing, a large quantity of inefficient candidates will lead to low rate-distortion performance. To efficiently reduce the encoding distortion, this paper proposes a region-hierarchical predictive coding method for quantized block compress...

Dual coding unit partition optimization algorithm of HEVC
LIU Meiqin, XU Chenming, YAO Chao, LIN Chunyu, ZHAO Yao
2022, 48(8): 1383-1389. doi: 10.13700/j.bh.1001-5965.2021.0528
Abstract:

To resolve the conflict between the increasing amount of video data and the demand for high-quality video experience, HEVC has boosted the compression performance by 50% based on dramatically increased complexity of H.264/AVC. In this paper, a fast coding unit (CU) partition algorithm is proposed to...

Object tracking method based on IoU-constrained Siamese network
ZHOU Lifang, LIU Jinlan, LI Weisheng, LEI Bangjun, HE Yu, WANG Yihan
2022, 48(8): 1390-1398. doi: 10.13700/j.bh.1001-5965.2021.0533
Abstract:

The tracking method based on the Siamese network trains the tracking model offline. Therefore, it maintains a good balance between tracking accuracy and speed, which attracts the interest of a growing number of researchers recently. The existing Siamese network object tracking method uses a fixed th...

Long-tail image captioning with dynamic semantic memory network
LIU Hao, YANG Xiaoshan, XU Changsheng
2022, 48(8): 1399-1408. doi: 10.13700/j.bh.1001-5965.2021.0518
Abstract:

Image captioning takes image as input and outputs a text sequence. Nowadays, most images included in image captioning datasets are captured from daily life of internet users. Captions of these images are consequently composed of a few common words and many rare words. Most existing studies focus on ...

Medical image segmentation based on multi-layer features and spatial information distillation
ZHENG Yuxiang, HAO Pengyi, WU Dong'en, BAI Cong
2022, 48(8): 1409-1417. doi: 10.13700/j.bh.1001-5965.2021.0504
Abstract:

U-Net is currently the most widely used segmentation model, and its "coding-decoding" structure has also become the most commonly used structure for building medical image segmentation models. Although U-Net has achieved very high segmentation accuracy in many fields, but there are problems such as ...

No reference quality assessment method for contrast-distorted images based on three elements of color
DING Yingqiu, YANG Yang, CHENG Ming, ZHANG Weiming
2022, 48(8): 1418-1427. doi: 10.13700/j.bh.1001-5965.2021.0509
Abstract:

Image quality assessment is a basic and challenging problem in the field of image processing, among which the contrast distortion has a greater impact on the perception of image quality. However, there is relatively little research on the no-reference image quality assessment of contrast-distorted i...

Knowledge graph completion based on graph contrastive attention network
LIU Danyang, FANG Quan, ZHANG Xiaowei, HU Jun, QIAN Shengsheng, XU Changsheng
2022, 48(8): 1428-1435. doi: 10.13700/j.bh.1001-5965.2021.0523
Abstract:

Knowledge graph (KG) completion aims to predict missing links based on the known triples in a knowledge base. Since most KG completion methods dealt with triples independently without capture the heterogeneous structure of KG and the rich information that was inherent the in neighbor nodes, which re...

Image difference caption generation with text information assistance
CHEN Weijing, WANG Weiying, JIN Qin
2022, 48(8): 1436-1444. doi: 10.13700/j.bh.1001-5965.2021.0526
Abstract:

The image captioning task requires the machine to automatically generate natural language text to describe the semantic content of the image, thus transforming visual information into textual descriptions that facilitate image management, retrieval, classification, and other tasks. Image difference ...

A high-speed spectral clustering method in Fourier domain for massive data
ZHANG Man, XU Zhaorui, SHEN Xiangjun
2022, 48(8): 1445-1454. doi: 10.13700/j.bh.1001-5965.2021.0537
Abstract:

Spectral clustering is widely used in data mining and pattern recognition. However, due to the high computational cost of eigenvector solutions and the huge memory requirements brought by big data, spectral clustering algorithm is greatly limited when it is applied to large-scale data. Therefore, th...

Crowd density estimation for fisheye images
YANG Jialin, LIN Chunyu, NIE Lang, LIU Meiqin, ZHAO Yao
2022, 48(8): 1455-1463. doi: 10.13700/j.bh.1001-5965.2021.0520
Abstract:

Aiming at the problem that the traditional crowd density estimation methods are not applicable under the distortion of fisheye images, this paper presents a crowd density estimation method for fisheye images, which realizes the monitoring of human traffic in scene of using fisheye lens. For model st...

A full-scale feature aggregation network for remote sensing image change detection
LIU Guoqiang, FANG Sheng, LI Zhe
2022, 48(8): 1464-1470. doi: 10.13700/j.bh.1001-5965.2021.0522
Abstract:

Change detection (CD) is an important task of remote sensing, always facing many pseudo changes and large scale variations. However, existing methods mainly focus on modeling difference features and neglect extracting sufficient information from the original images, which affects feature discriminat...

Improved spatial and channel information based global smoke attention network
DONG Zeshu, YUAN Feiniu, XIA Xue
2022, 48(8): 1471-1479. doi: 10.13700/j.bh.1001-5965.2021.0549
Abstract:

Smoke has the characteristics of semi-transparency, irregularity and blurry boundaries, leading to the challenging task of image smoke segmentation. To solve these problems, we propose an attention modeling method to extract the correlation of long-distance information. The attention method can capt...

Hypernymy detection based on graph contrast
ZHANG Yali, FANG Quan, WANG Yunxin, Hu Jun, QIAN Shengsheng, XU Changsheng
2022, 48(8): 1480-1486. doi: 10.13700/j.bh.1001-5965.2021.0524
Abstract:

Hypernymy is the foundation of many downstream tasks in natural language processing (NLP), so hypernymy detection has received considerable attention in the field of NLP. Adopting random initialization word vectors, existing word embedding methods cannot well capture the asymmetry and transferabilit...

3D object detection based on multi-path feature pyramid network for stereo images
SU Kaiqi, YAN Weiqing, XU Jindong
2022, 48(8): 1487-1494. doi: 10.13700/j.bh.1001-5965.2021.0525
Abstract:

3D object detection is an important scene understanding task in computer vision and autonomous driving. However, most of these methods do not fully consider the large differences in scales between multiple objects. Thus, objects with a small scale are easily ignored, resulting in low detection accur...

Prediction model of COVID-19 based on spatiotemporal attention mechanism
BAO Xin, TAN Zhiyi, BAO Bingkun, XU Changsheng
2022, 48(8): 1495-1504. doi: 10.13700/j.bh.1001-5965.2021.0535
Abstract:

The continuous spread of the COVID-19 has brought profound impacts on human society. For the prevention and control of virus spreading, it is critical to predict the future trend of epidemic situation. Existing studies on COVID-19 spread prediction, based on classic SEIR models or naive time-series ...

Hyperspectral image compression method based on 3D Saab transform
XU Aiming, HUANG Yuxing, SHEN Qiu
2022, 48(8): 1505-1514. doi: 10.13700/j.bh.1001-5965.2021.0521
Abstract:

Hyperspectral images contain rich and valuable spectral information, which brings great challenges to storage and transmission. However, most current hyperspectral image compression methods cannot consider spatial and spectral redundancy simultaneously, resulting in limited compression performance. ...

A real scene underwater semantic segmentation method and related dataset
MA Zhiwei, LI Haojie, FAN Xin, LUO Zhongxuan, LI Jianjun, WANG Zhihui
2022, 48(8): 1515-1524. doi: 10.13700/j.bh.1001-5965.2021.0527
Abstract:

Underwater object recognition and segmentation with high accuracy have become a challenge with the development of underwater object grabbing technology. The existing underwater object detection technology can only give the general position of an object, unable to give more detailed information such ...

Appearance and action adaptive target tracking method
XIONG Junyao, WANG Rong, SUN Yibo
2022, 48(8): 1525-1533. doi: 10.13700/j.bh.1001-5965.2021.0597
Abstract:

On the basis of DaSiamese-RPN, a target tracking approach of appearance and action adaptation is proposed to limit the effect of appearance deformation on target tracking when the target is moving.First of all, the appearance and action adaptive module is introduced in the subnet of the Siamese netw...

Multi-label cooperative learning for cross domain person re-identification
LI Hui, ZHANG Xiaowei, ZHAO Xinpeng, LU Xinyu
2022, 48(8): 1534-1542. doi: 10.13700/j.bh.1001-5965.2021.0600
Abstract:

Cross-domain was an important application scenario in person re-identification, but the apparent difference of person image in illumination condition, shooting angle, imaging background and style between the source domain and target domain was the most important factor that leads to the decline of t...

Player movement data analysis on soccer field reconstruction
JI Xiaoqi, SONG Zikai, YU Junqing
2022, 48(8): 1543-1552. doi: 10.13700/j.bh.1001-5965.2022.0131
Abstract:

Objective In soccer matches, player data analysis is crucial to improve the viewing experience for viewers and to aid coaches in performance evaluation. The difficulty of player data analysis is how to locate the coordinates of players on the soccer field, i.e., how to determine the mapping relation...