Publications by authors named "Guangming Shi"

Semantic communication has attracted considerable interest due to its potential to support emerging human-centric services, such as holographic communications, extended reality (XR), and human-machine interactions. Different from traditional communication systems that focus on minimizing the symbol-level distortion (e.g.

View Article and Find Full Text PDF

Coded Aperture Snapshot Spectral Imaging (CASSI) multiplexes 3D Hyperspectral Images (HSIs) into a 2D sensor to capture dynamic spectral scenes, which, however, sacrifices the spatial information. Dual-Camera Compressive Hyperspectral Imaging (DCCHI) enhances CASSI by incorporating a Panchromatic (PAN) camera to compensate for the loss of spatial information in CASSI. However, the dual-camera structure of DCCHI disrupts the diagonal property of the product of the sensing matrix and its transpose, making it difficult to efficiently and accurately solve the data subproblem in closed-form and thereby hindering the application of model-based methods and Deep Unfolding Networks (DUNs) that rely on such a closed-form solution.

View Article and Find Full Text PDF

The implementation of synthetic aperture interferometry via photonic integrated circuits (PICs) holds significant potential for miniaturized high-resolution imaging systems, particularly in space optics and precision metrology. However, its advancement remains constrained by two fundamental limitations: restricted baseline reconfigurability (conventionally fixed at <20 mm) and aliasing artifacts induced by quasi-uniform sampling patterns during two-dimensional image reconstruction. To overcome these challenges, we develop a hybrid fiber-PIC architecture enabling continuous baseline adjustment from 10 mm to 50 mm through dynamically tunable fiber delay lines, integrated with a wheel-optimized non-uniform radial sampling strategy designed to minimize spectral matrix mapping errors by prioritizing baseline angles intersecting frequency grid points, achieving 1.

View Article and Find Full Text PDF

Face photo-sketch recognition task plays a crucial role in forensic investigation, human visual perception, and facial biometrics applications. The substantial modality gap between photographs and sketches, compounded by the influence of the semantic gap, poses a formidable challenge to recognition tasks. This study aims to propose an effective electroencephalography (EEG)-based approach to bridge this gap.

View Article and Find Full Text PDF

The mechanical properties of the loess in the Ili region of China deteriorate significantly when it is subjected to the dry-wet cycles. Attributed to the critical role played by the mica content for the mechanical deterioration of Ili loess, a series of laboratory tests, including the X-ray diffraction (XRD) tests, the triaxial compression tests, the scanning electron microscopy (SEM), and other methods, were carried out to investigate the macroscopic and microscopic properties of Ili loess under different dry-wet cycles (i.e.

View Article and Find Full Text PDF

This paper introduces a novel self-supervised learning framework for enhancing 3D perception in autonomous driving scenes. Specifically, our approach, namely NCLR, focuses on 2D-3D neural calibration, a novel pretext task that estimates the rigid pose aligning camera and LiDAR coordinate systems. First, we propose the learnable transformation alignment to bridge the domain gap between image and point cloud data, converting features into a unified representation space for effective comparison and matching.

View Article and Find Full Text PDF

As a type of multi-dimensional sequential data, the spatial and temporal dependencies of electroencephalogram (EEG) signals should be further investigated. Thus, in this paper, we propose a novel spatial-temporal progressive attention model (STPAM) to improve EEG classification in rapid serial visual presentation (RSVP) tasks. STPAM employs a progressive approach using three sequential spatial experts to learn brain region topology and mitigate interference from irrelevant areas.

View Article and Find Full Text PDF

Brain-computer interfaces (BCIs) based on steady-state visual evoked potential (SSVEP) have attracted much attention due to their high efficiency and accuracy. The SSVEP paradigm and decoding methods have been extensively studied and achieved remarkable results. This study proposed two modulation methods for the SSVEP paradigm, namely color inversion modulation and brightness compression modulation.

View Article and Find Full Text PDF

Existing models for recognizing human-object interaction (HOI) in videos mainly rely on visual information for reasoning and generally treat recognition tasks as traditional multi-classification problems, where labels are represented by numbers. This supervised learning method discards semantic information in the labels and ignores advanced semantic relationships between actual categories. In fact, natural language contains a wealth of linguistic knowledge that humans have distilled about human-object interaction, and the category text contains a large amount of semantic relationships between texts.

View Article and Find Full Text PDF

Unsupervised domain adaptation (UDA) techniques, extensively studied in hyperspectral image (HSI) classification, aim to use labeled source domain data and unlabeled target domain data to learn domain invariant features for cross-scene classification. Compared to natural images, numerous spectral bands of HSIs provide abundant semantic information, but they also increase the domain shift significantly. In most existing methods, both explicit alignment and implicit alignment simply align feature distribution, ignoring domain information in the spectrum.

View Article and Find Full Text PDF

The development of novel host-guest catalysts with nanoscale active sites provides new opportunities to gain highly efficient catalytic performance for the oxidation of aromatic alcohols into high-value compounds. In this work, we develop a green and facile synthesis method to incorporate Pd nanoparticles (Pd) into a robust hierarchical porous Zr-metal-organic framework (Zr-IPA) through in situ solvent-free synthesis and wet reduction techniques. The optimized Pd/Zr-IPA catalyst exhibits exceptional benzyl alcohol oxidation performance, selectively oxidizing 95.

View Article and Find Full Text PDF

A key challenge in developing a robust electroencephalography (EEG)-based brain-computer interface (BCI) is obtaining reliable classification performance across multiple days. In particular, EEG-based motor imagery (MI) BCI faces large variability and low signal-to-noise ratio. To address these issues, collecting a large and reliable dataset is critical for learning of cross-session and cross-subject patterns while mitigating EEG signals inherent instability.

View Article and Find Full Text PDF

The soft- hard- interbedded rock slope in cold regions generally undergo the differential weathering due to the freeze-thaw effects, for which the irregular rock fractures increase the risk of geological disasters occurrence. To investigate the failure mechanism of the rock slope, both the numerical simulation and theoretical analysis were adopted in the present research. The structural integrity of the soft- hard- interbedded rock slope subjected to three different conditions (e.

View Article and Find Full Text PDF

Conventional spectral image demosaicing algorithms rely on pixels' spatial or spectral correlations for reconstruction. Due to the missing data in the multispectral filter array (MSFA), the estimation of spatial or spectral correlations is inaccurate, leading to poor reconstruction results, and these algorithms are time-consuming. Deep learning-based spectral image demosaicing methods directly learn the nonlinear mapping relationship between 2D spectral mosaic images and 3D multispectral images.

View Article and Find Full Text PDF

Active Domain Adaptation (ADA) improves knowledge transfer efficiency from the labeled source domain to the unlabeled target domain by selecting a few target sample labels. However, most existing active sampling methods ignore the local uncertainty of neighbors in the target domain,making it easier to pick out anomalous samples that are detrimental to the model. To address this problem, we present a new approach to active domain adaptation called Local Uncertainty Energy Transfer (LUET), which integrates active learning of local uncertainty confusion and energy transfer alignment constraints into a unified framework.

View Article and Find Full Text PDF

With the differential sensitivity and high time resolution, event cameras can record detailed motion clues, which form a complementary advantage with frame-based cameras to enhance the object tracking, especially in challenging dynamic scenes. However, how to better match heterogeneous event-image data and exploit rich complementary cues from them still remains an open issue. In this paper, we align event-image modalities by proposing a motion adaptive event sampling method, and we revisit the cross-complementarities of event-image data to design a bidirectional-enhanced fusion framework.

View Article and Find Full Text PDF

Motion deblurring is a highly ill-posed problem due to the significant loss of motion information in the blurring process. Complementary informative features from auxiliary sensors such as event cameras can be explored for guiding motion deblurring. The event camera can capture rich motion information asynchronously with microsecond accuracy.

View Article and Find Full Text PDF

We propose a prototype called a flexible integrated resolution and efficient light-imaging-expanded synthetic system (FIREFLIES). This paper describes the design, manufacturing, and experimental demonstration of the proposed system. FIREFLIES enables interferometric imaging at approximately 1550 nm using a variable baseline sampling technique, in which the baseline-collected light field forms interference fringes that are captured by an on-chip photodetector.

View Article and Find Full Text PDF

In the Image Aesthetics Computing (IAC) field, most prior methods leveraged the off-the-shelf backbones pre-trained on the large-scale ImageNet database. While these pre-trained backbones have achieved notable success, they often overemphasize object-level semantics and fail to capture the high-level concepts of image aesthetics, which may only achieve suboptimal performances. To tackle this long-neglected problem, we propose a multi-modality multi-attribute contrastive pre-training framework, targeting at constructing an alternative to ImageNet-based pre-training for IAC.

View Article and Find Full Text PDF
Article Synopsis
  • Loess landslides pose a major risk in arid and semi-arid regions, particularly due to factors like snowmelt and rainfall, necessitating robust monitoring systems for effective stability analysis and early warnings.
  • This study examines the Kalahaisu landslide in Xinyuan County by evaluating the effectiveness of different monitoring techniques in terms of time and spatial resolution, as well as data accuracy.
  • Findings suggest that monitoring instruments with higher resolution lead to better understanding of landslide behavior, and advanced technologies like Synthetic Aperture Radar (SAR) could significantly improve prediction and prevention strategies for future loess landslides.
View Article and Find Full Text PDF

To mitigate the societal impact of the COVID-19 pandemic, China implemented long-term restrictive measures. The sudden liberalization at the end of 2022 disrupted residents' daily routines, making it scientifically intriguing to explore its effect on air quality. Taking Chongqing City in Southwest China as an example, we examined the impact of restriction liberalization on air quality, identified potential sources of pollutants, simulated the effects of abrupt anthropogenic control relaxation using a Random Forest Model, and applied an optimized model to predict the post-liberalization pollutant concentrations.

View Article and Find Full Text PDF

Previous deep learning-based event denoising methods mostly suffer from poor interpretability and difficulty in real-time processing due to their complex architecture designs. In this paper, we propose window-based event denoising, which simultaneously deals with a stack of events while existing element-based denoising focuses on one event each time. Besides, we give the theoretical analysis based on probability distributions in both temporal and spatial domains to improve interpretability.

View Article and Find Full Text PDF

Deep learning-based video denoising methods have achieved great performance improvements in recent years. However, the expensive computational cost arising from sophisticated network design has severely limited their applications in real-world scenarios. To address this practical weakness, we propose a multiscale spatio-temporal memory network for fast video denoising, named MSTMN, aiming at striking an improved trade-off between cost and performance.

View Article and Find Full Text PDF

Preparation of multifunctional metal-organic frameworks (MOFs) offers new opportunities to obtain ultrahigh synergistic catalytic performance for heterogeneous reactions; however, the application of a one-pot method for preparing multifunctional MOFs remains challenging. Herein, we develop a one-pot green route for synthesizing bimetallic nitro-functionalized UiO-66(Zr-Hf)-NO with hierarchical porosity under solvent-free conditions. The optimal UiO-66(Zr-Hf)-NO shows an ultrahigh enhancement of oxidative desulfurization (ODS) efficiency to oxidize sulfur compounds (1000 ppm sulfur) in a model fuel at 40 °C within 12 min due to the introduction of more active Hf sites in the nodes, the increased Lewis acidity of Zr/Hf-O nodes by the electron-withdrawing NO group, and the enhanced diffusion rates by the mesopores.

View Article and Find Full Text PDF