Multiple one-shot image generation via deep structure reshuffle.

Yao Gou, Min Li, Yusen Zhang, Xianjie Zhang, Yujie He

Neural Netw

Xi'an High-Tech Research Institute, Xi'an, 710025, China.

Published: July 2025

One-shot image generation (OSG) models have attracted much attention in recent years because they need only a single training sample. However, existing OSG methods cannot simultaneously model multiple samples of different classes (e.g., "balloons" and "birds" images), which limits their application and development. To address this, we propose a unified framework called Multiple One-Shot image Generation (MultiOSG). The core design of MultiOSG is to disentangle an image into a structure code and a texture code, and to obtain a new structure code through the proposed Structure Reshuffle. Compared with the original structure code, the new one should have good local diversity and global controllability. The new structure code is then combined with the original texture code to produce the generated result. Qualitative and quantitative experiments show that MultiOSG is competitive on OSG tasks. Moreover, MultiOSG can model multiple images and produce random samples of various classes, which overcomes a limitation of classical OSG methods. Code will be released upon acceptance at https://github.com/gouayao/MultiOSG.
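
The abstract does not say how Structure Reshuffle is implemented, so the following is only a toy illustration of the general idea, not the authors' method. It assumes the structure code is a spatial grid of feature vectors and the texture code is a single global vector; the function names `local_reshuffle` and `combine`, the window size, and the concatenation step are all hypothetical. Cells are permuted only inside small non-overlapping windows, which is one simple way to get local variation while leaving the coarse layout unchanged.

```python
import numpy as np


def local_reshuffle(structure, window, rng):
    """Toy stand-in for a structure reshuffle (assumption, not the paper's algorithm).

    Permutes the feature vectors inside each non-overlapping window x window
    block of an (H, W, C) grid. No cell ever leaves its block.
    """
    h, w, c = structure.shape
    if h % window or w % window:
        raise ValueError("grid height and width must be divisible by window")
    out = structure.copy()
    for top in range(0, h, window):
        for left in range(0, w, window):
            # Flatten the block to a list of cells, shuffle the cells, put them back.
            block = out[top:top + window, left:left + window].reshape(-1, c)
            shuffled = rng.permutation(block)  # shuffles along the first axis
            out[top:top + window, left:left + window] = shuffled.reshape(window, window, c)
    return out


def combine(structure, texture):
    """Hypothetical combination step: tile the texture vector over the grid
    and concatenate it with the structure code along the channel axis."""
    h, w, _ = structure.shape
    tiled = np.broadcast_to(texture, (h, w, texture.shape[0]))
    return np.concatenate([structure, tiled], axis=-1)


rng = np.random.default_rng(0)
structure = rng.normal(size=(8, 8, 4))   # assumed structure code: 8x8 grid, 4 channels
texture = rng.normal(size=(16,))         # assumed texture code: 16-dim global vector

new_structure = local_reshuffle(structure, window=4, rng=rng)
decoder_input = combine(new_structure, texture)
```

In this sketch a decoder network (not shown) would consume `decoder_input`; a larger `window` would trade global layout fidelity for more local variation.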

DOI: http://dx.doi.org/10.1016/j.neunet.2025.107862

Similar Publications

SAID: Segment All Industrial Defects with Scene Prompts.

Sensors (Basel)

August 2025

School of Sino-Germany Intelligent Production, Shenzhen City Polytechnic, Shenzhen 518116, China.

Yican Huang, Junwei Zhu, Xiaopin Zhong, Yuanlong Deng

In the field of industrial inspection, image segmentation is a common method for surface inspection, capable of locating and segmenting the appearance defect areas of products. Most existing methods are trained specifically for particular products. The recent SAM (Segment Anything Model) serves as an image segmentation foundation model, capable of achieving zero-shot segmentation through diverse prompts.

Counting with ease: Class-agnostic counting via one-shot detection across diverse domains.

Neural Netw

August 2025

School of Advanced Technology, Xi'an Jiaotong-Liverpool University, Suzhou, 215123, China.

Zhongxing Peng, Bohui Guo, Shugong Xu

Class-agnostic counting is increasingly prevalent in industrial and agricultural applications. However, most deployable methods rely on density maps, which (1) struggle with background interference in complex scenes, and (2) fail to provide precise object locations, limiting downstream usability. The advancement of class-agnostic counting is hindered by suboptimal model designs and the lack of datasets with bounding box annotations.

Assessing Large Multimodal Models for One-Shot Learning and Interpretability in Biomedical Image Classification.

Adv Intell Syst

April 2025

Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham 27705, NC, USA.

Wenpin Hou, Qi Liu, Huifang Ma, Yilong Qu, Zhicheng Ji

Image classification plays a pivotal role in analyzing biomedical images, serving as a cornerstone for both biological research and clinical diagnostics. It is demonstrated that large multimodal models (LMMs), like GPT-4, excel in one-shot learning, generalization, interpretability, and text-driven image classification across diverse biomedical tasks. These tasks include the classification of tissues, cell types, cellular states, and disease status.

Biovision-Inspired Perovskite Intelligent Camera for Panchromatic and Metameric Sensing.

Adv Mater

August 2025

Guangdong Provincial Key Lab of Nano-Micro Materials Research, School of Advanced Materials, Peking University, Shenzhen, 518055, China.

Yu Li, Zikun Jin, Yujin Liu, Jian Wang, Shanshan Yu

Biological vision systems excel at acquiring and processing information, but there is often a trade-off between these capabilities. For instance, mantis shrimp possess exceptional spectral sensing but poor color perception due to limited neural processing. Taking the best of both worlds, the mantis shrimp's spectral detection ability and the human-like visual processing power are integrated to achieve full-color perception.

High-speed three-dimensional cross-sectional measurement of cultured neurons by scatterometry that improves resolution by an order of magnitude.

Opt Express

March 2025

Suguru Iwata, Tetsuya Hoshino, Sadao Aoki, Yosuke Takei, Masahide Itoh

In conventional three-dimensional (3D) measurements using lens imaging or holography, the practical sample size in visible light is as large as 50 µm, and the resolution is limited to about 5 µm due to image distortion caused by artifacts. Rigorous calculations of neuronal cell models confirm these limitations in the sample size and resolution for lens imaging. Scatterometry, on the other hand, is a technique to obtain 3D cross-sectional structures by measuring the diffraction patterns of a sample.
