Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

One-Shot image Generation (OSG) models have attracted much attention in recent years due to their relaxed requirements on the number of training samples (one is required). However, these methods cannot simultaneously model multiple samples of different classes (e.g., "balloons" and "birds" images), which limits the application and development of the models. To this end, we propose a unified framework to solve this problem, called Multiple One-Shot image Generation (MultiOSG) method. The core design of MultiOSG is to disentangle the image into a structure code and a texture code and obtain a new structure code through the proposed Structure Reshuffle. Compared with the original structure code, the new one should have good local diversity and global controllability. Then, the new structure and original texture codes are combined to obtain the corresponding generated results. Qualitative and quantitative experimental results show that our MultiOSG is competitive in the OSG tasks. Moreover, MultiOSG can model multiple images and produce random samples of various classes, which extends the limitation of the classical OSG method. Code will be released upon acceptance at https://github.com/gouayao/MultiOSG.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2025.107862DOI Listing

Publication Analysis

Top Keywords

one-shot image
12
image generation
12
structure code
12
multiple one-shot
8
structure reshuffle
8
model multiple
8
samples classes
8
structure
6
code
5
multiple
4

Similar Publications

In the field of industrial inspection, image segmentation is a common method for surface inspection, capable of locating and segmenting the appearance defect areas of products. Most existing methods are trained specifically for particular products. The recent SAM (Segment Anything Model) serves as an image segmentation foundation model, capable of achieving zero-shot segmentation through diverse prompts.

View Article and Find Full Text PDF

Class-agnostic counting is increasingly prevalent in industrial and agricultural applications. However, most deployable methods rely on density maps, which (1) struggle with background interference in complex scenes, and (2) fail to provide precise object locations, limiting downstream usability. The advancement of class-agnostic counting is hindered by suboptimal model designs and the lack of datasets with bounding box annotations.

View Article and Find Full Text PDF

Image classification plays a pivotal role in analyzing biomedical images, serving as a cornerstone for both biological research and clinical diagnostics. It is demonstrated that large multimodal models (LMMs), like GPT-4, excel in one-shot learning, generalization, interpretability, and text-driven image classification across diverse biomedical tasks. These tasks include the classification of tissues, cell types, cellular states, and disease status.

View Article and Find Full Text PDF

Biovision-Inspired Perovskite Intelligent Camera for Panchromatic and Metameric Sensing.

Adv Mater

August 2025

Guangdong Provincial Key Lab of Nano-Micro Materials Research, School of Advanced Materials, Peking University, Shenzhen, 518055, China.

Biological vision systems excel at acquiring and processing information, but there is often a trade-off between these capabilities. For instance, mantis shrimp possess exceptional spectral sensing but poor color perception due to limited neural processing. Taking the best of both worlds, the mantis shrimp's spectral detection ability and the human-like visual processing power are integrated to achieve full-color perception.

View Article and Find Full Text PDF

In conventional three-dimensional (3D) measurements using lens imaging or holography, the practical sample size in visible light is as large as 50 µm, and the resolution is limited to about 5 µm due to image distortion caused by artifacts. Rigorous calculations of neuronal cell models confirm these limitations in the sample size and resolution for lens imaging. Scatterometry, on the other hand, is a technique to obtain 3D cross-sectional structures by measuring the diffraction patterns of a sample.

View Article and Find Full Text PDF