Hybrid Fine-Tuning Strategy for Few-Shot Classification.

Lei Zhao , Zhonghua Ou , Lixun Zhang , Shuxiao Li

Comput Intell Neurosci

Institute of Automation, Chinese Academy of Sciences, Beijing, China.

Published: October 2022

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Few-shot classification aims to enable the network to acquire the ability of feature extraction and label prediction for the target categories given a few numbers of labeled samples. Current few-shot classification methods focus on the pretraining stage while fine-tuning by experience or not at all. No fine-tuning or insufficient fine-tuning may get low accuracy for the given tasks, while excessive fine-tuning will lead to poor generalization for unseen samples. To solve the above problems, this study proposes a hybrid fine-tuning strategy (HFT), including a few-shot linear discriminant analysis module (FSLDA) and an adaptive fine-tuning module (AFT). FSLDA constructs the optimal linear classification function under the few-shot conditions to initialize the last fully connected layer parameters, which fully excavates the professional knowledge of the given tasks and guarantees the lower bound of the model accuracy. AFT adopts an adaptive fine-tuning termination rule to obtain the optimal training epochs to prevent the model from overfitting. AFT is also built on FSLDA and outputs the final optimum hybrid fine-tuning strategy for a given sample size and layer frozen policy. We conducted extensive experiments on mini-ImageNet and tiered-ImageNet to prove the effectiveness of our proposed method. It achieves consistent performance improvements compared to existing fine-tuning methods under different sample sizes, layer frozen policies, and few-shot classification frameworks.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9569229	PMC
http://dx.doi.org/10.1155/2022/9620755	DOI Listing

Publication Analysis

Top Keywords

few-shot classification

hybrid fine-tuning

fine-tuning strategy

fine-tuning

adaptive fine-tuning

layer frozen

few-shot

classification

strategy few-shot

classification few-shot

Similar Publications

Do large language models learn like humans: Interleaved and spaced practice in morphological learning.

Acta Psychol (Amst)

September 2025

Shanghai Jiao Tong University, China. Electronic address:

Ying Xiong , Shiyu Wu

This study investigates fundamental differences in the acquisition of morphological patterns by humans and large language models (LLMs) within an artificial language learning paradigm. Specifically, it compares how each system responds to variations in input structure-blocked versus interleaved sequences and juxtaposed versus spaced presentation-across verb classification and inflection tasks. While LLMs (GPT4mini, DeepSeek_V3, Llama3.

View Article and Find Full Text PDF

Similar Publications

Few-Shot Class-Incremental Learning with Dynamic Prototype Refinement for Brain Activity Classification.

IEEE J Biomed Health Inform

September 2025

Lei Cao , Hao Li , Yilin Dong , Tianyu Liu , Jie Li

The brain-computer interface (BCI) system facilitates efficient communication and control, with Electroencephalography (EEG) signals as a vital component. Traditional EEG signal classification, based on static deeplearning models, presents a challenge when new classes of the subject's brain activity emerge. The goal is to develop a model that can recognize new few-shot classes while preserving its ability to discriminate between existing ones.

View Article and Find Full Text PDF

Similar Publications

LitAutoScreener: Development and Validation of an Automated Literature Screening Tool in Evidence-Based Medicine Driven by Large Language Models.

Health Data Sci

September 2025

Key Laboratory of Epidemiology of Major Diseases, Ministry of Education/Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China.

Yiming Tao , Xuehu Li , Zuhar Yisha , Sihan Yang , Siyan Zhan

The traditional manual literature screening approach is limited by its time-consuming nature and high labor costs. A pressing issue is how to leverage large language models to enhance the efficiency and quality of evidence-based evaluations of drug efficacy and safety. This study utilized a manually curated reference literature database-comprising vaccine, hypoglycemic agent, and antidepressant evaluation studies-previously developed by our team through conventional systematic review methods.

View Article and Find Full Text PDF

Similar Publications

Analysis of Freezing of Gait in Parkinson's Disease Detection Using a Multimodal Prototype Learning Framework.

IEEE Trans Neural Syst Rehabil Eng

September 2025

Madhuri Thimmapuram , Ananda Rao Akepogu , P Radhika Raju

Patients with severe Parkinson's disease (PD) frequently have freezing of gait (FOG), a gait disability. By anticipating FOG before it occurs, pre-emptive cueing can either prevent FOG or lessen its severity and duration. To improve the accuracy of FOG detection, both electroencephalography (EEG) data and other complementary modalities, such as gait-based data, are increasingly being explored.

View Article and Find Full Text PDF

Similar Publications

Performance of ChatGPT, Gemini and DeepSeek for non-critical triage support using real-world conversations in emergency department.

BMC Emerg Med

September 2025

Department of Emergency Medicine, Korea University Ansan Hospital, Ansan-si, 15355, Republic of Korea.

Sukyo Lee , Sumin Jung , Jong-Hak Park , Hanjin Cho , Sungwoo Moon

Background: Timely and accurate triage is crucial for the emergency department (ED) care. Recently, there has been growing interest in applying large language models (LLMs) to support triage decision-making. However, most existing studies have evaluated these models using simulated scenarios rather than real-world clinical cases.

View Article and Find Full Text PDF

Similar Publications