Few-shot learning (FSL) focuses on distilling transferable knowledge from existing experience to cope with novel concepts for which labeled data are scarce. A typical assumption in FSL is that the training examples of novel classes are all clean, with no outlier interference. In many realistic applications where examples are provided by users, however, data are potentially noisy or unreadable. In this context, we introduce a novel research topic, robust FSL (RFSL), which aims to address two types of outliers within user-provided data: the representation outlier (RO) and the label outlier (LO). Moreover, we introduce a metric for estimating robustness and use it to investigate how several advanced FSL methods perform when faced with user-provided outliers. In addition, we propose robust attentive profile networks (RapNets) to achieve outlier suppression. The results of a comprehensive evaluation on benchmark data sets demonstrate the shortcomings of current FSL methods and the superiority of the proposed RapNets when dealing with RFSL problems, establishing a benchmark for follow-up studies.
DOI: http://dx.doi.org/10.1109/TNNLS.2020.2984710
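The abstract does not spell out how RapNets suppress outliers; as a rough illustration of the general idea behind attention-based outlier suppression in metric-based FSL, the sketch below builds a class prototype that down-weights support examples dissimilar to the rest of their class. All names and the temperature value are assumptions for illustration, not details from the paper.

```python
import torch
import torch.nn.functional as F

def attentive_prototype(support_emb: torch.Tensor) -> torch.Tensor:
    """Compute an outlier-suppressing class prototype from support embeddings.

    support_emb: (k, d) embeddings of the k support examples of one class (k >= 2).
    Each example is weighted by its mean similarity to the other examples, so a
    representation outlier that sits far from the rest receives a small weight.
    (Hypothetical sketch; not the RapNet architecture from the paper.)
    """
    # Pairwise cosine similarities between all support examples: (k, k).
    sim = F.cosine_similarity(support_emb.unsqueeze(1), support_emb.unsqueeze(0), dim=-1)
    # Mean similarity of each example to the others (drop the self-similarity of 1.0).
    score = (sim.sum(dim=1) - 1.0) / (support_emb.size(0) - 1)
    # Softmax with an arbitrary temperature of 0.1 sharpens the weighting.
    weight = torch.softmax(score / 0.1, dim=0)
    # Weighted average of the support embeddings: (d,).
    return (weight.unsqueeze(1) * support_emb).sum(dim=0)
```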
Acta Psychol (Amst)
September 2025
Shanghai Jiao Tong University, China.
This study investigates fundamental differences in the acquisition of morphological patterns by humans and large language models (LLMs) within an artificial language learning paradigm. Specifically, it compares how each system responds to variations in input structure (blocked versus interleaved sequences, and juxtaposed versus spaced presentation) across verb classification and inflection tasks. While LLMs (GPT4mini, DeepSeek_V3, Llama3.
IEEE J Biomed Health Inform
September 2025
The brain-computer interface (BCI) system facilitates efficient communication and control, with electroencephalography (EEG) signals as a vital component. Traditional EEG signal classification, based on static deep learning models, presents a challenge when new classes of the subject's brain activity emerge. The goal is to develop a model that can recognize new few-shot classes while preserving its ability to discriminate between existing ones.
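The abstract describes few-shot class-incremental recognition only at a high level; below is a minimal sketch of one common realization (a frozen encoder plus a nearest-prototype classifier, so adding a new class never disturbs the prototypes of existing classes). The class and method names are hypothetical, not from the paper.

```python
import torch

class PrototypeClassifier:
    """Nearest-prototype classifier that can absorb new few-shot classes.

    Assumes a frozen encoder has already mapped EEG trials to embeddings;
    each class is stored as a single prototype vector, so registering a new
    class is a constant-time update with no retraining of old classes.
    """

    def __init__(self) -> None:
        self.prototypes: dict[str, torch.Tensor] = {}

    def add_class(self, name: str, support_emb: torch.Tensor) -> None:
        # support_emb: (k, d) embeddings of the k labeled examples of the new class.
        self.prototypes[name] = support_emb.mean(dim=0)

    def predict(self, query_emb: torch.Tensor) -> str:
        # Return the class whose prototype is closest to the query embedding.
        return min(self.prototypes,
                   key=lambda c: torch.dist(query_emb, self.prototypes[c]).item())
```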
Biomedical named entity recognition (NER) is a high-utility natural language processing (NLP) task, and large language models (LLMs) show promise, particularly in few-shot settings (i.e., with limited training data).
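As a concrete illustration of what few-shot prompting for biomedical NER can look like (not taken from the cited work), a prompt might embed a handful of labeled sentences before the query; the example sentence and entity types below are invented.

```python
# Hypothetical few-shot prompt for biomedical NER with an LLM; the
# demonstration sentence and entity tags are illustrative only.
FEW_SHOT_PROMPT = """Extract all Disease and Drug entities from the sentence.

Sentence: Metformin reduced HbA1c in patients with type 2 diabetes.
Entities: Drug: metformin; Disease: type 2 diabetes

Sentence: {sentence}
Entities:"""

def build_prompt(sentence: str) -> str:
    # Fill the query sentence into the few-shot template.
    return FEW_SHOT_PROMPT.format(sentence=sentence)
```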
Health Data Sci
September 2025
Key Laboratory of Epidemiology of Major Diseases, Ministry of Education/Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China.
The traditional manual literature screening approach is limited by its time-consuming nature and high labor costs. A pressing issue is how to leverage large language models to enhance the efficiency and quality of evidence-based evaluations of drug efficacy and safety. This study utilized a manually curated reference literature database (comprising vaccine, hypoglycemic agent, and antidepressant evaluation studies) previously developed by our team through conventional systematic review methods.
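As a hedged illustration of the kind of LLM-assisted title/abstract screening this study evaluates, a screening prompt could state the inclusion criteria and ask for a binary decision; the criteria and field names below are invented, not from the study.

```python
# Hypothetical inclusion/exclusion screening prompt; criteria are illustrative.
SCREEN_PROMPT = """You are screening literature for a systematic review of drug
efficacy and safety. Decide whether the record below should be included.

Criteria: randomized controlled trial; evaluates the efficacy or safety of
{drug_class}; human participants.

Title: {title}
Abstract: {abstract}

Answer INCLUDE or EXCLUDE, then give a one-sentence reason."""

def build_screen_prompt(drug_class: str, title: str, abstract: str) -> str:
    # Assemble one screening prompt per candidate record.
    return SCREEN_PROMPT.format(drug_class=drug_class, title=title, abstract=abstract)
```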
View Article and Find Full Text PDFStud Health Technol Inform
September 2025
Chair of Medical Informatics, Institute of AI and Informatics in Medicine (AIIM), TUM University Hospital, Technical University of Munich, Munich, Germany.
Introduction: Medical entity linking is an important task in biomedical natural language processing, aiming to align textual mentions of medical concepts with standardized concepts in ontologies. Most existing approaches rely on supervised models or domain-specific embeddings, which require large datasets and significant computational resources.
Objective: The objective of this work is (1) to investigate the effectiveness of large language models (LLMs) in improving both candidate generation and disambiguation for medical entity linking through synonym expansion and in-context learning, and (2) to evaluate this approach against traditional string-matching and supervised methods.
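A minimal sketch of the two-stage approach the objective describes: LLM-generated synonyms widen string-match candidate retrieval, then an in-context prompt asks the LLM to disambiguate among candidates. Here `ontology` maps concept IDs to concept names, and all identifiers are assumptions rather than the paper's implementation.

```python
from difflib import SequenceMatcher

def generate_candidates(mention: str, synonyms: list[str],
                        ontology: dict[str, str], top_k: int = 5) -> list[str]:
    """Rank ontology concepts by best string similarity to the mention or any synonym."""
    def score(concept_name: str) -> float:
        return max(SequenceMatcher(None, q.lower(), concept_name.lower()).ratio()
                   for q in [mention, *synonyms])
    ranked = sorted(ontology, key=lambda cid: score(ontology[cid]), reverse=True)
    return ranked[:top_k]

def disambiguation_prompt(mention: str, context: str, candidates: dict[str, str]) -> str:
    """Build an in-context prompt asking the LLM to pick the best candidate concept."""
    options = "\n".join(f"{cid}: {name}" for cid, name in candidates.items())
    return (f"Mention: {mention}\nContext: {context}\n"
            f"Candidate concepts:\n{options}\n"
            "Answer with the single best concept ID.")

# Usage sketch: retrieve top candidates, then hand them to the LLM for a final pick.
# ids = generate_candidates("MI", ["myocardial infarction"], ontology)
# prompt = disambiguation_prompt("MI", "patient admitted with acute MI",
#                                {cid: ontology[cid] for cid in ids})
```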