SCSFish2025: a large dataset from South China sea for coral reef fish identification.

Sci Rep

School of Computer Science and Engineering, Guangzhou Higher Education Mega Center, Sun Yat-Sen University, No. 132 Waihuandong Road, Guangzhou, 510006, China.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Coral reefs are one of the most biodiverse ecosystems on Earth and are extremely important for marine ecosystems. However, coral reefs are rapidly degrading globally, and for this reason, in-situ online monitoring systems are being used to monitor coral reef ecosystems in real time. At the same time, artificial intelligence technology, particularly deep learning technology, is playing an increasingly important role in the study of coral reef ecology, especially in the automatic detection and identification of coral reef fish. However, deep learning is essentially a data-driven technique that relies on high-quality datasets for training, while existing fish identification datasets suffer from low resolution and inaccurate labeling, which limits the application of deep learning techniques to coral reef fish identification. To better utilize deep learning techniques for real-time automatic detection and identification of coral reef fish from the data collected by the in-situ online monitoring system, this paper proposes a high-resolution, fish species-rich, and well-labeled coral reef fish dataset SCSFish2025, which is the first publicly available coral reef fish dataset in the waters of China's Nansha Islands. SCSFish2025 contains 11,956 high-resolution underwater surveillance images and over 120,000 bounding boxes covering 30 species of fish that have been manually labelled by experienced fish identification experts, with sub-category labels for blurring, occlusion, and altered pose. Furthermore, this paper establishes a benchmark for the dataset by analyzing the detection performance of deep learning object detection techniques on this dataset using four state-of-the-art or typical object detection models as baseline models. The best baseline model RT-DETRv2 achieves mAP@50 performance of 0.9960 and 0.7486 respectively on the five-fold cross-validation of the training set and the independent test set. The release of this dataset will help promote the development of AI technology in the study of automatic detection and identification of coral reef fish, and provide strong support for the study of marine biodiversity and ecosystems. The project code and dataset are available at https://github.com/FudanZhengSYSU/SCSFish2025 .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12358537PMC
http://dx.doi.org/10.1038/s41598-025-14785-4DOI Listing

Publication Analysis

Top Keywords

coral reef
36
reef fish
28
deep learning
20
fish identification
16
identification coral
16
automatic detection
12
detection identification
12
coral
11
fish
11
reef
9

Similar Publications

Boat noise alters behaviour of two coral reef macroinvertebrates, Lambis lambis and Tridacna maxima.

Mar Pollut Bull

September 2025

Marine Science Program, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.

Boat noise has been shown to distract and cause harm to many marine organisms. Most of the study effort has focused on fish & marine mammals, even though invertebrates represent over 92 % of all marine life. The few studies conducted on invertebrates have demonstrated clear negative effects of anthropogenic noise pollution.

View Article and Find Full Text PDF

Background: The damselfishes, an extremely diverse group of herbivorous fish, stands out as an important and ubiquitous ecological component of coral reefs. In the Western South Atlantic, the genus Stegastes is the most representative, whose evolutionary paths and taxonomic status of insular endemic species have been better evaluated. To clarify the karyotypic evolution involved in the diversification of this group, cytogenetic analyses were performed in four nominal species (S.

View Article and Find Full Text PDF

Above-ground biomass contributes a large proportion of mangrove carbon stock; however, spatio-temporal dynamics of biomass are poorly understood in carbonate settings of the Southern Hemisphere. This influences the capacity to accurately project the effects of accelerating sea-level rise on this important carbon store. Here, above-ground biomass and productivity dynamics were quantified across mangrove age zones dominated by , spanning a tidal gradient atop a reef platform at Low Isles, Great Barrier Reef, Australia.

View Article and Find Full Text PDF

Many cnidarian animals possess multiple opsins, including a type known as cnidopsin, which is found throughout the phylum Cnidaria and is divided into several subgroups. Previous studies have suggested that cnidopsins from jellyfish and coral can light-dependently elevate intracellular cAMP levels, likely via activation of Gs-type G protein in cultured cells. However, their spectroscopic properties remain largely unclear, with the exception of jellyfish opsins.

View Article and Find Full Text PDF

Anemonefish have a characteristic vertical white barred color pattern on an orange background made by a specific distribution of three types of pigment cells: melanophores, xanthophores, and iridophores. This color pattern is an interesting alternative model to zebrafish to understand the cellular and molecular basis of complex color pattern formation. Using transmission electron microscopic observations, we have investigated the pigment cell composition in the skin of the anemonefish and found that: 1) white skin comprises iridophores and isolated melanophores; 2) orange skin contains xanthophores and scattered melanophores; and 3) black skin encompasses melanophores only.

View Article and Find Full Text PDF