Article Abstract

The article describes the production steps and accuracy assessment of an analysis-ready, open-access European data cube consisting of 2000-2020+ Landsat data, 2017-2021+ Sentinel-2 data, and a 30 m resolution digital terrain model (DTM). The main purpose of the data cube is to make annual continental-scale spatiotemporal machine learning tasks accessible to a wider user base by providing a spatially and temporally consistent multidimensional feature space. This required systematic spatiotemporal harmonization, efficient compression, and imputation of missing values. Sentinel-2 and Landsat reflectance values were aggregated into four quarterly averages approximating the four seasons common in Europe (winter, spring, summer, and autumn), together with the 25th and 75th percentiles in order to retain intra-seasonal variance. Remaining missing data in the Landsat time series were imputed with a temporal moving window median (TMWM) approach. An accuracy assessment shows that TMWM performs better in Southern Europe and worse in mountainous regions such as the Scandinavian Mountains, the Alps, and the Pyrenees. We quantify the usability of the different component data sets for spatiotemporal machine learning tasks with a series of land cover classification experiments, which show that models utilizing the full feature space (30 m DTM, 30 m Landsat, 30 m and 10 m Sentinel-2) yield the highest land cover classification accuracy, with different data sets improving the results for different land cover classes. The data sets presented in the article are part of the EcoDataCube platform, which also hosts open vegetation, soil, and land use/land cover (LULC) maps. All data sets are available under a CC-BY license as Cloud-Optimized GeoTIFFs (ca. 12 TB in size) through a SpatioTemporal Asset Catalog (STAC) and the EcoDataCube data portal.
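The two processing steps named in the abstract, quarterly aggregation and temporal moving window median imputation, can be illustrated for a single pixel with a small sketch. This is not the authors' implementation: the function names, the `max_half_window` parameter, and the rule of widening the window one step at a time until a valid observation is found are assumptions made for illustration, and the published TMWM procedure may select its windows differently.

```python
import math
from statistics import mean, median


def percentile(sorted_vals, p):
    """Linearly interpolated percentile (0-100) of an already sorted list."""
    pos = (len(sorted_vals) - 1) * p / 100.0
    lo, hi = math.floor(pos), math.ceil(pos)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def quarterly_stats(values, quarters):
    """Summarize one pixel's observations per quarter as (p25, mean, p75).

    values   -- reflectances, NaN where an observation is missing
    quarters -- quarter index (0-3) of each observation
    Quarters without any valid observation stay NaN.
    """
    stats = []
    for q in range(4):
        obs = sorted(v for v, k in zip(values, quarters)
                     if k == q and not math.isnan(v))
        if obs:
            stats.append((percentile(obs, 25), mean(obs), percentile(obs, 75)))
        else:
            stats.append((math.nan, math.nan, math.nan))
    return stats


def tmwm_impute(series, max_half_window=3):
    """Fill NaN gaps with the median of valid neighbours in time.

    Assumed rule (hypothetical): widen a symmetric window around the gap
    one step at a time and stop at the first window holding valid data.
    Only original observations are used, never previously filled values.
    Gaps with no valid neighbour within max_half_window remain NaN.
    """
    filled = list(series)
    for i, value in enumerate(series):
        if not math.isnan(value):
            continue
        for w in range(1, max_half_window + 1):
            window = series[max(0, i - w): i + w + 1]
            valid = [v for v in window if not math.isnan(v)]
            if valid:
                filled[i] = median(valid)
                break
    return filled


# Example: a short seasonal series with gaps (values are made up).
series = [0.10, math.nan, 0.30, math.nan, math.nan]
print(tmwm_impute(series, max_half_window=2))
```

Keeping unfillable gaps as NaN, rather than forcing a value, makes it visible where the window was too narrow to find any valid observation.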

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10252825
DOI: http://dx.doi.org/10.7717/peerj.15478

Publication Analysis

Top Keywords

data sets: 16
data cube: 12
land cover: 12
data: 11
accuracy assessment: 8
spatiotemporal machine: 8
machine learning: 8
learning tasks: 8
feature space: 8
cover classification: 8

Similar Publications

The calculation of the highest occupied molecular orbital-lowest unoccupied molecular orbital (HOMO-LUMO) gap for chemical molecules is computationally intensive using quantum mechanics (QM) methods, while experimental determination is often costly and time-consuming. Machine Learning (ML) offers a cost-effective and rapid alternative, enabling efficient predictions of HOMO-LUMO gap values across large data sets without the need for extensive QM computations or experiments. ML models facilitate the screening of diverse molecules, providing valuable insights into complex chemical spaces and integrating seamlessly into high-throughput workflows to prioritize candidates for experimental validation.

The huge volcanic eruption at Thera (Santorini), situated in the Aegean Sea, occurred within the Late Minoan IA archaeological period. However, its temporal association with Egyptian history has long been a controversial subject. Traditionally, the eruption was placed in the early 18th Dynasty, associated with Pharaoh Thutmose III as the youngest option or with Pharaoh Nebpehtire Ahmose as the oldest possibility.

Identification and prioritization of gene sets associated with schizophrenia risk by network analysis.

Psychopharmacology (Berl)

September 2025

Institute of Cardiovascular Research, Sleep Medical Center, Department of Psychiatry, Fundamental and Clinical Research on Mental Disorders Key Laboratory of Luzhou, Affiliated Hospital, Southwest Medical University, Luzhou, Sichuan Province, 646000, China.

Rationale: Genome-wide association studies (GWASs) are used to identify genetic variants for association with schizophrenia (SCZ) risk; however, each GWAS can only reveal a small fraction of this association.

Objectives: This study systematically analyzed multiple GWAS data sets to identify gene subnetworks and pathways associated with SCZ.

Methods: We identified a gene subnetwork using the dmGWAS program by combining SCZ GWASs with a human interaction network, performed gene-set analysis to test the association of the subnetwork with clinical symptom scores and disease state, and analyzed the spatiotemporal, tissue-specific, and cell-type-specific expression patterns of genes in the subnetwork.

Nisin-like biosynthetic gene clusters are widely distributed across microbiomes.

mBio

September 2025

APC Microbiome Ireland, Biosciences Institute, Biosciences Research Institute, University College, Cork, Ireland.

Bacteriocins are antimicrobial peptides/proteins that can have narrow or broad inhibitory spectra and remarkable potency against clinically relevant pathogens. One such bacteriocin that is extensively used in the food industry and with potential for biotherapeutic application is the post-translationally modified peptide, nisin. Recent studies have shown the impact of nisin on the gastrointestinal microbiome, but relatively little is known of how abundant nisin production is in nature, the breadth of existing variants, and their antimicrobial potency.

Development of Coarse-Grained Lipid Force Fields Based on a Graph Neural Network.

J Chem Theory Comput

September 2025

Department of Materials Science and Engineering, City University of Hong Kong, Kowloon 999077, Hong Kong, China.

Coarse-grained (CG) lipid models enable efficient simulations of large-scale membrane events. However, achieving both speed and atomic-level accuracy remains challenging. Graph neural networks (GNNs) trained on all-atom (AA) simulations can serve as CG force fields, which have demonstrated success in CG simulations of proteins.
