Article Abstract

Advances in computer vision and increasingly widespread video-based behavioral monitoring are currently transforming how we study animal behavior. However, there is still a gap between the prospects and practical application, especially in videos from the wild. In this Perspective, we aim to present the capabilities of current methods for behavioral analysis, while at the same time highlighting unsolved computer vision problems that are relevant to the study of animal behavior. We survey state-of-the-art methods for computer vision problems relevant to the video-based study of individualized animal behavior, including object detection, multi-animal tracking, individual identification and (inter)action understanding. We then review methods for effort-efficient learning, one of the challenges from a practical perspective. In our outlook on the emerging field of computer vision for animal behavior, we argue that the field should develop approaches to unify detection, tracking, identification and (inter)action understanding in a single, video-based framework.

Source
http://dx.doi.org/10.1038/s41592-025-02653-y
