A strongly annotated passive acoustic dataset for tropical bird monitoring

📅 2026-05-19
📈 Citations: 0
Influential: 0
📄 PDF

career value

185K/year
🤖 AI Summary
This study addresses the scarcity of temporally localized bird vocalization data in tropical soundscapes, which hinders the application of supervised learning to biodiversity monitoring. To bridge this gap, the authors introduce and release PteroSet—the first large-scale, strongly annotated acoustic dataset for tropical birds—comprising 563 audio recordings (73.62 hours) collected from two sites in Colombia, with 15,372 time-frequency annotations spanning 168 bird species. Annotations and audio metadata are uniformly structured in COCO-style JSON format. The work characterizes, for the first time, patterns of sound co-occurrence and cross-site domain shifts in tropical soundscapes, provides deep learning baselines, and establishes a realistic evaluation benchmark. Experiments validate the dataset’s utility for avian detection tasks while highlighting the technical challenges posed by complex acoustic environments.
📝 Abstract
Passive acoustic monitoring enables continuous, non-invasive biodiversity assessment across diverse ecosystems. The scale of these datasets has driven the adoption of machine learning, with supervised approaches showing strong performance. However, supervised methods require time-resolved annotated datasets, which remain scarce, especially in complex tropical soundscapes. We present PteroSet, a curated dataset of strongly annotated Neotropical bird vocalizations recorded in Puerto Asis (Putumayo) and Pivijay (Magdalena), Colombia, between 2023 and 2025. The dataset comprises 563 recordings (73.62 h) and 15,372 time-frequency annotations, including 6,702 events identified to the species level across 168 species. We release the annotations in a COCO-inspired JSON schema that unifies audio files, taxonomic categories, and labels for machine learning workflows. Beyond providing annotated data, PteroSet serves as a realistic benchmark that highlights key characteristics of tropical soundscapes, including acoustic co-occurrence and domain shift across recording sites. We provide a deep learning baseline for binary bird detection, demonstrating PteroSet's usability and the challenges it presents.
Problem

Research questions and friction points this paper is trying to address.

passive acoustic monitoring
tropical soundscapes
annotated dataset
bird vocalizations
supervised learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

passive acoustic monitoring
strongly annotated dataset
tropical soundscapes
COCO-inspired JSON schema
domain shift
🔎 Similar Papers
No similar papers found.
Daniela Ruiz
Daniela Ruiz
Universidad de los Andes
Computer visionBiomedical Image AnalysisTranscriptomicsMachine learningDeep learning
J
Juan Sebastián Ulloa
Instituto de Investigación de Recursos Biológicos Alexander von Humboldt, Bogotá, Colombia
Zhongqi Miao
Zhongqi Miao
AI for Good Lab - Microsoft
EcologyComputer VisionDeep Learning
N
Nicolás Betancourt
Instituto de Investigación de Recursos Biológicos Alexander von Humboldt, Bogotá, Colombia
M
Maria Paula Toro-Gómez
Instituto de Investigación de Recursos Biológicos Alexander von Humboldt, Bogotá, Colombia
A
Andrés Hernández
Microsoft AI for Good Research Lab, Redmond, Washington, United States
B
Bruno Demuro
Microsoft AI for Good Research Lab, Redmond, Washington, United States
E
Eliana Barona-Cortés
Instituto de Investigación de Recursos Biológicos Alexander von Humboldt, Bogotá, Colombia
A
Angela Mendoza-Henao
Fundación Manacus, Red Ecoacústica Colombiana, Cali, Colombia
A
Andrés Sierra-Ricaurte
Instituto de Investigación de Recursos Biológicos Alexander von Humboldt, Bogotá, Colombia
S
Sebastián Pérez-Peña
Louisiana State University, Baton Rouge, United States, Museum of Natural Sciences
Rahul Dodhia
Rahul Dodhia
Deputy Director, AI for Good Research Lab, Microsoft
generative aiartificial intelligencestatisticscomputer visiongeospatial imagery
P
Pablo Arbeláez
Universidad de Los Andes, Bogotá, Colombia, Center for Research and Formation in Artificial Intelligence
Juan M. Lavista Ferres
Juan M. Lavista Ferres
Chief Scientist and Lab Director, Microsoft AI for Good Research Lab
Medical ImagingDeep LearningCausalityMachine LearningData Science