Models & Researchbioacousticsdeep audiomarine conservationai research

Study applies deep audio embeddings to right whales

|June 30, 2026|By LDS Team

5.2

Relevance Score

Study applies deep audio embeddings to right whales

For ML practitioners in audio and bioacoustics: a bioRxiv preprint (July 2025) from Cornell Lab of Ornithology, Oregon State, and Syracuse demonstrates that BirdNET audio embeddings robustly distinguish individual North Atlantic right whales from 234 vocalization samples across 11 individuals and 3 sites. The study addresses three tasks - detection, communication, and individual identification - and proposes a framework for quantifying how anthropogenic noise compresses communication space in data-limited species. The transfer-learning result suggests general audio foundation models may be applicable to marine mammal individual ID without domain-specific retraining. This is a preprint; results are preliminary.

Why this matters for audio ML practitioners

Most bioacoustic monitoring research equates signal detection with signal communication - a persistent conflation the authors argue leads to systematically overestimated communication ranges and underestimated anthropogenic noise impacts. This preprint proposes and demonstrates that deep audio embeddings from general-purpose models (specifically BirdNET, trained primarily on bird calls) transfer to marine mammal individual identification well enough to distinguish critically endangered North Atlantic right whales from on-animal tag recordings. For practitioners, this is a transfer-learning result with practical implications: large audio foundation models trained on broad corpora may be deployable for individual ID in data-limited species without domain-specific retraining.

The study Authors Irina Tolkova, Holger Klinck, Dana A. Cusano, Anke Kugler, and Susan E. Parks - affiliated with Cornell Lab of Ornithology's K. Lisa Yang Center for Conservation Bioacoustics, Oregon State University's Marine Mammal Institute, and Syracuse University's Department of Biology - published this preprint on bioRxiv (July 16, 2025, doi:10.1101/2025.07.11.664307). The work focuses on the critically endangered North Atlantic right whale (Eubalaena glacialis, NARW), a species with fewer than 370 individuals remaining.

The dataset consists of 234 NARW vocalization samples recorded with on-animal archival tags, drawn from 11 individuals across 3 sites. The study targets three tasks using the NARW upcall (a low-frequency contact call produced across ages and sexes, previously shown to encode individual identity): detection, communication analysis, and individual identification. Key results: BirdNET embeddings robustly distinguish individual right whales, and simulated noise analyses estimate signal excess (signal-to-noise surplus) separately for detection and individual ID tasks.

The communication space distinction The conceptual contribution is separating signal detection from signal communication. Prior studies, the authors argue, treated detection as sufficient for communication, thereby overestimating the spatial area in which individuals can exchange information. Deep learning embeddings operating on the full audio signal (not just detection thresholds) can, in principle, estimate whether a received signal is recognizable as a specific individual - a biologically more relevant measure of effective communication range under noise.

What to watch

•Whether BirdNET embeddings generalize to other cetacean species or data-limited taxa beyond NARW.
•Peer review status: this is a preprint; results are preliminary pending journal review.
•Follow-on work using measured rather than simulated ambient noise conditions for validation.

Key Points

1What: Cornell/Oregon State/Syracuse researchers demonstrate BirdNET audio embeddings can robustly identify individual right whales from 234 on-animal tag recordings across 11 individuals and 3 sites (bioRxiv, July 2025).
2Why: Separating signal detection from signal communication - prior studies conflated the two, overestimating communication range and underestimating noise impacts on critically endangered species.
3So what: General audio foundation models may transfer to individual ID for data-limited marine species, reducing the need for species-specific model training in conservation monitoring.

Scoring Rationale

A methodologically solid domain-specific ML preprint demonstrating BirdNET transfer to marine mammal bioacoustics, with a clean conceptual advance in separating detection from communication. Scoring reflects a well-executed preprint on a conservation application with modest but genuine transfer-learning implications; scored below 5.5 given preprint status and narrow specialized scope.

MoreAI Research news

Sources

Primary source and supporting public references used for this report.

2 sources

Primary sourcejournals.plos.orgDetection, communication, and individual identification with deep audio embeddings: A case study with North Atlantic right whales

View 1 more source

Detection, communication, and individual identification with deep audio embeddings: A case study with North Atlantic right whalebiorxiv.org

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

Why this matters for audio ML practitioners

What to watch

•Whether BirdNET embeddings generalize to other cetacean species or data-limited taxa beyond NARW.
•Peer review status: this is a preprint; results are preliminary pending journal review.
•Follow-on work using measured rather than simulated ambient noise conditions for validation.

Key Points

1What: Cornell/Oregon State/Syracuse researchers demonstrate BirdNET audio embeddings can robustly identify individual right whales from 234 on-animal tag recordings across 11 individuals and 3 sites (bioRxiv, July 2025).

2Why: Separating signal detection from signal communication - prior studies conflated the two, overestimating communication range and underestimating noise impacts on critically endangered species.

3So what: General audio foundation models may transfer to individual ID for data-limited marine species, reducing the need for species-specific model training in conservation monitoring.

Scoring Rationale

Study applies deep audio embeddings to right whales

Why this matters for audio ML practitioners

What to watch

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Microsoft and Mistral expand European AI partnership with multibillion-dollar compute deal

xAI Sues Grok User Over Alleged Child Sexual Abuse Images

Nativ Launches Open-Source MLX App for Local Mac Models

OpenAI Paused Long-Horizon Model After Sandbox Bypass

Study applies deep audio embeddings to right whales

Why this matters for audio ML practitioners

What to watch

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Microsoft and Mistral expand European AI partnership with multibillion-dollar compute deal

xAI Sues Grok User Over Alleged Child Sexual Abuse Images

Nativ Launches Open-Source MLX App for Local Mac Models

OpenAI Paused Long-Horizon Model After Sandbox Bypass