Skip to content

jramassa/ListenToTheWild

Repository files navigation

Listen To The Wild 🐦

About πŸ’

The analysis of massive datasets of soundscapes presents a challenge to ecologists. This project aims to develop a method for predicting the naturalness of ecosystems and their richness based on acoustic indices. To achieve this, we will combine multiple acoustic indices, employ unsupervised and supervised classification methods, and use regression to predict the naturalness of a territory. This project will highlight the relationships between acoustic indices and environmental characteristics, with the goal of developing tools for biodiversity and environmental monitoring.


Ressources πŸ›„

Here are some useful resources related to this project:

> Related Work : This approach summarizes the attempts that are being undertaken, taking into account related works and builds upon a comprehensive literature review. It seeks to contribute to the existing body of knowledge by addressing the specific challenges posed by the analysis of massive datasets of soundscapes in ecological research.

We hope these resources will be helpful to those interested in learning more about our project.

Technologies πŸ§‘β€πŸ’»


Repository πŸ›…

This repository offers an extensive collection of code and data designed to address various project tasks. It includes classifiers, regressors. The following section delineates the intended purposes of the different files within the repository:


How to start ? 🚨

Kindly be aware that the code has been crafted with maximum flexibility in mind. Nevertheless, there's a possibility that you may need to customize it to suit your particular use case and circumstances.


Requirement πŸ”‘

All notebooks have been developed for use in Google Colab, and Python files are also coded to complement these notebooks. You can effortlessly employ them in this environment, ensuring smooth execution of the provided code.


Usage πŸ”§

> Manipulating Data

In order to do the different classification and regresssion tasks, we had used pandas and os library, creating csv files based on the data that we have.

You can explore example outputs in the following files: concatenated_acoustic_indices.csv (resulting from the concatenation of scikit-maad acoustic indices CSV files from all sites, including a new "Year" column for 2022 and 2023) or concatenated_VGGish_acoustic_indices.csv (resulting from the concatenation of VGGish acoustic indices CSV files from all sites, also with a new "Year" column for 2022 and 2023). These files are particularly valuable for predicting longitude and comparing the two approaches, VGGish versus scikit-maad.


> Soundscape Analysis

In the classification process, we conducted a comprehensive analysis involving four classes (bleu estive, bleu lisiere, rouge estive, and rouge lisiere) utilizing unsupervised techniques such as PCA, tSNE, and uMAP.

Our unsupervised analysis visualization was twofold – first, based on the Transect feature, a combination of Commune, LID, and Dynamique, and second, based on the temporal aspect (time).

To gauge the significance of features, we employed Random Forest, MultiLayer Perceptron (MLP), and Gradient Boosting.

For the analysis of habitat_composition_by_station.csv and naturalness_index_by_site.csv, we employed PCA, tSNE, and uMAP, generating visualizations on a map of the southern region of France using the Cartopy library in Python.

Similarly, we conducted visualizations after applying PCA, tSNE, and uMAP to the concatenated file of scikit-maad acoustic indices from all sites, as well as another visualization for VGGish acoustic indices. Both were mapped onto the southern region of France for spatial representation.


> Biodiversty Data Regression

We conducted a regression analysis of naturalness metrics, including LONGITUDE, LATITUDE, SHANNON_LANDSCAPE_DIVERSITY, etc., utilizing scikit-maad acoustic indices. A parallel regression was performed using VGGish acoustic indices to facilitate a comparative analysis of the two approaches. This analysis was conducted employing the scikit-learn library, utilizing algorithms such as Random Forest, MultiLayer Perceptron (MLP), and XGBoost. Cross-validation techniques were applied for data validation, incorporating both temporal and spatial segregation. Hyperparameter tuning was performed to enhance the model's performance.

Following feature selection, we proceeded with a regression analysis of one acoustic index based on the other acoustic indices.

Contact πŸ“©

  • yasmine [dot] charifi [at] etu [dot] toulouse-inp [dot] com
  • bryan [dot] chen [at] etu [dot] toulouse-inp [dot] com
  • jonas [dot] lavaur [at] etu [dot] toulouse-inp [dot] com
  • kawtar [dot] lyamoudi [at] etu [dot] toulouse-inp [dot] com
  • jordan [dot] ramassamymoutoussamy [at] etu [dot] toulouse-inp [dot] com

Acknowledgments πŸ€—

A big thanks to our supervisors:

  • Maxime Cauchoix, Research Scientist, SETE, LEFE, CNRS.
  • Axel Carlier, Assistant Professor, Toulouse INP, IRIT.

And the last year group members of ENSEEIHT, Toulouse INP:

  • Amar Meddahi
  • Edgar Remy
  • Fabio Pereira de Araujo
  • Linda Hammami
  • Sarah Chougar
  • Younes Boutiyarzist

Listen To The Wild communicates with and/or references the following:

We express our gratitude to all their contributors and maintainers!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 5