├── train.py # Full training pipeline ├── inference.py # Style transfer using trained models ├── evaluate.py # Spectrogram and metric visualization ├── models/ │ ├── content_encoder.py # ...
Recent studies have successfully shown how style transfer can be applied on images from one domain to another. In this project we attempt to use this technique to embed emotions in spectrogram images.
Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
An example of a spectrogram, which scientists use to highlight sound sources based on their visual signatures. Credit must be given to the creator. Only noncommercial uses of the work are permitted.