Extraction of fixed windows for LLD #27

giorgiolbt · 2021-04-28T10:56:06Z

Hi,

I was wondering if it is possible to have a total number of windows that is fixed even when extracting LLD. At the moment, for each audio, I obtain a variable number of vectors of features that depends on the length of the audio since the window size is fixed. I would need to have for instance 200 rows for each audio independently from the audio's duration.

Thanks in advance!
Giorgio

frankenjoe · 2021-04-28T11:08:10Z

No, that is not possible.

bagustris · 2021-07-21T06:31:21Z

Use zero paddings. That's the common step in speech processing.
Using keras, it only needs one line to make all utterances have the same row size.

Reference:
https://www.tensorflow.org/api_docs/python/tf/keras/preprocessing/sequence/pad_sequences

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extraction of fixed windows for LLD #27

Extraction of fixed windows for LLD #27

giorgiolbt commented Apr 28, 2021 •

edited

Loading

frankenjoe commented Apr 28, 2021

bagustris commented Jul 21, 2021

Extraction of fixed windows for LLD #27

Extraction of fixed windows for LLD #27

Comments

giorgiolbt commented Apr 28, 2021 • edited Loading

frankenjoe commented Apr 28, 2021

bagustris commented Jul 21, 2021

giorgiolbt commented Apr 28, 2021 •

edited

Loading