Sunday, 3 July 2016

voice - Quantifying the energy of a voiced frame


What should the energy, pitch frequency, and zero crossing rate of a voice/unvoiced/silence frame in signal be?



My professor said that it is experimental, but I do not have any idea what the factors of these are and where I should begin from.


I would appreciate any guesses based on experimental data or any limits on the quantities.


I've tried a zero crossing rate between 2000 and 94000, energy between 0.0001 and 0.7, and pitch frequency between 180 and 500.




No comments:

Post a Comment

readings - Appending 内 to a company name is read ない or うち?

For example, if I say マイクロソフト内のパートナーシップは強いです, is the 内 here read as うち or ない? Answer 「内」 in the form: 「Proper Noun + 内」 is always read 「ない...