Skip to content

Generate ShortTermFeatures for an audio signal of 0.1 seconds of a mono, 16KHz and PCM wave file  #396

Open
@prashant-saxena

Description

@prashant-saxena

Hello,

First of all, thank you for providing a great library.
I would like to process a signal of 0.1 seconds (1600) for short-term features.

F, f_names = ShortTermFeatures.feature_extraction(x[0:1600], Fs, 160, 160, deltas=False)

It's throwing an error. When I try little bigger values

F, f_names = ShortTermFeatures.feature_extraction(x[0:1600], Fs, 200, 200, deltas=False)

Now no errors but there are <= 8 points in features which I believe is quite low for uniqueness. Using python_speech_features, I'm successfully able to generate 20 points but I think the resulting mfcc is not unique in terms of noise.

mfcc(x[0:1600], # The audio signal (N*1 array) from which to compute features.
        16000, # The sample rate of the signal we are working with.
         numcep=NUM_CEP, # The number of cepstrum to return, default 13
         winlen=160/16000, # The length of the analysis window in seconds.
         winstep=160/16000, # The step between successive windows in seconds.
         nfilt=20)

How do you generate features with more points (20-40), and a good amount of noise for a short signal?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions