tf.raw_ops.Mfcc
    
    
      
    
    
      
      Stay organized with collections
    
    
      
      Save and categorize content based on your preferences.
    
  
  
      
    
  
  
  
  
  
    
  
  
    
    
Transforms a spectrogram into a form that's useful for speech recognition.
tf.raw_ops.Mfcc(
    spectrogram,
    sample_rate,
    upper_frequency_limit=4000,
    lower_frequency_limit=20,
    filterbank_channel_count=40,
    dct_coefficient_count=13,
    name=None
)
Mel Frequency Cepstral Coefficients are a way of representing audio data that's
been effective as an input feature for machine learning. They are created by
taking the spectrum of a spectrogram (a 'cepstrum'), and discarding some of the
higher frequencies that are less significant to the human ear. They have a long
history in the speech recognition world, and https://en.wikipedia.org/wiki/Mel-frequency_cepstrum
is a good resource to learn more.
| Args | 
|---|
| spectrogram | A Tensorof typefloat32.
Typically produced by the Spectrogram op, with magnitude_squared
set to true. | 
| sample_rate | A Tensorof typeint32.
How many samples per second the source audio used. | 
| upper_frequency_limit | An optional float. Defaults to4000.
The highest frequency to use when calculating the
ceptstrum. | 
| lower_frequency_limit | An optional float. Defaults to20.
The lowest frequency to use when calculating the
ceptstrum. | 
| filterbank_channel_count | An optional int. Defaults to40.
Resolution of the Mel bank used internally. | 
| dct_coefficient_count | An optional int. Defaults to13.
How many output channels to produce per time slice. | 
| name | A name for the operation (optional). | 
| Returns | 
|---|
| A Tensorof typefloat32. | 
  
  
 
  
    
    
      
       
    
    
  
  
  Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.
  Last updated 2023-10-06 UTC.
  
  
  
    
      [null,null,["Last updated 2023-10-06 UTC."],[],[]]