アクセントdb

説明:

AccentDB は、構造化され、ラベル付けされたアクセント付き音声のマルチペアワイズ並列コーパスです。これには、英語の 4 つの非ネイティブアクセント (8 人のスピーカー、4 つのインド言語) のスピーカーからの音声サンプルが含まれています。また、英語の 4 つのネイティブアクセント (4 か国、13 スピーカー) とメトロポリタンインドアクセント (2 スピーカー) の編集物もあります。ここで利用可能なデータセットは、 https: //accentdb.github.io/#dataset の Accentdb_extended というタイトルのリリースに対応しています

追加のドキュメント:コードを使用したペーパーの探索
ホームページ: https://accentdb.github.io/
ソースコード: tfds.datasets.accentdb.Builder
バージョン:
- 1.0.0 (デフォルト): リリースノートはありません。
ダウンロードサイズ: 3.56 GiB
データセットサイズ: 19.47 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	17,313

機能構造:

FeaturesDict({
    'audio': Audio(shape=(None,), dtype=int64),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=9),
    'speaker_id': string,
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
オーディオ	オーディオ	（なし、）	int64
ラベル	クラスラベル		int64
Speaker_id	テンソル		ストリング

監視されたキー( as_supervised docを参照): ('audio', 'label')
図( tfds.show_examples ): サポートされていません。
例( tfds.as_dataframe ):

引用：

@InProceedings{ahamad-anand-bhargava:2020:LREC,
  author    = {Ahamad, Afroz  and  Anand, Ankit  and  Bhargava, Pranesh},
  title     = {AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition},
  booktitle      = {Proceedings of The 12th Language Resources and Evaluation Conference},
  month          = {May},
  year           = {2020},
  address        = {Marseille, France},
  publisher      = {European Language Resources Association},
  pages     = {5353--5360},
  url       = {https://www.aclweb.org/anthology/2020.lrec-1.659}
}

アクセントdb コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

アクセントdb