参考文献:
全て
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/all')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 67072 |
'train' | 1207222 |
'validation' | 67068 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
ある
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/a')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 9675 |
'train' | 174134 |
'validation' | 9674 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
b
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/b')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 8974 |
'train' | 161520 |
'validation' | 8973 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
c
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/c')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 5614 |
'train' | 101042 |
'validation' | 5613 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
d
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/d')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 565 |
'train' | 10164 |
'validation' | 565 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
e
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/e')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 1914年 |
'train' | 34443 |
'validation' | 1914年 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
f
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/f')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 4754 |
'train' | 85568 |
'validation' | 4754 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
g
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/g')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 14386 |
'train' | 258935 |
'validation' | 14385 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
h
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/h')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 14279 |
'train' | 257019 |
'validation' | 14279 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
y
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:big_patent/y')
- 説明:
BIGPATENT, consisting of 1.3 million records of U.S. patent documents
along with human written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lightning; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology)
There are two features:
- description: detailed description of patent.
- abstract: Patent abastract.
- ライセンス: クリエイティブ・コモンズ 表示 4.0 インターナショナル
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 6911 |
'train' | 124397 |
'validation' | 6911 |
- 特徴:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}