multi_eurlex

Referencje:

pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/en')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

da

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/da')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

de

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/de')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

nl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/nl')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

sw

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/sv')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 42490
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

bg

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/bg')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 15986
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

cs

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/cs')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 23187
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

godz

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/hr')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 7944
'validation' 2500
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/pl')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 23197
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

sk

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/sk')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 22971
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/sl')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 23184
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

es

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/es')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 52785
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

ks

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/fr')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

To

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/it')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

pkt

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/pt')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 52370
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

ro

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/ro')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 15921
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

i.t

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/et')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 23126
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

fi

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/fi')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 42497
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

hu

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/hu')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 22664
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

Por

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/lt')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 23188
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

poz

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/lv')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 23208
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

el

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/el')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

mt

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/mt')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 17521
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

wszystkie_języki

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:multi_eurlex/all_languages')
  • Opis :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
  • Licencja : Brak znanej licencji
  • Wersja : 1.0.0
  • Podziały :
Podział Przykłady
'test' 5000
'train' 55000
'validation' 5000
  • Cechy :
{
    "celex_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "text": {
        "languages": [
            "en",
            "da",
            "de",
            "nl",
            "sv",
            "bg",
            "cs",
            "hr",
            "pl",
            "sk",
            "sl",
            "es",
            "fr",
            "it",
            "pt",
            "ro",
            "et",
            "fi",
            "hu",
            "lt",
            "lv",
            "el",
            "mt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "labels": {
        "feature": {
            "num_classes": 21,
            "names": [
                "100149",
                "100160",
                "100148",
                "100147",
                "100152",
                "100143",
                "100156",
                "100158",
                "100154",
                "100153",
                "100142",
                "100145",
                "100150",
                "100162",
                "100159",
                "100144",
                "100151",
                "100157",
                "100161",
                "100146",
                "100155"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}