- Description:
Universal Dependencies (UD) is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 200 contributors producing more than 100 treebanks in over 70 languages. If you’re new to UD, you should start by reading the first part of the Short Introduction and then browsing the annotation guidelines.
Homepage: https://universaldependencies.org/
Source code:
tfds.datasets.xtreme_pos.BuilderVersions:
1.0.0(default): Initial release.
Download size:
338.76 MiBAuto-cached (documentation): Yes
Feature structure:
FeaturesDict({
'tokens': Sequence(Text(shape=(), dtype=string)),
'upos': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=18)),
})
- Feature documentation:
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| FeaturesDict | ||||
| tokens | Sequence(Text) | (None,) | string | |
| upos | Sequence(ClassLabel) | (None,) | int64 |
Supervised keys (See
as_superviseddoc):NoneFigure (tfds.show_examples): Not supported.
Citation:
@article{nivre2018universal,
title={Universal Dependencies 2.2},
author={Nivre, Joakim and Abrams, Mitchell and Agi{'c}, {
{Z} }eljko
and Ahrenberg, Lars and Antonsen, Lene and Aranzabe, Maria Jesus and
Arutie, Gashaw and Asahara, Masayuki and Ateyah, Luma and Attia,
Mohammed and others},
year={2018}
}
xtreme_pos/xtreme_pos_af (default config)
Dataset size:
445.94 KiBSplits:
| Split | Examples |
|---|---|
'dev' |
194 |
'test' |
425 |
'train' |
1,315 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ar
Dataset size:
3.35 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
909 |
'test' |
1,680 |
'train' |
6,075 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_bg
Dataset size:
2.14 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
1,115 |
'test' |
1,116 |
'train' |
8,907 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_de
Dataset size:
37.62 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
19,233 |
'test' |
22,458 |
'train' |
166,849 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_el
Dataset size:
7.17 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
2,559 |
'test' |
2,809 |
'train' |
28,152 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_en
Dataset size:
4.67 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
4,699 |
'test' |
6,165 |
'train' |
26,825 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_es
Dataset size:
8.26 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
3,054 |
'test' |
3,147 |
'train' |
28,492 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_et
Dataset size:
4.84 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
3,125 |
'test' |
3,760 |
'train' |
25,749 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_eu
Dataset size:
1.27 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
1,798 |
'test' |
1,799 |
'train' |
5,396 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_fa
Dataset size:
1.73 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
599 |
'test' |
600 |
'train' |
4,798 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_fi
Dataset size:
4.48 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
3,239 |
'test' |
4,422 |
'train' |
27,198 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_fr
Dataset size:
7.28 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
5,979 |
'test' |
9,465 |
'train' |
47,308 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_he
Dataset size:
1.57 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
484 |
'test' |
491 |
'train' |
5,241 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_hi
Dataset size:
5.78 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
1,884 |
'test' |
2,909 |
'train' |
14,752 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_hu
Dataset size:
438.07 KiBSplits:
| Split | Examples |
|---|---|
'dev' |
441 |
'test' |
449 |
'train' |
910 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_id
Dataset size:
1.31 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
559 |
'test' |
1,557 |
'train' |
4,477 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_it
Dataset size:
6.85 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
2,278 |
'test' |
3,518 |
'train' |
29,685 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ja
Dataset size:
3.57 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
8,938 |
'test' |
10,253 |
'train' |
47,926 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_kk
Dataset size:
167.15 KiBSplits:
| Split | Examples |
|---|---|
'test' |
1,047 |
'train' |
31 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ko
Dataset size:
5.82 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
3,016 |
'test' |
4,276 |
'train' |
27,410 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_mr
Dataset size:
56.14 KiBSplits:
| Split | Examples |
|---|---|
'dev' |
46 |
'test' |
47 |
'train' |
373 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_nl
Dataset size:
2.90 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
1,394 |
'test' |
1,471 |
'train' |
18,051 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_pt
Dataset size:
4.65 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
1,770 |
'test' |
2,681 |
'train' |
17,992 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ru
Dataset size:
20.25 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
9,960 |
'test' |
11,336 |
'train' |
67,435 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ta
Dataset size:
3.65 KiBSplits:
| Split | Examples |
|---|---|
'test' |
55 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_te
Dataset size:
143.77 KiBSplits:
| Split | Examples |
|---|---|
'dev' |
131 |
'test' |
146 |
'train' |
1,051 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_th
Dataset size:
377.24 KiBSplits:
| Split | Examples |
|---|---|
'test' |
1,000 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_tl
Dataset size:
228.78 KiBSplits:
| Split | Examples |
|---|---|
'dev' |
80 |
'test' |
120 |
'train' |
400 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_tr
Dataset size:
1.06 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
988 |
'test' |
4,785 |
'train' |
3,664 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_ur
Dataset size:
1.50 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
552 |
'test' |
535 |
'train' |
4,043 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_vi
Dataset size:
454.32 KiBSplits:
| Split | Examples |
|---|---|
'dev' |
800 |
'test' |
800 |
'train' |
1,400 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_yo
Dataset size:
22.65 KiBSplits:
| Split | Examples |
|---|---|
'test' |
100 |
- Examples (tfds.as_dataframe):
xtreme_pos/xtreme_pos_zh
Dataset size:
3.29 MiBSplits:
| Split | Examples |
|---|---|
'dev' |
3,038 |
'test' |
5,528 |
'train' |
18,998 |
- Examples (tfds.as_dataframe):