TFDS는 이제 Croissant 🥐 형식을 지원합니다! 자세한 내용은 설명서를 읽어보세요.

이 페이지는 Cloud Translation API를 통해 번역되었습니다.

핏

참고자료:

아니면-ur

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/or-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	43766

특징 :

{
    "translation": {
        "languages": [
            "or",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml 또는

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ml-or')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	19413

특징 :

{
    "translation": {
        "languages": [
            "ml",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

억타

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	33005

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구-미스터

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-mr')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	30766

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

안녕하세요

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-or')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	61070

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-or

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-or')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	98230

특징 :

{
    "translation": {
        "languages": [
            "en",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

미스터-어

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/mr-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	49691

특징 :

{
    "translation": {
        "languages": [
            "mr",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

엔타

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	118759

특징 :

{
    "translation": {
        "languages": [
            "en",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

안녕따

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	64945

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

억엔

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-en')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	93560

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-또는

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-or')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	26456

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-ta

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ml-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	23609

특징 :

{
    "translation": {
        "languages": [
            "ml",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구르

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	29938

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-ml

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-ml')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	18149

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-pa

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ml-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	21978

특징 :

{
    "translation": {
        "languages": [
            "ml",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

엔파

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	103296

특징 :

{
    "translation": {
        "languages": [
            "en",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-hi

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-hi')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	49598

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

안녕하세요

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	75200

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구테

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	16335

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

파타

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/pa-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	46349

특징 :

{
    "translation": {
        "languages": [
            "pa",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

안녕 ml

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-ml')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	27167

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

오르테

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/or-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	10475

특징 :

{
    "translation": {
        "languages": [
            "or",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ml

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-ml')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	44986

특징 :

{
    "translation": {
        "languages": [
            "en",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

엔하이

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-hi')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	269594

특징 :

{
    "translation": {
        "languages": [
            "en",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

억파

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	35109

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

미스터 테

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/mr-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	18179

특징 :

{
    "translation": {
        "languages": [
            "mr",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

미스터 아빠

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/mr-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	50418

특징 :

{
    "translation": {
        "languages": [
            "mr",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

비앤티

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	17605

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구하이

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-hi')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	41587

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

타 우르

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ta-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	48892

특징 :

{
    "translation": {
        "languages": [
            "ta",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

테-우르

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/te-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	21148

특징 :

{
    "translation": {
        "languages": [
            "te",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

오르파

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/or-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	43159

특징 :

{
    "translation": {
        "languages": [
            "or",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구-ml

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-ml')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	18252

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구파

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-pa')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	35566

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

하이테

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	28569

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

엔테

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	44888

특징 :

{
    "translation": {
        "languages": [
            "en",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-te

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ml-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	10480

특징 :

{
    "translation": {
        "languages": [
            "ml",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

파-우르

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/pa-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	51831

특징 :

{
    "translation": {
        "languages": [
            "pa",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

안녕하세요

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	109951

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

미스터 또는

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/mr-or')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	47001

특징 :

{
    "translation": {
        "languages": [
            "mr",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

엉-우르

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	202578

특징 :

{
    "translation": {
        "languages": [
            "en",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-ur

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ml-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	20913

특징 :

{
    "translation": {
        "languages": [
            "ml",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

억-미스터

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-mr')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	34043

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구타

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	29187

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

머리

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/pa-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	25684

특징 :

{
    "translation": {
        "languages": [
            "pa",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

억구

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-gu')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	25166

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "gu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

억-우르

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/bn-ur')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	39290

특징 :

{
    "translation": {
        "languages": [
            "bn",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-mr

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ml-mr')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	22796

특징 :

{
    "translation": {
        "languages": [
            "ml",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

오르타

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/or-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	44035

특징 :

{
    "translation": {
        "languages": [
            "or",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

따-테

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/ta-te')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	17359

특징 :

{
    "translation": {
        "languages": [
            "ta",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

구-또는

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/gu-or')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	27162

특징 :

{
    "translation": {
        "languages": [
            "gu",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

엔구

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-gu')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	59739

특징 :

{
    "translation": {
        "languages": [
            "en",
            "gu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

안녕하세요

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/hi-mr')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	69186

특징 :

{
    "translation": {
        "languages": [
            "hi",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

미스터타

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/mr-ta')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	48535

특징 :

{
    "translation": {
        "languages": [
            "mr",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mr

TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.

ds = tfds.load('huggingface:pib/en-mr')

설명 :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

라이센스 : 알려진 라이센스 없음
버전 : 1.3.0
분할 :

나뉘다	예
`'train'`	117199

특징 :

{
    "translation": {
        "languages": [
            "en",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}