数据集:

diwank/hinglish-dump

许可:

mit
英文

Hinglish Dump

Hinglish(hi-EN)数据集的原始合并转储。

子集和特征

子集:

  • crowd_transliteration
  • hindi_romanized_dump
  • hindi_xlit
  • hinge
  • hinglish_norm
  • news2018
_FEATURE_NAMES = [
    "target_hinglish",
    "source_hindi",
    "parallel_english",
    "annotations",
    "raw_input",
    "alternates",
]