Major KBT's: Option to add them to training data when the source glosses are in a different language than the training source text. #441
Labels
enhancement
New feature or request
pipeline 3: preprocess
Issue related to preprocessing.
research
Research topics
*** This enhancement request is for research purposes. ***
Glosses for the Major KBT's are available in a limited number of languages. Currently, SILNLP will not add glosses to the training data if the source language for the glosses is not the same as the source language of the translation used for training. This excludes many projects from being able to use their KBT's. We would like to investigate the benefits of including the source/vernacular glosses in the training data even when the translation source language differs from the gloss source language.
The text was updated successfully, but these errors were encountered: