Improve extraction where an opening marker absorbs the following whitespace. #420
Labels
bug
Something isn't working
pipeline 2: extract
Issue related to extracting parallel corpora
usfm
USFM parsing issue
In the ESV the SFM is particularly complex and some extracted files occasionally missing a space between words.
->
->
I tested a few variations and the problem seems to be that the \wj marker absorbs the following whitespace. The complexity of the SFM markup doesn't seem to be an issue.
The text was updated successfully, but these errors were encountered: