Skip to content

Korean text: WARNING: Untokenizable #1176

Discussion options

You must be logged in to vote

First, you have to select "Korean" in the "New Project" window. Currently you are selecting "English."

Second, please try 5MB file at first. Not 50 GB.

I recommend that you perform random sampling to reduce the data size. I tried up to 200MB. I am not sure about GB.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Lyroxide
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment