-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with quoted text #4
Comments
Yes, that indeed seems like a bug! Thanks for the heads up, we will let you know when it is corrected! |
Thanks for responding Sarah. The reason for this bug is because of quote escapes as we parse first into tsv and only then import into MySQL. Fields separated by quotes. I suggest we fix that at the DB stage as part of our transformation routine.
… On Oct 18, 2017, at 7:48 PM, sarahkelley ***@***.***> wrote:
Yes, that indeed seems like a bug! Thanks for the heads up, we will let you know when it is corrected!
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Thanks for the suggestion Evgeny, that makes a lot of sense! |
Sarah: an even easier solution - use mediumtext or longtext in the DB to store detailed descriptions http://boolean.co.nz/blog/max-length-for-mysql-text-field-types/135/
… On Oct 18, 2017, at 7:48 PM, sarahkelley ***@***.***> wrote:
Yes, that indeed seems like a bug! Thanks for the heads up, we will let you know when it is corrected!
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
It looks like there is an issue with how PatentsView handles quotation marks. For example, whenever a quotation mark occurs in the patent's title, PatentsView quotes the entire title and adds extra quotation marks around the actual quoted text. You can see this behavior in patent number 5767337:
The same behavior is seen in the bulk data files.
The text was updated successfully, but these errors were encountered: