transcript ingest process #206

Open
kimpham54 opened this issue Jul 27, 2020 · 1 comment
Labels
enhancement New feature or request

Comments

@kimpham54
Member

Bulk transcript ingest: add a transcript.txt to each file so that it gets indexed in Elasticsearch.
Individual transcript ingest: this happens after the objects have been ingested.
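
A rough sketch of what the bulk path could look like, assuming each object lives in its own folder, the folder name doubles as the Elasticsearch document ID, and the text goes into a field called `transcript`. The index name `objects`, the field name, and the folder layout are all placeholders, not anything the project has settled on. It only builds the newline-delimited body for the `_bulk` endpoint and uses `update` with `doc_as_upsert` so a transcript can be attached to an object that was already ingested:

```python
import json
from pathlib import Path


def build_bulk_body(root, index="objects", field="transcript"):
    """Build an NDJSON body for the Elasticsearch _bulk endpoint.

    Assumptions (hypothetical, not from the project):
    - every object folder under `root` may contain a transcript.txt
    - the folder name is the document ID
    - `index` and `field` are placeholder names
    """
    lines = []
    for path in sorted(Path(root).rglob("transcript.txt")):
        object_id = path.parent.name
        text = path.read_text(encoding="utf-8")
        # Action line: partial update of the document for this object.
        lines.append(json.dumps({"update": {"_index": index, "_id": object_id}}))
        # Source line: set the transcript field, creating the doc if missing.
        lines.append(json.dumps({"doc": {field: text}, "doc_as_upsert": True}))
    # The bulk API expects every line, including the last, to end in a newline.
    return "".join(line + "\n" for line in lines)
```

The returned string could then be POSTed to `/_bulk` with the `application/x-ndjson` content type. The same function would cover the individual case by pointing `root` at a single object folder.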

How do we accommodate the various transcript formats? Or maybe we can't.

Transcripts are free text; what about coordinated transcripts? E.g. https://transkribus.eu/r/notarial/detail/LnknZhYV?term=dog&result=0&search=%7B%22term%22%3A%22dog%22%7D
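
To make the difference concrete, here is a minimal sketch of one way a coordinated transcript might be modelled: each word carries a bounding box on the page image, and the free-text version is just the words joined together. The `Word` fields and both helper functions are hypothetical; this is not the Transkribus export format.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One transcribed word plus its (hypothetical) pixel bounding box."""
    text: str
    x: int
    y: int
    width: int
    height: int


def to_free_text(words):
    """Flatten a coordinated transcript into plain text for indexing."""
    return " ".join(w.text for w in words)


def find_term(words, term):
    """Return the words matching `term`, ignoring case and edge punctuation."""
    needle = term.lower()
    return [w for w in words if w.text.lower().strip(".,;:!?") == needle]
```

With something like this, the flattened text could go into the same index field as a plain transcript, while the boxes returned by `find_term` could be used to highlight hits on the page image.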

@kimpham54 kimpham54 added the enhancement New feature or request label Jul 27, 2020
@kcrowe1981
Collaborator

I realize this was specific to the JCRS and Transkribus project on handwriting character recognition, but I would love to revisit indexing full-text objects (PDFs, for example) in Elasticsearch at some future point, which would be much more of an enhancement.
