-
Notifications
You must be signed in to change notification settings - Fork 573
Issues: Unstructured-IO/unstructured
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
bug/Two Column PDF partition result in incorrect text.
bug
Something isn't working
#3325
opened Jun 28, 2024 by
pfcharles
bug/pip install failure for Something isn't working
"unstructured[all-docs]"
bug
#3324
opened Jun 28, 2024 by
jacoblee93
bug/quotes from markdown are stripped out
bug
Something isn't working
html
#3309
opened Jun 27, 2024 by
gaspardpetit
feat/extract_pdf_page_images
enhancement
New feature or request
#3299
opened Jun 25, 2024 by
huanji1987
Clean up warning table transformer warning statements statements
bug
Something isn't working
#3288
opened Jun 25, 2024 by
magallardo
Return image data from confluence
enhancement
New feature or request
#3281
opened Jun 24, 2024 by
ML-Abdula
List block in a partitioned Markdown doc identified as a Something isn't working
Title
element under special conditions
bug
#3280
opened Jun 23, 2024 by
nickphilip
UnstructuredImageLoader,使用 OCRAgentPaddle 如何设置ocr 内存大小,模型/语言包下载地址
#3279
opened Jun 23, 2024 by
haike-1213
Minor ocr_interface.py Error Handling Improvement
enhancement
New feature or request
ocr
Related to optical character recognition (OCR).
#3276
opened Jun 22, 2024 by
AscendingGrass
bug(html): invisible links are reported in metadata
bug
Something isn't working
html
#3255
opened Jun 19, 2024 by
scanny
partition_pdf got TypeError: UnstructuredTableTransformerModel.predict() got an unexpected keyword argument 'result_format'
awaiting-response
bug
Something isn't working
pdf
#3253
opened Jun 19, 2024 by
liyang79
bug - duplicates merged cell text following issue #2106
awaiting-response
docx
Related to Microsoft Word (.docx) file format
enhancement
New feature or request
#3250
opened Jun 19, 2024 by
veredmm
feat(pptx): add coordinate metadata to PPTX elements
enhancement
New feature or request
pptx
Related to Microsoft PowerPoint (.pptx) file format
#3248
opened Jun 19, 2024 by
scanny
bug(html): distinct paragraphs within <li> are squashed into single element
bug
Something isn't working
html
#3245
opened Jun 18, 2024 by
scanny
Process Attachments available via Paid API
email
Related to EML and/or MSG format
enhancement
New feature or request
#3243
opened Jun 18, 2024 by
jeremydiba
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.