PythonHub
2.44K subscribers
2.35K photos
49.4K links
News & links about Python programming.
https://pythonhub.dev/
Download Telegram
Python
To extract text, I use tika for pdf files. However, for scanned PDFs it cannot work. I have learned that I must set extractInlineImages true. But how ? There is no tika-config.xml file on my disk. How can I programmatically set it?: https://www.reddit.com/r/Python/comments/7yqway/to_extract_text_i_use_tika_for_pdf_files_however/