Two options for extracting text from a PDF: iText, where the PdfTextExtractor class seems able to do what you want, and Apache PDFBox, which claims 'PDF to text extraction' as its top feature.

If the text exists only as image data, the digitizer settings of a document converter can be changed so that it is recognized:

import .*;
DocumentConverter dc = new DocumentConverter();
// Change digitizer settings to recognize text from image data
ConverterDigitizerSettings cds = dc.getPreferences().getDigitizerSettings();
cds.setDigitizationMode(DigitizationMode.ALL_IMAGES);
cds.setRecognizeElementTypes(RecognizeElementTypes.

Now go to the File menu again and choose Save as Adobe PDF. Define the name and location for the PDF file and click Save. Your PDF file will be saved on your computer.