metadata.xml 986 B

1234567891011121314151617181920212223242526
  1. <?xml version="1.0" encoding="UTF-8"?>
  2. <!DOCTYPE pkgmetadata SYSTEM "http://www.gentoo.org/dtd/metadata.dtd">
  3. <pkgmetadata>
  4. <maintainer type="person">
  5. <email>tomka@gentoo.org</email>
  6. </maintainer>
  7. <longdescription>
  8. pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which
  9. contain only images (no text) will be processed by optical character
  10. recognition (OCR) and the text will be added to each page invisibly
  11. "behind" the images.
  12. pdfsandwich is a command line tool which is supposed to be useful to
  13. OCR scanned books or journals. It is able to recognize the page layout
  14. even for multicolumn text.
  15. Essentially, pdfsandwich is a wrapper script which calls the following
  16. binaries: convert, cuneiform, gs, and hocr2pdf. It is known to run on
  17. Unix systems and has been tested on Linux and MacOS X. It supports
  18. parallel processing on multiprocessor systems.
  19. </longdescription>
  20. <upstream>
  21. <remote-id type="sourceforge">pdfsandwich</remote-id>
  22. </upstream>
  23. </pkgmetadata>