Convert a PDF to XML - Using pdftohtml it's possible to convert a PDF file to an XML file that includes all location information How to extract one page of a PDF as an image -