I'm asked this often because the truth is, making an eBook from a PDF is quite expensive, as it's time-consuming and labor-intensive. People think that's nuts--they can see that option, right there in Adobe Acrobat or PDF 995 or whatever PDF reader software they're using, that says "export file to Word" or "export to HTML" or, from Google Docs, "export to ePUB," and they think, "well, hell, how hard can it be?"
The answer is--very.
The actual steps for us to make an ebook from a PDF are:
- We scan the PDF, using ABBYY FineReader commercial scanning software.
- We OCR the PDF, also using ABBYY.
- When we're done with that step, we have a raw scan output Word file. This Word file has the same page layout and page breaks as the original PDF. That's important and you'll see why in a moment.
- That Word file is full of red-marked/lettered text. That text indicates where ABBYY suspects a scanning error.
- We go through, fixing all of those, checking them against the original PDF.
- When we've completed that step, we export that edited Word file to PDF format.
- We then run a COMPARE program, that compares the original PDF ("PDF1") with this new, from-the-Word-file-PDF ("PDF2").
- We check and correct every single comparison "twig" that says that there's something different between PDF1 and PDF2.
- Then we take the revised Word file and export another PDF ("PDF3"), and,
- Yup, we run a comparison, now, between PDF3 and PDF1.
- If we get more compare discrepancies, we lather-rinse-repeat, correcting those discrepancies in the Word file, from the original PDF.
- And we continue this process, over and over, until there are no discrepancy reports between PDFx and PDF1.
- At that point--we are finally ready to start the eBook formatting process, which means we start by cleaning the "new" Word file, exporting it to HTML and starting at the same place that we would have been, if we'd had a Word file to start with.
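For the curious, here's a rough sketch of what that compare loop is doing, in Python. To be clear, this is not the commercial compare program we run; it's a minimal illustration using Python's standard `difflib` module. It assumes the text of each PDF has already been pulled out into a list with one string per page, and the function name `page_discrepancies` is made up for this example. It also shows why matching page breaks matter: the comparison goes page by page, so page 12 of PDF1 has to line up with page 12 of PDF2.

```python
import difflib


def page_discrepancies(original_pages, candidate_pages):
    """Compare two documents page by page.

    Both arguments are lists of strings, one string per page
    (assumption: the text was already extracted from each PDF).
    Returns a list of (page_number, original_text, candidate_text)
    tuples, one per differing run of lines. Page number 0 is used
    to flag a mismatch in the total page count.
    """
    issues = []
    if len(original_pages) != len(candidate_pages):
        issues.append(
            (0, f"{len(original_pages)} pages", f"{len(candidate_pages)} pages")
        )
    pairs = zip(original_pages, candidate_pages)
    for number, (orig, cand) in enumerate(pairs, start=1):
        orig_lines = orig.splitlines()
        cand_lines = cand.splitlines()
        matcher = difflib.SequenceMatcher(None, orig_lines, cand_lines, autojunk=False)
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            if tag != "equal":
                # Anything that is not an exact match needs a human to look at it.
                issues.append(
                    (number, "\n".join(orig_lines[i1:i2]), "\n".join(cand_lines[j1:j2]))
                )
    return issues


# PDF1 is the original; PDF2 came from the corrected Word file.
pdf1 = ["Chapter 1\nIt was the best of times.", "The end."]
pdf2 = ["Chapter l\nIt was the best of times.", "The end."]  # OCR read "1" as "l"

for page, expected, found in page_discrepancies(pdf1, pdf2):
    print(f"page {page}: expected {expected!r}, found {found!r}")
```

Each reported discrepancy gets fixed by hand in the Word file, a fresh PDF gets exported, and the comparison runs again. We're done only when the list comes back empty.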
And that is why, especially for very long PDFs with complex layouts, formatting into an eBook is so expensive. The automatic "export to Word" functions, whether from Acrobat, other PDF software or those online websites, all produce utter nonsense. What comes out looks okay on the surface, but it's broken underneath--where eBooks live.
Heck, don't believe me--export your PDF into Word, upload that to KDP, and preview the resulting eBook. Horrified? Yup, that's how that goes. Trust me, we don't do this for fun! If there were a faster, cheaper way to do this right--making eBooks from PDFs--I can assure you, we'd be the very first people to use it!