Quote:
Originally Posted by Joseph Paul
What if it was scanned with OCR and saved as a non-PDF text format?
|
Unless that was a REALLY advanced OCR program specifically designed to do exactly what you want it to do it would likely create even more formatting problems.
Though, thinking about the problem again. Copy the PDF text for a template, strip out all newlines, do newline on semicolon, do newline on dot, do indent and newline on comma should produce something largely in line with what Mailanka posted