We developers seem to really dislike PDFs, to a degree that we'll build LLMs and...

jgalt212 · 2025-03-06T20:01:15 1741291275

I think you might be looking for PDF/A.

For example, if you print a word doc to PDF, you get the raw text in PDF form, not an image of the text.

gpvos · 2025-03-06T21:20:13 1741296013

PDF/A doesn't require preserving the document structure, only that any text is extractable.

siva7 · 2025-03-07T04:12:43 1741320763

> We developers seem to really dislike PDFs, to a degree that we'll build LLMs and have them translate it into Markdown.

Why Jokes aside? Markdown/html is better suited for the web than pdf