Markdown export

Last reviewed on April 24, 2026

The Markdown export turns an extracted PDF into a single .md file with sensible headings and paragraph breaks. It is intended for cases where the body text matters as much as the citation record — for example, when you want to take the paper into a notebook, a wiki, or a static-site generator and annotate it.

What the file contains

From top to bottom:

Heading levels are GitHub-flavoured Markdown defaults: # for the title, ## for the major sections.

What it does not contain

The export deliberately does not try to reconstruct the original PDF's typography. It does not preserve column layout, figure placement, or table formatting. Tables, in particular, will appear as a wall of cell text rather than as a Markdown table; the heuristics needed to recover a usable table from arbitrary PDF text fragments are out of scope for the in-browser tool.

Inline citations within the body text are preserved as plain text — for example, "[12]" or "(Smith, 2023)" — but they are not linked to entries in the References section. If you need that, post-process the file with a small script that matches each citation marker to its entry.

Useful pipelines

Tips for cleaner output

When Markdown is the wrong choice

If the goal is to file the citation in a reference manager, use BibTeX, RIS, or CSL-JSON instead. Markdown carries the human-readable representation but loses the field-level structure that a reference manager needs. The two outputs complement each other: the structured export feeds your bibliography, the Markdown export feeds your reading and annotation workflow.