Convert TEI to Markdown.

Drop a TEI file and pull the title and body text out as plain Markdown. It runs entirely in your browser, so your file never leaves your device.

Instant & offline · Free, no accounts

Drag & drop your files

Word · Excel · PowerPoint · PDF · EPUB · CSV · JSON · Code
+135 file formats supported
Batch convert: Many files at once
100% private: Stays on your device
Works offline: No connection needed
Preset

Optimize for AI & RAG

Extra cleanup for LLM ingestion: strip HTML, fix smart quotes, tidy Unicode and spacing.

Add YAML front matter

Prepend a metadata block (title, source, date, word & token counts) for knowledge bases and RAG.

Add table of contents

Build a linked index from the headings. Handy for long documents.

Export RAG chunks (.json)

Split the result into retrieval-ready chunks. Download per file from the result panel.
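
As a rough illustration of the front matter and chunk export options above, here is a minimal Python sketch. It is not the converter's actual code: the function names `add_front_matter` and `split_chunks`, the field names (`words`, `index`, `heading`, `text`) and the choice to split on Markdown headings are all assumptions made for this example. The token count mentioned above is left out because the page does not say which tokenizer is used.

```python
import json
import re
from datetime import date


def add_front_matter(markdown, source, today=None):
    """Prepend a YAML metadata block (hypothetical field names)."""
    # Use the first level-1 heading as the title, else fall back to the file name.
    match = re.search(r"^#\s+(.+)$", markdown, flags=re.MULTILINE)
    title = match.group(1).strip() if match else source
    # Approximate word count: runs of word characters, so "#" is not counted.
    words = len(re.findall(r"\w+", markdown))
    meta = [
        "---",
        f"title: {json.dumps(title)}",    # JSON strings are valid YAML scalars
        f"source: {json.dumps(source)}",
        f"date: {(today or date.today()).isoformat()}",
        f"words: {words}",
        "---",
        "",
    ]
    return "\n".join(meta) + "\n" + markdown


def split_chunks(markdown):
    """Split Markdown into one chunk per heading (an assumed strategy)."""
    chunks = []
    heading, buffer = None, []

    def flush():
        text = "\n".join(buffer).strip()
        if heading is not None or text:
            chunks.append({"index": len(chunks), "heading": heading, "text": text})

    for line in markdown.splitlines():
        if re.match(r"#{1,6}\s", line):
            flush()                        # close the previous chunk
            heading = line.lstrip("#").strip()
            buffer = []
        else:
            buffer.append(line)
    flush()                                # close the final chunk
    return chunks


if __name__ == "__main__":
    md = "# Sonnet 1\n\nFrom fairest creatures we desire increase."
    print(add_front_matter(md, "text.tei"))
    print(json.dumps(split_chunks(md), indent=2))
```

Splitting on headings keeps each chunk tied to a section title, which tends to help retrieval; a real exporter might also cap chunk length or overlap neighbouring chunks.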

Most converters quietly upload your documents to a server. This one physically can't.

01 Why

Rich scholarly tags,
plain reading text.

A TEI file wraps a text in dense scholarly markup. To read or reuse it you mostly want the title and the body text. Converting lifts those out as clean, readable Markdown.

TEI: text.tei

<text>
  <body>
    <head>Sonnet 1</head>
    <p>From fairest creatures we desire increase.</p>
  </body>
</text>

MD: text.md

# Sonnet 1

From fairest creatures we desire increase.
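
The conversion illustrated above can be sketched in a few lines of Python using the standard library's `xml.etree.ElementTree`. This is a hedged approximation, not the converter's real implementation: the function names are hypothetical, and the heading levels are an assumption (the first `<title>` becomes `#`, and body `<head>` elements become `##` when a title exists, otherwise `#`). Inline tags are flattened to their text and `<note>` content is skipped.

```python
import xml.etree.ElementTree as ET


def local(tag):
    """Strip any XML namespace so namespaced and bare TEI tags both match."""
    return tag.rsplit("}", 1)[-1]


def _raw_text(elem):
    """Collect text under elem, dropping inline tags and skipping <note>."""
    parts = [elem.text or ""]
    for child in elem:
        if local(child.tag) != "note":
            parts.append(_raw_text(child))
        parts.append(child.tail or "")
    return "".join(parts)


def flat_text(elem):
    """Flattened text with whitespace collapsed to single spaces."""
    return " ".join(_raw_text(elem).split())


def _walk(elem, heading_mark, out):
    """Emit Markdown blocks for headings, paragraphs and lists, in order."""
    for child in elem:
        name = local(child.tag)
        if name == "head":
            text = flat_text(child)
            if text:
                out.append(f"{heading_mark} {text}")
        elif name == "p":
            text = flat_text(child)
            if text:
                out.append(text)
        elif name == "list":
            items = ["- " + flat_text(i) for i in child if local(i.tag) == "item"]
            if items:
                out.append("\n".join(items))
        elif name != "note":
            _walk(child, heading_mark, out)   # descend into <div>, <lg>, ...


def tei_to_markdown(xml_source):
    root = ET.fromstring(xml_source)
    out = []
    # Keep only the first <title> found anywhere in the file.
    title = next((e for e in root.iter() if local(e.tag) == "title"), None)
    if title is not None and flat_text(title):
        out.append("# " + flat_text(title))
    body = next((e for e in root.iter() if local(e.tag) == "body"), None)
    if body is not None:
        _walk(body, "##" if out else "#", out)
    return "\n\n".join(out)


if __name__ == "__main__":
    sample = (
        "<text><body><head>Sonnet 1</head>"
        "<p>From fairest creatures we desire increase.</p></body></text>"
    )
    print(tei_to_markdown(sample))
```

Matching on local tag names means the sketch handles files with or without the TEI namespace declaration; attributes such as `rend` are never read, so they drop out naturally.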

02 Features

Everything you
actually need.

TEI files in, clean Markdown out, with no server and no account anywhere.

It never leaves your browser

The .tei file is read and converted on your own device. Nothing is uploaded to any server, ever.

AI & RAG ready

Optional cleanup, YAML front matter, a table of contents and RAG chunk export.

Works offline

Once the page has loaded you can switch off your connection and it keeps converting.

<head>Sonnet 1</head>

## Sonnet 1

Structure mapped

Body headings, paragraphs and list items become Markdown headings, text and bullets.

éñü

Unicode safe

Accents, symbols and non-Latin scripts come through intact as UTF-8.

Free, and unlimited

No sign-up, no quotas, no watermarks. Convert one file or a thousand; it all runs the same way, on your own device.

03 Fidelity

What survives
the trip.

Honest about what comes through, and what doesn't. These are the same notes the Formats list shows for TEI, so the page never drifts from what the converter really does.

Kept (4)
  • First title
  • Body headings
  • Paragraphs
  • List items

Dropped (4)
  • Inline markup
  • Attributes
  • Notes
  • Header metadata

TEI: text.tei
  • Sonnet 1: kept
  • <hi rend="italic">word</hi> · <note>...</note>: dropped
  • <teiHeader>...</teiHeader>: dropped

04 FAQ

TEI questions,
answered.

Everything worth knowing before you drop in a TEI file.

05 More

Other converters.

Working with more than TEI files? These convert the same way: privately, in your browser.

  • Hangul to Markdown (.hwpx): Korean Hangul Office documents.
  • InDesign / IDML to Markdown (.idml · .icml): Adobe InDesign layout exchange files.
  • Excel to Markdown (.xlsx): Microsoft Excel workbooks.
  • OpenDocument Sheet to Markdown (.ods · .fods): LibreOffice Calc spreadsheets.
  • CSV to Markdown (.csv): Comma-separated value tables.
  • TSV to Markdown (.tsv): Tab-separated value tables.
  • Parquet to Markdown (.parquet): Columnar big-data tables.
  • Gnumeric to Markdown (.gnumeric): Gnumeric spreadsheets.