Markdown is a lightweight text format that uses simple symbols to structure documents: # for headings, **text** for bold, *text* for italic, - for lists, and `text` for code. It reads as plain text but renders as formatted content. It is the standard format in GitHub, Notion, Obsidian, VS Code, ChatGPT, Claude and most AI tools.

How many file formats are supported for Markdown conversion?

15 formats: DOCX, PDF, XLSX, XLS, HTML, TXT, MD, CSV, JSON, XML, JPG, PNG, WEBP, BMP and GIF. Images are processed using in-browser OCR. More formats are added based on user demand.

Is my file uploaded to a server during Markdown conversion?

No. All processing happens in your browser using JavaScript. Your file never leaves your device and no data is sent to external servers.

Is ConverterToMarkdown free?

Yes, completely free with no registration. No account or credit card required.

What is the maximum file size for conversion to Markdown?

20 MB per file. If your document is larger, consider compressing it or splitting the content before converting.

Can I edit the generated Markdown after conversion?

Yes. The result appears in a built-in editor with two modes: editor .md to edit the raw Markdown syntax directly, and Preview to see the rendered output with real headings, tables and formatting. Both modes are synced in real time. You can copy to clipboard or download the .md file at any time.

Does the Markdown converter work offline?

Yes. Once the page is loaded, the converter works fully offline. No internet connection is required to process files.

Do scanned PDFs convert to Markdown correctly?

Yes. The tool automatically detects when a PDF contains no extractable text and applies OCR page by page. PDFs with digital text are converted directly; scanned PDFs are processed with optical character recognition entirely in your browser.

Can I convert multiple files to Markdown at once?

Yes. The Multiple files mode lets you drag or select several files at once. They are converted sequentially to avoid overloading the browser, and each file gets its own download button as soon as it finishes. When all files are ready, a single button downloads all the .md files as a ZIP archive.

Is there a Model Context Protocol (MCP) server for AI agent integration?

Yes. A Model Context Protocol (MCP) server is available via npx convertertomarkdown-mcp. Compatible with Claude Code, Cursor, and any MCP-compatible agent. The agent can convert files to Markdown directly without opening a browser. See https://github.com/franciscovaleromartin/convertertomarkdown-mcp for details.

How do I convert a PDF to Markdown for free?

Go to ConverterToMarkdown.com and drag your PDF file onto the converter. The tool automatically extracts text from digital PDFs using pdf.js. If the PDF is scanned or image-only, OCR kicks in automatically via Tesseract.js — no manual action required. The result appears in the built-in editor where you can copy or download the Markdown file. Free, no upload, no registration.

How do I convert a Word document (DOCX) to Markdown?

Drag your .docx file onto ConverterToMarkdown.com. The tool uses mammoth.js to convert Word formatting to Markdown: headings become # symbols, bold text becomes **bold**, italic becomes *italic*, and tables are converted to Markdown pipe syntax. Lists, links and inline styles are preserved. The output appears instantly in a live editor. Free, no upload, works entirely in your browser.

How do I convert an Excel spreadsheet to Markdown?

Drag your .xlsx or .xls file onto ConverterToMarkdown.com. The tool uses SheetJS to parse each worksheet and converts it to a Markdown table with aligned column headers. If the workbook has multiple sheets, each sheet becomes a separate table in the output. The result is ready to paste into GitHub READMEs, Confluence, Notion, or any Markdown-aware tool. Free, no upload.

ConverterToMarkdown.com

—

How ConverterToMarkdown Works

ConverterToMarkdown converts any file to Markdown directly in your browser using specialized JavaScript libraries — no installation, no server, no upload required. Supports 15 formats: DOCX, PDF, XLSX, HTML, CSV, JSON, XML and images with automatic OCR via Tesseract.js.

Updated June 2026

📁 Choose how to convert

Three input modes: "File" to drag or select a single file from your system; "URL" to paste the link to any publicly accessible file (PDF on a CDN, DOCX on a server, etc.); and "Multiple files" to select several documents at once and convert them in batch. Supports DOCX, PDF, XLSX, XLS, HTML, TXT, MD, CSV, JSON, XML and images (JPG, PNG, WEBP, BMP, GIF) via OCR. Maximum file size: 20 MB per file.

⚙️ The browser processes it

The file is converted entirely in your browser using specialized JavaScript libraries: mammoth.js for DOCX, pdf.js for PDF, SheetJS for Excel, Turndown for HTML, PapaParse for CSV and Tesseract.js for images (OCR). No bytes are sent to any server. The process is instant for small files and works offline once the page is loaded.

✓ Edit, preview, copy or download

The resulting Markdown appears in the built-in editor. Switch to "editor .md" to edit raw Markdown syntax directly, or switch to "Preview" to see the rendered output — headings, bold, tables, code blocks — and edit with visual formatting. Changes sync in real time between both modes. Copy to clipboard or download as a .md file ready for GitHub, GitLab, Notion, Obsidian, Docusaurus, Jekyll, Hugo or any Markdown-aware tool.

Details by format

DOCX

mammoth.js ↗

Converts to intermediate HTML using mammoth.js, preserving headings (h1–h6), bold, italic, tables and lists. The HTML is then cleaned and converted to Markdown with Turndown. Images are skipped; only text content is converted.

PDF

pdf.js ↗

Extracts text from each page using pdf.js. If the PDF contains no extractable text (scanned PDF), automatically falls back to OCR with Tesseract.js page by page — same as with image files. Headers and footers may merge with body text depending on the PDF structure.

XLSX / XLS

SheetJS ↗

Reads the workbook with SheetJS and converts each sheet into a separate Markdown table with pipe-delimited columns. Multi-sheet files produce multiple tables, each labeled with the sheet name. Formulas are resolved to their current values.

HTML

DOMParser + Turndown ↗

Strips inline styles, scripts, navigation elements and visual noise with DOMParser before passing the cleaned HTML to Turndown. Preserves semantic structure: headings, paragraphs, links, emphasis, blockquotes and code blocks.

CSV

PapaParse ↗

Parses CSV files with PapaParse, auto-detecting delimiter (comma, semicolon, tab). Outputs a Markdown table with header row detection. Supports large files with hundreds of rows.

TXT / MD

Native

Returns plain text without transformation. Line breaks are preserved as-is.

JSON

Native

Validates the JSON structure and wraps the formatted output in a fenced code block with json syntax highlighting. Handles nested objects, arrays, minified JSON and malformed input.

XML

Native

Wraps the raw XML content in a fenced code block preserving indentation and structure. Useful for inspection and documentation purposes.

JPG / PNG
WEBP / BMP
GIF

Tesseract.js ↗

Runs OCR (optical character recognition) in the browser using Tesseract.js. Automatically detects the language from browser settings and loads the matching language model. Supports JPG, PNG, WEBP, BMP and GIF. The language model (~4 MB) is downloaded once and cached. Works well with printed text; handwritten content may have lower accuracy.