Question 1

Why does AI output have all this formatting in the first place?

Accepted Answer

LLMs are trained heavily on markdown — documentation, GitHub, forums — and chat interfaces render that markdown into pretty bold text and headers. When you copy from the chat window, some apps give you the rendered text, others give you the raw asterisks and pound signs. Smart quotes and em dashes appear because the training data is full of professionally typeset prose. None of it is a watermark; it is just the model writing in its native dialect.

Question 2

Are em dashes really a sign of AI writing?

Accepted Answer

No — the em dash is a legitimate punctuation mark used by professional writers for centuries. AI models do use it noticeably more often than the average person types it (since — is hard to type on most keyboards), which is why frequent em dashes became a folk heuristic for AI text in 2024-25. The goal of this tool is not to hide anything; it is consistency with your own style. If you never type em dashes, your published text should not suddenly be full of them.

Question 3

What are the hidden characters it removes?

Accepted Answer

Zero-width spaces (U+200B), zero-width non-joiners and joiners (U+200C, U+200D), word joiners (U+2060), and byte-order marks (U+FEFF). These are invisible but real characters that break search-and-replace, inflate character counts, trip plagiarism checkers, and cause SEO tools to misread your content. They sneak in via copy-paste chains through web apps. The summary tells you how many were found — often zero, sometimes dozens.

Question 4

Will it mangle code blocks in the output?

Accepted Answer

Not with the default settings. The preserve-code toggle splits the text on fenced ``` blocks and passes them through untouched, so indentation, asterisks, and underscores inside code survive. Turn the toggle off if you want the fences removed and the code treated as ordinary text.

Question 5

How does it handle ***bold italic*** or nested emphasis?

Accepted Answer

The emphasis stripper runs repeatedly until the text stops changing. ***word*** unwraps to *word* on the first pass and to plain word on the second. This iterate-until-stable approach is more reliable than trying to write one regex that anticipates every nesting combination.

Question 6

Is my text sent to a server?

Accepted Answer

No. The entire cleaner runs in your browser with JavaScript string operations. Nothing is uploaded, logged, or stored — you can verify by loading the page once and then switching off your internet connection; the tool keeps working.

Artifact	Example in	Result out
Bold / italic	key point and aside	key point and aside
Headers	## Conclusion	Conclusion
Links	[our guide](https://…)	our guide
Em dash	fast — and cheap	fast, and cheap
Smart quotes	“done” & it’s	"done" & it's
Zero-width chars	invisible (U+200B…U+FEFF)	deleted, with a count

AI Text Cleaner (Markdown & Em Dash Remover)

How to use the ai text cleaner (markdown & em dash remover)

What gets cleaned, exactly

The CMS and email paste workflow

Frequently asked questions

Related tools

Learn more