HTML Entity Decoder — Convert Entities to Characters
Decode HTML entities like &amp;, &lt;, and &#39; back to their original characters. Clean up encoded content from web scraping, emails, and CMS exports.
Cleaning Up Scraped Content
Web scraping tools often return text with HTML entities intact. Titles, descriptions, and article bodies may contain &amp; instead of &, &quot; instead of quotation marks, and numeric entities for special characters. Decoding these entities produces clean, readable text ready for storage, analysis, or display in a different format.
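For readers who want to do the same cleanup in a script, here is a minimal sketch in Python. It assumes the scraped fields arrive as a dictionary of strings; the `decode_record` helper and the sample record are hypothetical, and `html.unescape` from the Python standard library stands in for the decoder described on this page rather than reproducing its implementation.

```python
import html


def decode_record(record: dict[str, str]) -> dict[str, str]:
    """Return a copy of a scraped record with HTML entities decoded in every field."""
    return {key: html.unescape(value) for key, value in record.items()}


# Hypothetical scraped record containing named, decimal, and hexadecimal entities.
scraped = {
    "title": "Tom &amp; Jerry: &quot;Classics&quot;",
    "description": "Price: &#8364;5 &#x2014; limited offer",
}

cleaned = decode_record(scraped)
print(cleaned["title"])
print(cleaned["description"])
```

Decoding once, right after scraping, keeps the stored text free of entities. If the text is later written back into an HTML page, it needs to be escaped again at that point.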
Frequently Asked Questions
Yes. It supports all 2,231 named entities defined in the HTML Living Standard, including less common ones like &hearts; (♥), &mdash; (—), and &euro; (€), as well as all decimal and hexadecimal numeric entities.
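The three entity forms mentioned above can be compared side by side. This is an illustrative sketch using Python's standard `html.unescape`, not this tool's own code; the variable names are made up for the example.

```python
import html

# The same character written as a named, a decimal, and a hexadecimal entity.
heart_forms = ["&hearts;", "&#9829;", "&#x2665;"]
decoded_hearts = [html.unescape(form) for form in heart_forms]

# Named entities from the answer above, decoded to their code points.
named = {name: html.unescape(name) for name in ("&mdash;", "&euro;")}

print(decoded_hearts)
print({name: f"U+{ord(char):04X}" for name, char in named.items()})
```

All three forms of the heart entity decode to the same single character, U+2665.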
Common scenarios include cleaning up web-scraped content, processing RSS/Atom feeds, migrating content between CMS platforms, and reading data from APIs that return HTML-encoded strings.