Discover GroupDocs.Parser free online app!

XHTML Extensible Hypertext Markup Language File

The XHTML is a text-based file format with markup in the XML, using a reformulation of HTML 4.0. These files are well suited to be open or viewed in a web browser. XHTML was designed to be more structured, less scripting, generic; using all the existing facilities of XML and more device-independent. XHTML provides a generally worthwhile set of elements and attributes, with extension options in combination with style sheets.

Read More

How to

How to parse text and metadata from XHTML files online

  • Click inside the file drop area to upload a XHTML file or drag & drop a XHTML file.
  • Click "Get Text and Metadata" button to extract a text and metadata from your XHTML document.
  • Click "Get Images" button to extract images from your XHTML document.
  • Once your XHTML document is parsed click on "Download Now" button.
  • You may also send the download link to any email address by clicking on "Email" button.

Powered By

 GroupDocs.Parser for .NET API

Powerful document parser .NET library & API.
Build your own multi-platform document and file parser with our .NET library.

More Platforms

 GroupDocs.Parser for Java

Java version of GroupDocs.Parser library & API.
Parse documents and extract data from documents in Java!

 GroupDocs.Parser Cloud

Easy to use cross-platform Cloud solution.
Parse documents using Cloud SDKs for popular programming languages or communicate with REST API using cURL!

Open-Source UI/UX Solutions

 GroupDocs.Parser for .NET and Java

Integrate document parser in your apps using our out-of-the-box open-source front-end solutions based on Angular and GroupDocs.Parser for .NET/Java.
UI/UX solutions can be run as standalone application or can be integrated in any .NET/Java application, download and build your parser solution within few clicks!

 GroupDocs.Total for .NET and Java

GroupDocs.Total is an open-source UI/UX solution where all GroupDocs products are working together as one.
GroupDocs.Total provides multiple high quality features against over 120 document formats, such as conversion, signature, parser and much more!

Other Parser file formats

You can also parse many other file formats. Please see the complete list below.

ZIP PARSER (Zipped File)
EPUB PARSER (Open eBook File)
FB2 PARSER (FictionBook eBook)
CHM PARSER (Compiled HTML Help File)
MSG PARSER (Outlook Mail Message)
EML PARSER (E-Mail Message)
EMLX PARSER (Apple Mail Message)
PST PARSER (Outlook Personal Information Store File)
OST PARSER (Outlook Offline Data File)
HTM PARSER (Hypertext Markup Language File)
HTML PARSER (Hypertext Markup Language File)
MHT PARSER (MHTML Web Archive)
MHTML PARSER (MIME HTML File)
XML PARSER (XML File)
MD PARSER (Markdown Files)
XML PARSER (Excel 2003 XML (SpreadsheetML))
ONE PARSER (OneNote Document)
PDF PARSER (Portable Document Format File)
PPT PARSER (PowerPoint Presentation)
PPTX PARSER (PowerPoint Open XML Presentation)
PPS PARSER (PowerPoint Slide Show)
PPSX PARSER (PowerPoint Open XML Slide Show)
ODP PARSER (OpenDocument Presentation)
POT PARSER (PowerPoint Template)
PPTM PARSER (PowerPoint Open XML Macro-Enabled Presentation)
POTX PARSER (PowerPoint Open XML Presentation Template)
POTM PARSER (PowerPoint Open XML Macro-Enabled Presentation Template)
PPSM PARSER (PowerPoint Open XML Macro-Enabled Slide)
OTP PARSER (OpenDocument Presentation Template)
XLS PARSER (Excel Spreadsheet)
XLT PARSER (Microsoft Excel Template)
XLTX PARSER (Excel Open XML Spreadsheet Template)
XLSX PARSER (Microsoft Excel Open XML Spreadsheet)
XLSM PARSER (Excel Open XML Macro-Enabled Spreadsheet)
XLSB PARSER (Excel Binary Spreadsheet)
XLAM PARSER (Microsoft Excel Add-in)
XLTM PARSER (Microsoft Excel Macro-Enabled Template)
CSV PARSER (Comma Separated Values File)
TSV PARSER (Tab Separated Values File)
ODS PARSER (OpenDocument Spreadsheet)
OTS PARSER (OpenDocument Spreadsheet Template)
DOC PARSER (Microsoft Word Document)
DOCX PARSER (Microsoft Word Open XML Document)
DOCM PARSER (Word Open XML Macro-Enabled Document)
DOT PARSER (Word Document Template)
DOTX PARSER (Word Open XML Document Template)
DOTM PARSER (Word Open XML Macro-Enabled Document Template)
RTF PARSER (Rich Text Format File)
TXT PARSER (Plain Text File)
ODT PARSER (OpenDocument Text Document)
OTT PARSER (OpenDocument Document Template)
NUMBERS PARSER (Apple numbers)