Discover GroupDocs.Parser free online app!

Hypertext Markup Language File

Files with .htm extension represent Hypertext Markup Language for creating web pages for display in web browsers such as Google Chrome, Internet Explorer, Firefox, and a number of others.

Read More

How to

How to extract text and metadata from HTM files

  • Click inside the file drop area to upload a HTM file or drag & drop a HTM file.
  • Click Get Text and Metadata button to extract text and metadata from your HTM document.
  • Once your HTM document is parsed click on Download Now button.
  • You may also send the download link to any email address by clicking on Email button.

Powered By

 GroupDocs.Parser for .NET API

Powerful document parser .NET library & API.
Build your own multi-platform document and file parser with our .NET library.

More Platforms

 GroupDocs.Parser for Java

Java version of GroupDocs.Parser library & API.
Parse documents and extract data from documents in Java!

 GroupDocs.Parser Cloud

Easy to use cross-platform Cloud solution.
Parse documents using Cloud SDKs for popular programming languages or communicate with REST API using cURL!

Open-Source UI/UX Solutions

 GroupDocs.Parser for .NET and Java

Integrate document parser in your apps using our out-of-the-box open-source front-end solutions based on Angular and GroupDocs.Parser for .NET/Java.
UI/UX solutions can be run as standalone application or can be integrated in any .NET/Java application, download and build your parser solution within few clicks!

 GroupDocs.Total for .NET and Java

GroupDocs.Total is an open-source UI/UX solution where all GroupDocs products are working together as one.
GroupDocs.Total provides multiple high quality features against over 120 document formats, such as conversion, signature, parser and much more!

Other file formats supported by GroupDocs.Parser

You can also parse many other file formats. Please see the complete list below.

EPUB TEXT EXTRACTOR (Digital E-Book File Format)
CHM TEXT EXTRACTOR (Microsoft Compiled HTML)
MSG TEXT EXTRACTOR (Microsoft Outlook Email Format)
EML TEXT EXTRACTOR (E-Mail Message File)
EMLX TEXT EXTRACTOR (Apple Mail Message)
HTML TEXT EXTRACTOR (Hyper Text Markup Language)
XHTML TEXT EXTRACTOR (Extensible Hypertext Markup Language File)
MHT TEXT EXTRACTOR (MIME Encapsulation of Aggregate HTML)
MHTML TEXT EXTRACTOR (MIME Encapsulation of Aggregate HTML)
XML TEXT EXTRACTOR (Extended Markup Language)
ONE TEXT EXTRACTOR (Microsoft OneNote File Format)
PDF TEXT EXTRACTOR (Portable Document)
PPT TEXT EXTRACTOR (PowerPoint Presentation)
PPTX TEXT EXTRACTOR (PowerPoint Open XML Presentation)
PPS TEXT EXTRACTOR (Microsoft PowerPoint Slide Show)
PPSX TEXT EXTRACTOR (PowerPoint Open XML Slide Show)
ODP TEXT EXTRACTOR (OpenDocument Presentation File Format)
POT TEXT EXTRACTOR (PowerPoint Template)
PPTM TEXT EXTRACTOR (Microsoft PowerPoint Presentation)
POTX TEXT EXTRACTOR (Microsoft PowerPoint Open XML Template)
POTM TEXT EXTRACTOR (Microsoft PowerPoint Template)
PPSM TEXT EXTRACTOR (Microsoft PowerPoint Slide Show)
OTP TEXT EXTRACTOR (Origin Graph Template)
XLS TEXT EXTRACTOR (Microsoft Excel Binary File Format)
XLT TEXT EXTRACTOR (Microsoft Excel Template)
XLTX TEXT EXTRACTOR (Microsoft Excel Open XML Template)
XLSX TEXT EXTRACTOR (Microsoft Excel Open XML Spreadsheet)
XLSM TEXT EXTRACTOR (Microsoft Excel Macro-Enabled Spreadsheet)
XLSB TEXT EXTRACTOR (Microsoft Excel Binary Spreadsheet File)
XLAM TEXT EXTRACTOR (Microsoft Excel Macro-Enabled Add-In)
XLTM TEXT EXTRACTOR (Microsoft Excel Macro-Enabled Template)
CSV TEXT EXTRACTOR (Comma Separated Values File)
TSV TEXT EXTRACTOR (Tab Separated Values File)
ODS TEXT EXTRACTOR (Open Document Spreadsheet)
OTS TEXT EXTRACTOR (OpenDocument Spreadsheet Template)
DOC TEXT EXTRACTOR (Microsoft Word Document)
DOCX TEXT EXTRACTOR (Microsoft Word Open XML Document)
DOCM TEXT EXTRACTOR (Microsoft Word Macro-Enabled Document)
DOT TEXT EXTRACTOR (Microsoft Word Document Template)
DOTX TEXT EXTRACTOR (Word Open XML Document Template)
DOTM TEXT EXTRACTOR (Microsoft Word Macro-Enabled Template)
RTF TEXT EXTRACTOR (Rich Text File Format)
TXT TEXT EXTRACTOR (Plain Text File Format)
ODT TEXT EXTRACTOR (Open Document Text)
OTT TEXT EXTRACTOR (Open Document Template)
Viewer Annotation Conversion Comparison Signature Assembly Metadata Search Parser Watermark Editor Merger Redaction Classification Splitter Translation Unlock