Category Archive: GroupDocs.Parser Product Family

Official blog with announcements of latest supported features, hot fixes, technical articles, tips and videos of GroupDocs.Text – A text extraction API for .NET.

Extract Text from Databases using GroupDocs.Parser for .NET 18.9

Posted on September 13, 2018 by Usman Aziz

GroupDocs.Parser for .NET 18.9 has been released! The latest version allows you to extract text from the databases. You can also extract data from the form fields in a PDF document. Please continue to read for more details on the features introduced in v18.9.

Features Introduced

Extracting Text from Databases

You can now extract text from the databases. To extract text from databases DbContainer class is used that implements IContainer interface. Each data table is represented by the entity. The… Continue Reading

Text Analysis API for Spreadsheets, Presentations and Text Documents – GroupDocs.Parser for .NET 18.8

Posted on August 20, 2018 by Usman Aziz

We are pleased to announce the release of version 18.8 of GroupDocs.Parser for .NET. In this version, we have extended the support of text analysis API for spreadsheets, presentations and text documents. Furthermore, the latest version allows providing a password for protected documents on-demand. We’d recommend you to use the latest version of the API and share your feedback.

Features Introduced

Text Analysis API

GroupDocs.Parser allows extracting text areas from the pages of a document. This feature may help… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, Dcoument Parsing API, Document Parser, GroupDocs.Parser for .NET Releases, Text Extraction API for .NET

Releasing GroupDocs.Parser for Java – A Convenient Document Parser API

Posted on July 10, 2018 by Usman Aziz

We are pleased to announce that the first version of GroupDocs.Parser for Java has been released. GroupDocs.Parser for Java allows the Java developers to extract raw and formatted text from the popular document formats. The API also supports working with containers such as ZIP and email containers. You can also access the metadata attached to the documents using a few lines of code. Please continue to read more about the features and the file formats supported by the API.

Supported

Posted in GroupDocs.Parser Product Family | Tagged Document Parser, GroupDocs.Parser for Java, GroupDocs.Parser for Java Releases, Java Text Extractor, Java text extractor API, text extractor API for Java, text parser

Extract Text Areas from Document Pages using GroupDocs.Parser for .NET 18.7

Posted on July 10, 2018 by Usman Aziz

Today, we are excited to announce the release of version 18.7 of GroupDocs.Parser for .NET. The latest release supports extracting text areas from document pages. This feature may help you getting data for text analysis. We recommend you to upgrade the API to the latest version and share your valuable feedback.

Extracting Text Areas

Extracting text areas is useful when you need to get the data for text analysis. To extract text areas, text extractors implement their own internal… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, Document Parser, Extract Text, GroupDocs.Parser for .NET, GroupDocs.Parser for .NET Releases, Text Extraction API for .NET, Text extractor API, text parser

Upcoming Release of GroupDocs.Parser for Java

Posted on June 20, 2018 by Usman Aziz

We are excited to announce that GroupDocs.Parser is coming soon to Java platform as GroupDocs.Parser for Java. It will be an easy to use back-end API that will permit the users to extract raw and formatted text from the supported document formats. Besides, it will also allow the users to extract the metadata from the popular document formats. GroupDocs.Parser for Java will soon be available for download.

Salient Features of GroupDocs.Parser for Java

GroupDocs.Parser for Java will come with all… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged Document Parser, GroupDocs.Parser for Java, GroupDocs.Parser for Java Releases, java, Java API, Java Text Extractor, Text Extractor, Text extractor API, text parser

GroupDocs.Text for .NET has been Renamed to GroupDocs.Parser for .NET

Posted on May 30, 2018 by Usman Aziz

We are pleased to announce that GroupDocs.Text for .NET has been renamed to GroupDocs.Parser for .NET. We have published the first monthly release as GroupDocs.Parser for .NET 18.5 after renaming of the API. The latest release has come with a few changes and a couple of enhancements. Please continue to read about the changes and the enhancements that we have made in version 18.5.

Important to Know

It is important to inform you that the renaming of the API… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, Dcoument Parsing API, Document Parser, GroupDocs.Parser for .NET, GroupDocs.Parser for .NET Releases, Text Extraction API for .NET, Text Extractor

Extract TOC from EPUB Documents using GroupDocs.Text for .NET 18.4

Posted on April 18, 2018 by Usman Aziz

It gives us immense pleasure to announce the release of version 18.4 of GroupDocs.Text for .NET. The latest version allows extracting the table of contents from the EPUB documents. Furthermore, we have added the feature of detecting media type of .one file. Following sections provide details about the newly added features.

Extracting TOC from EPUB Documents

Using version 18.4, you can now extract TOC from the EPUB documents. To access the TOC, TableOfContents property of EpubPackage class is used. Once… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, Extract Text, GroupDocs.Text for .NET Releases, Text Extraction API for .NET, Text Extractor

Extract Formatted Text from CHM Documents using GroupDocs.Text for .NET 18.3

Posted on March 29, 2018 by Usman Aziz

We keep looking forward to bringing you more features and therefore, we have released version 18.3 of GroupDocs.Text for .NET providing the support of extracting formatted text from CHM documents. The latest version also allows you to extract text by pages and extract table of content from CHM documents. The following sections will provide you the details about the new features of the API.

Extracting Formatted Text from CHM Documents

GroupDocs.Text provides a couple of ways to extract formatted text… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, Extract Text, GroupDocs.Text for .NET Releases, Text Extraction API for .NET, Text Extractor

Extract Text from Markdown Documents using GroupDocs.Text for .NET 18.2

Posted on February 12, 2018 by Usman Aziz

We are pleased to announce that we have released another version of GroupDocs.Text for .NET API providing the support of text extraction from Markdown documents. Using GroupDocs.Text for .NET 18.2, you can extract raw text, formatted text and the structured text from a Markdown document. Following sections will provide you the details about the new features of the API.

Extracting Raw Text from Markdown Documents

Since version 18.2, GroupDocs.Text allows you to extract a line of characters as well as… Continue Reading

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, GroupDocs.Text for .NET Releases, Text Extraction API for .NET

Retrieving ZIP Container’s Entity by Full Name using GroupDocs.Text for .NET 17.12

Posted on December 27, 2017 by Usman Aziz

We are excited to announce that we have released version 17.12 of GroupDocs.Text for .NET API. In this version, we have introduced a simplified way of extracting text as well as formatted text using a simple interface. We have also extended the support of IPageTextExtractor interface for OneNote documents. Furthermore, the latest version also allows you to retrieve an entity by its name from ZIP container. Please continue to read for more details.

GroupDocs.Text for .NET API – New Features

Posted in GroupDocs.Parser Product Family | Tagged .NET Text Extractor, GroupDocs.Text for .NET Releases, Text Extraction API for .NET

Category Archive: GroupDocs.Parser Product Family

Extract Text from Databases using GroupDocs.Parser for .NET 18.9

Features Introduced

Extracting Text from Databases

Text Analysis API for Spreadsheets, Presentations and Text Documents – GroupDocs.Parser for .NET 18.8

Features Introduced

Text Analysis API

Releasing GroupDocs.Parser for Java – A Convenient Document Parser API

Supported

Extract Text Areas from Document Pages using GroupDocs.Parser for .NET 18.7

Extracting Text Areas

Upcoming Release of GroupDocs.Parser for Java

Salient Features of GroupDocs.Parser for Java

GroupDocs.Text for .NET has been Renamed to GroupDocs.Parser for .NET

Important to Know

Extract TOC from EPUB Documents using GroupDocs.Text for .NET 18.4

Extracting TOC from EPUB Documents

Extract Formatted Text from CHM Documents using GroupDocs.Text for .NET 18.3

Extracting Formatted Text from CHM Documents

Extract Text from Markdown Documents using GroupDocs.Text for .NET 18.2

Extracting Raw Text from Markdown Documents

Retrieving ZIP Container’s Entity by Full Name using GroupDocs.Text for .NET 17.12

GroupDocs.Text for .NET API – New Features

Search

Follow Us

Categories