Extract Images from Documents using Java

Today, we will learn how to extract images programmatically from PDF, Excel, PowerPoint, and Word documents using Java. For the extraction, we will use GroupDocs.Parser for Java. This Java API parses documents and extracts images, text, and metadata from word-processing documents, spreadsheets, presentations, archives, and email documents. Extracted images can be saved in BMP, GIF, JPEG, PNG, and WebP formats.

The following topics are covered in this article:
  • Image Extraction Java API
  • Image Extraction from PDF documents in Java
  • Extract Images from Word, Excel, PowerPoint documents in Java
  • Extract Image from Specific Page in Java
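To give a feel for what image extraction involves, here is a minimal sketch that uses only the Java standard library, not the GroupDocs.Parser API. It relies on the fact that DOCX, XLSX, and PPTX files are ZIP packages that keep embedded pictures under `word/media/`, `xl/media/`, and `ppt/media/`. The class and method names (`OoxmlImageExtractor`, `extractImages`) are hypothetical, the code assumes Java 9 or later, and it does not handle PDF or the older binary DOC/XLS/PPT formats.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Standard-library sketch only; it does not use GroupDocs.Parser.
class OoxmlImageExtractor {

    /** Returns the embedded media files, keyed by their path inside the package. */
    static Map<String, byte[]> extractImages(InputStream document) throws IOException {
        Map<String, byte[]> images = new LinkedHashMap<>();
        try (ZipInputStream zip = new ZipInputStream(document)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!entry.isDirectory() && isMediaEntry(entry.getName())) {
                    // readAllBytes() stops at the end of the current ZIP entry.
                    images.put(entry.getName(), zip.readAllBytes());
                }
            }
        }
        return images;
    }

    /** True for entries in the media folders used by Word, Excel, and PowerPoint packages. */
    static boolean isMediaEntry(String name) {
        return name.startsWith("word/media/")
                || name.startsWith("xl/media/")
                || name.startsWith("ppt/media/");
    }

    // Usage: java OoxmlImageExtractor report.docx
    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java OoxmlImageExtractor <file.docx|file.xlsx|file.pptx>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            extractImages(in).forEach((name, bytes) ->
                    System.out.println(name + " (" + bytes.length + " bytes)"));
        }
    }
}
```

A dedicated parsing library is still the practical choice when images must come from PDFs, from a specific page, or be converted to another format on save.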
Posted in GroupDocs.Parser Product Family

Insert OLE Objects in Word, Excel, PowerPoint using Java

Today, we will learn to embed PDF and other documents as OLE objects in Word, Excel, and PowerPoint files using Java. To embed documents via Object Linking and Embedding (OLE), we will use the GroupDocs.Merger for Java API, which also lets us combine, merge, and split multiple documents with only a few lines of Java code.

The steps shown here will guide you to:
  • Insert a PDF as an OLE object into an MS Word document.
  • Embed a Word document as an OLE object into an Excel spreadsheet.
  • Insert Excel sheets as OLE objects into presentations.
Posted in GroupDocs.Merger Product Family

Verify Digital Signature in Documents using Java

In this article, we will learn to programmatically verify digitally signed documents using Java. The example uses a PDF document for the verification; however, you can also verify digitally signed word-processing documents (MS Word DOC/DOCX), Excel spreadsheets (XLS/XLSX), and presentations (PPT/PPTX).
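As background, the check at the heart of any digital signature verification can be sketched with the standard `java.security` classes. This is not the GroupDocs.Signature API and it does not read signatures out of a PDF or Office file; it only shows, on raw bytes, that a signature validates against the original content and fails once the content changes. The class name `SignatureCheck`, the freshly generated RSA key pair, and the `SHA256withRSA` algorithm are assumptions chosen for illustration.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.security.KeyPair;
import java.security.KeyPairGenerator;
import java.security.PrivateKey;
import java.security.PublicKey;
import java.security.Signature;

// Standard-library sketch; it does not use GroupDocs.Signature.
class SignatureCheck {

    /** Signs the content with the private key (SHA-256 digest, RSA signature). */
    static byte[] sign(byte[] content, PrivateKey key) throws GeneralSecurityException {
        Signature signer = Signature.getInstance("SHA256withRSA");
        signer.initSign(key);
        signer.update(content);
        return signer.sign();
    }

    /** Returns true only if the signature matches the content and the public key. */
    static boolean verify(byte[] content, byte[] signature, PublicKey key)
            throws GeneralSecurityException {
        Signature verifier = Signature.getInstance("SHA256withRSA");
        verifier.initVerify(key);
        verifier.update(content);
        return verifier.verify(signature);
    }

    public static void main(String[] args) throws GeneralSecurityException {
        // A throwaway key pair; real documents carry a certificate from the signer.
        KeyPairGenerator generator = KeyPairGenerator.getInstance("RSA");
        generator.initialize(2048);
        KeyPair pair = generator.generateKeyPair();

        byte[] original = "contract v1".getBytes(StandardCharsets.UTF_8);
        byte[] tampered = "contract v2".getBytes(StandardCharsets.UTF_8);
        byte[] signature = sign(original, pair.getPrivate());

        System.out.println("original valid: " + verify(original, signature, pair.getPublic()));
        System.out.println("tampered valid: " + verify(tampered, signature, pair.getPublic()));
    }
}
```

Running `main` prints `original valid: true` followed by `tampered valid: false`. Verifying a signed document adds the format-specific work of locating the signature and the signed byte ranges inside the file, which is what a document signature library takes care of.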

Posted in GroupDocs.Signature Product Family

Add Watermark to Images in Java

In this article, we will learn to add text and image watermarks to images using Java. There are two ways to watermark an image: add a watermark with personalized text, or overlay an image watermark on the source image. We will see both scenarios. In addition to JPG and PNG, this Java API currently supports the BMP, GIF, JP2, TIFF, and WebP image formats for watermarking. We can also change the style, orientation, and appearance of the watermark text.
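The image-watermark scenario can be sketched with plain Java 2D, without the GroupDocs.Watermark API. The sketch below copies the source image and draws the watermark over it with partial opacity using `AlphaComposite`. The class name `ImageWatermark`, the `overlay` method, the 10-pixel margin, and the 50% opacity are all hypothetical choices for illustration, and only formats that `ImageIO` can read on your JRE are supported.

```java
import java.awt.AlphaComposite;
import java.awt.Graphics2D;
import java.awt.image.BufferedImage;
import java.io.File;
import java.io.IOException;
import javax.imageio.ImageIO;

// Standard-library sketch; it does not use the GroupDocs.Watermark API.
class ImageWatermark {

    /** Draws the watermark over a copy of the source at (x, y) with the given opacity (0.0 to 1.0). */
    static BufferedImage overlay(BufferedImage source, BufferedImage watermark,
                                 int x, int y, float opacity) {
        BufferedImage result = new BufferedImage(
                source.getWidth(), source.getHeight(), BufferedImage.TYPE_INT_ARGB);
        Graphics2D g = result.createGraphics();
        try {
            g.drawImage(source, 0, 0, null);
            // SRC_OVER blends the watermark with what is already drawn.
            g.setComposite(AlphaComposite.getInstance(AlphaComposite.SRC_OVER, opacity));
            g.drawImage(watermark, x, y, null);
        } finally {
            g.dispose();
        }
        return result;
    }

    // Usage: java ImageWatermark photo.png logo.png watermarked.png
    public static void main(String[] args) throws IOException {
        if (args.length != 3) {
            System.err.println("usage: java ImageWatermark <source> <watermark> <output.png>");
            return;
        }
        BufferedImage source = ImageIO.read(new File(args[0]));
        BufferedImage mark = ImageIO.read(new File(args[1]));
        if (source == null || mark == null) {
            System.err.println("unsupported image format");
            return;
        }
        // Place the watermark near the bottom-right corner with a 10-pixel margin.
        int x = Math.max(0, source.getWidth() - mark.getWidth() - 10);
        int y = Math.max(0, source.getHeight() - mark.getHeight() - 10);
        ImageIO.write(overlay(source, mark, x, y, 0.5f), "png", new File(args[2]));
    }
}
```

The result is written as PNG because the blended image carries an alpha channel. A text watermark follows the same pattern with `Graphics2D.drawString` in place of the second `drawImage` call.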

Posted in GroupDocs.Watermark Product Family

Convert MSG and EML files to PDF in Java

Email-to-PDF conversion is often required for reference or for sharing email content. In this article, we will learn how to convert email message files such as MSG and EML to PDF using Java. In an earlier blog post, we learned to convert MSG and EML files using C# in a .NET application. This will help automate email conversion within desktop or web applications.

The following are the … Continue Reading

Posted in GroupDocs.Conversion Product Family

Merge PDF, Word Documents, Spreadsheets, Presentation Files in C#

Today, we will learn to programmatically merge PDF files, Word documents, spreadsheets, and presentations using C#. In an earlier post, we saw how to merge and split documents using Java.

This article will also show you the code examples regarding:

I will be using GroupDocs.Merger for .NET in all the examples below. Before you proceed, you may get the API from either of the following options: … Continue Reading

Posted in GroupDocs.Merger Product Family

Compare Text, Word, and PDF Files with Java Difference Library

After going through this article, we will be able to compare text files, Word files, PDF files, and other documents in Java-based applications. Using this feature, we can compare invoices, contracts, presentations, AutoCAD designs, price lists, or programming files. We will also be able to highlight the identified changes and either accept or reject each change. We can even build our own document comparison tool similar to the one launched by GroupDocs, using the document … Continue Reading
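For plain text files, the idea of identifying changes can be illustrated with a small line-based diff built on the longest common subsequence (LCS). This is a generic textbook technique written against the Java standard library, not the algorithm or API of GroupDocs.Comparison, and the class name `LineDiff` is hypothetical. Unchanged lines are prefixed with two spaces, removed lines with `- `, and added lines with `+ `.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Standard-library sketch of a line diff; not the GroupDocs.Comparison API.
class LineDiff {

    /** Returns the old and new lines merged into one list, marked as kept, removed, or added. */
    static List<String> diff(List<String> oldLines, List<String> newLines) {
        int n = oldLines.size();
        int m = newLines.size();

        // lcs[i][j] = length of the longest common subsequence of
        // oldLines[i..] and newLines[j..], filled from the end backwards.
        int[][] lcs = new int[n + 1][m + 1];
        for (int i = n - 1; i >= 0; i--) {
            for (int j = m - 1; j >= 0; j--) {
                lcs[i][j] = oldLines.get(i).equals(newLines.get(j))
                        ? lcs[i + 1][j + 1] + 1
                        : Math.max(lcs[i + 1][j], lcs[i][j + 1]);
            }
        }

        // Walk both lists, following the table to keep as many common lines as possible.
        List<String> result = new ArrayList<>();
        int i = 0;
        int j = 0;
        while (i < n && j < m) {
            if (oldLines.get(i).equals(newLines.get(j))) {
                result.add("  " + oldLines.get(i));
                i++;
                j++;
            } else if (lcs[i + 1][j] >= lcs[i][j + 1]) {
                result.add("- " + oldLines.get(i++));
            } else {
                result.add("+ " + newLines.get(j++));
            }
        }
        while (i < n) {
            result.add("- " + oldLines.get(i++));
        }
        while (j < m) {
            result.add("+ " + newLines.get(j++));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> before = Arrays.asList("Invoice 42", "Total: 100", "Due: May 1");
        List<String> after = Arrays.asList("Invoice 42", "Total: 120", "Due: May 1");
        diff(before, after).forEach(System.out::println);
    }
}
```

Accepting a change then means keeping the `+` line and dropping the matching `-` line, while rejecting it does the opposite. Formatted documents such as Word or PDF files need a format-aware comparison, since their visible text is only part of what can differ.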

Posted in GroupDocs.Comparison Product Family

Convert WebP to JPG, PNG, TIFF, and PDF in C#

In our previous post, we discussed WebP images and learned to convert WebP images in Java. Today, we will learn to programmatically convert WebP images into JPG, PNG, TIFF, and other formats using C#.

First, we will convert WebP images in the simplest way. Later, we will convert with customized options such as tilt, flip, grayscale, resize, and gamma, contrast, and brightness adjustments, and add a watermark to the converted JPG images. Following … Continue Reading

Posted in GroupDocs.Conversion Product Family

Classify your Customer Feedback using Sentiment Analysis in C#

Suppose that you receive comments or reviews from your customers or some other source and want to evaluate how positive they are. Sentiment analysis is a way to analyze such comments. This post focuses on a sentiment analysis tool for C# that is based on a deep neural network model. This model is suitable for a wide range of tasks.

Sentiment Analysis API for .NET

If you want to do sentiment analysis programmatically, GroupDocs.Classification … Continue Reading

Posted in GroupDocs.Classification Product Family

Search Text in Word, Excel, PDF, ZIP and other Document Formats using C# .NET

We often need a full-text search API that lets our applications search documents for particular information specified as a textual search query. The documents can be of any format, such as Word (DOC, DOCX), PDF, HTML, EPUB, spreadsheets (XLS, XLSX), presentations (PPT, PPTX), images, and videos.

GroupDocs.Search is a powerful full-text search API that allows you to search through over 70 document formats in your applications. To make it possible to search instantly across thousands of documents, they … Continue Reading

Posted in GroupDocs.Search Product Family