XtremePDFConverter VCL v3
Intelligently convert PDF to easily editable Word documents
Compatibility
Delphi 7/2005, RAD Studio 2007/2009/2010

How To Perform PDF To Word Conversions In Delphi

About our new PDF to Word RTF conversion tool for Delphi programmers.
By V. Subhash

On February 25, we released XtremePDFConverter Version 3.0. In this article, you will learn all about this new PDF conversion VCL component.

Why XtremePDFConverter?

A typical PDF page looks rich in formatted content. Internally, though, PDF does not support formatting features such as paragraphs, tables, or sections. A PDF page contains individual strands of text, shapes, images, and other page elements placed on a 2-D surface. A PDF document has no information about the logical organization of content that is typically associated with documents created by word processing programs.

For example, a few closely placed lines of text on a PDF page may look like a paragraph to a human, but internally they are just a few lines of text with nothing connecting them. Empty space along the edges of a page may look like margins, but it is just a part of the PDF page that has no visible elements.
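The point about margins can be illustrated with a small sketch. It is not part of XtremePDFConverter; the `TTextRun` record, the `MakeRun` helper, and both margin functions are hypothetical names invented for this example. The sketch assumes that all we know about a page is where each run of text starts and how wide it is, and shows that a "margin" is only something we infer from those positions.

```pascal
program ApparentMargins;

{$APPTYPE CONSOLE}
{$IFDEF FPC}{$MODE DELPHI}{$ENDIF}

type
  // Hypothetical stand-in for one positioned run of text on a PDF page.
  TTextRun = record
    X: Double;      // left edge of the run, in points
    Width: Double;  // rendered width of the run, in points
    Text: string;
  end;

function MakeRun(X, Width: Double; const Text: string): TTextRun;
begin
  Result.X := X;
  Result.Width := Width;
  Result.Text := Text;
end;

// The "left margin" is simply the smallest X used by any run.
function ApparentLeftMargin(const Runs: array of TTextRun): Double;
var
  I: Integer;
begin
  Result := 0;
  for I := 0 to High(Runs) do
    if (I = 0) or (Runs[I].X < Result) then
      Result := Runs[I].X;
end;

// The "right margin" is whatever the widest-reaching run leaves unused.
function ApparentRightMargin(const Runs: array of TTextRun;
  PageWidth: Double): Double;
var
  I: Integer;
  RightEdge: Double;
begin
  RightEdge := 0;
  for I := 0 to High(Runs) do
    if Runs[I].X + Runs[I].Width > RightEdge then
      RightEdge := Runs[I].X + Runs[I].Width;
  Result := PageWidth - RightEdge;
end;

const
  LetterWidth = 612.0; // width of a US Letter page, in points

var
  Runs: array[0..2] of TTextRun;

begin
  // Three unrelated runs; nothing in the data says "margin" or "paragraph".
  Runs[0] := MakeRun(72, 400, 'First line of text');
  Runs[1] := MakeRun(72, 468, 'A longer second line of text');
  Runs[2] := MakeRun(72, 210, 'A short last line');

  Writeln('Apparent left margin:  ', ApparentLeftMargin(Runs):0:1, ' pt');
  Writeln('Apparent right margin: ', ApparentRightMargin(Runs, LetterWidth):0:1, ' pt');
end.
```

With the sample data, both apparent margins work out to 72.0 pt (one inch), even though no margin is stored anywhere in the data.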

PDF was built this way so that documents could retain absolute appearance fidelity at any resolution and on any device or platform. PDF is also a final-form document format and is not meant for documents whose content needs to be edited or reused.

However, real-world situations require that PDF documents be converted back to a more usable and editable format. That's where XtremePDFConverter comes in.

How XtremePDFConverter Works

XtremePDFConverter uses an artificial intelligence-based approach to convert PDF to free-flowing, easily editable Word RTF documents. If required, your code can intercept XtremePDFConverter's conversion events and interact with the conversion process.

XtremePDFConverter parses PDF page elements, tests them against certain criteria, decides whether those page elements could represent a paragraph or a table or a list or something else, and then applies appropriate conversion rules. The resultant document is much more usable and editable than the input PDF document.

Many products on the market simply convert each textout (an individually positioned run of text on a PDF page) to a separate text box in the output document. That is easy. XtremePDFConverter instead analyzes the page and can detect multiple textouts as a single paragraph, a table cell, or a numbered list item. This is where XtremePDFConverter scores over the competition.
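To make the idea of merging textouts concrete, here is a minimal sketch of one simple heuristic: consecutive lines whose baselines are close together are treated as one paragraph. This is not XtremePDFConverter's actual algorithm, which is not described in this article. The `TTextOut` record, the `GroupIntoParagraphs` function, and the `MaxLeading` threshold are all assumptions made for illustration, and the input is assumed to be sorted from the top of the page to the bottom.

```pascal
program GroupTextOuts;

{$APPTYPE CONSOLE}
{$IFDEF FPC}{$MODE DELPHI}{$ENDIF}

type
  // Hypothetical stand-in for one line of text drawn on a PDF page.
  TTextOut = record
    Y: Double;         // baseline in PDF coordinates (Y grows upward)
    FontSize: Double;  // font size in points
    Text: string;
  end;
  TParagraphs = array of string;

function MakeTextOut(Y, FontSize: Double; const Text: string): TTextOut;
begin
  Result.Y := Y;
  Result.FontSize := FontSize;
  Result.Text := Text;
end;

// Merges consecutive lines into one paragraph when the gap between their
// baselines is at most MaxLeading times the font size of the lower line.
// Lines must already be sorted from top to bottom.
function GroupIntoParagraphs(const Lines: array of TTextOut;
  MaxLeading: Double): TParagraphs;
var
  I, Count: Integer;
  Gap: Double;
begin
  SetLength(Result, 0);
  Count := 0;
  for I := 0 to High(Lines) do
  begin
    if I > 0 then
      Gap := Lines[I - 1].Y - Lines[I].Y
    else
      Gap := 0;

    if (I > 0) and (Gap <= MaxLeading * Lines[I].FontSize) then
      // Close enough to the previous line: continue the current paragraph.
      Result[Count - 1] := Result[Count - 1] + ' ' + Lines[I].Text
    else
    begin
      // First line, or a large vertical gap: start a new paragraph.
      Inc(Count);
      SetLength(Result, Count);
      Result[Count - 1] := Lines[I].Text;
    end;
  end;
end;

var
  Page: array[0..3] of TTextOut;
  Paragraphs: TParagraphs;
  I: Integer;

begin
  Page[0] := MakeTextOut(700, 12, 'PDF stores text as');
  Page[1] := MakeTextOut(686, 12, 'separate lines that only');
  Page[2] := MakeTextOut(672, 12, 'look like a paragraph.');
  Page[3] := MakeTextOut(640, 12, 'A new paragraph starts here.');

  Paragraphs := GroupIntoParagraphs(Page, 1.5);
  for I := 0 to High(Paragraphs) do
    Writeln('Paragraph ', I + 1, ': ', Paragraphs[I]);
end.
```

With the sample data, the first three lines are 14 pt apart and merge into one paragraph, while the 32 pt gap before the fourth line starts a second one. A real converter would also have to consider horizontal alignment, fonts, columns, and table structure; this sketch looks only at vertical spacing.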

XtremePDFConverter Features

XtremePDFConverter VCL can detect and convert the following PDF page elements:

XtremePDFConverter has methods and properties for:

XtremePDFConverter provides the following run-time events:

Sample Code

program Load_n_Convert;

{$APPTYPE CONSOLE}

uses
  SysUtils,
  gtPDFConverter;

var
  gtPDFConverter1: TgtPDFConverter;

begin
  gtPDFConverter1 := TgtPDFConverter.Create(nil);
  try
    try
      // Specify input file
      gtPDFConverter1.InputFileName := 'input_doc.pdf';
      // Specify output file
      gtPDFConverter1.OutputFileName := 'output_doc.rtf';

      // Specify page range for conversion
      gtPDFConverter1.OutputSettings.PageRange := '3-4';

      Writeln('Beginning to convert... Wait.');
      // Convert the input file
      gtPDFConverter1.Convert();
      Writeln('Converted successfully.');
    except
      // Report any error raised while setting up or converting
      on Err: Exception do
        Writeln('Sorry, an exception was raised. ' + Err.ClassName + ': '
                + Err.Message);
    end;
  finally
    // Release the converter whether or not the conversion succeeded
    FreeAndNil(gtPDFConverter1);
  end;

  Writeln('Press Enter to exit.');
  Readln;
end.

Performance Test

For this article, we used a PDF document that would adequately demonstrate all the features of XtremePDFConverter. First, we created a Word DOC file in Microsoft Word. Next, we printed it to PDF using Print2eDoc. This output PDF document was then used as input with an XtremePDFConverter demo application.

Here is a screenshot of page 1 of the input PDF document, as displayed in our Free PDF Reader desktop application.

This is page 2 of the PDF document.

Here is page 1 of the output Word RTF document.

And, here is the second page of the output document.

Buy XtremePDFConverter VCL

You can buy XtremePDFConverter individually or get it as part of our all-Delphi-tools-included XtremeDevSystem Delphi Universal Subscription. If you buy the subscription, source code is included.

---o0O0o---
