HexaPDF 0.9.0 - Document Layout Published on

With the ground work for document layout management done in the 0.8.0 release the focus was shifted to the refinement of these features and to the actual document layout functionality.

The major changes for this release are the document composition functionality, incremental PDF writing, a CLI command for splitting PDF files and compatibility with Ruby 2.6.

Document Composition

One of the goals of HexaPDF is to provide high-level functionality for document composition. With this release the initial working version of this goal has been achieved with the new HexaPDF::Composer class.

This class uses the functionality introduced in the last release (e.g. frames, text boxes, …) to make creating a PDF document as easy as it is with other libraries, for example Prawn. Here is a simple example:

require 'hexapdf'

HexaPDF::Composer.create('hello-world.pdf') do |pdf|
  pdf.text("Hello World!", font_size: 50, align: :center, valign: :center)

Text (or more generally every box) is layed out from top to bottom and can be flown around objects that have been placed on the page before. This makes it easy, for example, to flow text around images (see the new composer example). Also, if some box doesn’t fit on a page or can’t be split, a new page is automatically created.

Arbitrary drawing operations can still be performed by using the HexaPDF::Content::Canvas object that is provided by the composer.

Incremental Writing

Starting with this version HexaPDF supports incremental writing. Writing a PDF document in incremental mode means that the new or changed content is just appended to the original PDF. This is used, for example, if the original document was cryptographically signed so as to not invalidate the signature.

Incremental writing in HexaPDF is not perfect in the sense that it doesn’t completely minimize the amount of data that gets written. The reason for this is HexaPDF’s automatic conversion of hash values. For example, PDF dates (which are stored as strings) are automatically converted to Ruby date objects on access, making the comparison fail even though there are no differences when serializing.

Splitting PDF Files

The hexapdf command line application already has a command for merging files but the reverse was missing. So this version brings the split command that can do exactly that.

As an example, consider the following: hexapdf split input.pdf out_%02d.pdf. This would split the input.pdf file page by page and generate files of the form out_01.pdf, out_02.pdf and so on.

Other Changes

There were some other changes and bug fixes, the most noteworthy are:

  • Usage of some non-described stdlib behaviour was fixed to make HexaPDF compatible with Ruby 2.6.

  • Text boxes now respect width/height/padding/border when fitting.

  • Variable width line wrapping now correctly considers line spacing when determining line width.

As always, have a look at the changelog for an overview of all changes.

And Happy New Year!