“Tiny RTF Viewer” 2.5 using ScroogeXHTML RTF to HTML5 converter

Habarisoft released Tiny RTF Viewer 2.5 for Android. This small viewer app converts RTF documents (which can be stored locally or accessed by choosing a hyperlink in a web browser) to HTML5, and displays them in the internal web browser.
Google play

For the internal conversion from Rich Text Format (RTF) to HTML5, the app uses the ScroogeXHTML library from Habarisoft. More information and an on-line demo of the converter library are available at https://www.scroogexhtml.com/


Android is a trademark of Google Inc. ♦ Google Play is a trademark of Google Inc.


ScroogeXHTML for the Java™ platform 6.0: new features

The upcoming 6.0 release of ScroogeXHTML for the Java™ platform introduces useful new features. Here is a short overview:

Embedding images with HTML Data URI scheme

The traditional MemoryPictureAdapter class in ScroogeXHTML for the Java platform generates image link elements which point to a resource location <img src=”…”>. This keeps the document small, but requires making the image resources accessible for the web browser at the given location.

In some cases however, it is useful to embed the whole image in-line in the web page as if they were external resources.

The new MemoryPictureAdapterBase64 class returns Data URIs for small JPEG and PNG images. By default, the size threshold is set to 32 kB.

Usage example:

scrooge = new ScroogeXHTML();
PictureAdapter adapter = new MemoryPictureAdapterBase64();

The new class inherits from the old MemoryPictureAdapter, and will return the inherited result for images which exceed the size limit.

Data URIs are fully supported by most major browsers, and partially supported in Internet Explorer and Microsoft Edge.

Event listeners for DOM post processing

The converter internally uses a XML DOM tree to create the HTML document structure. Before converting the DOM to the result HTML5 String, the converter calls a sequence of post processing handlers, which apply optimizations and custom modifications on the DOM tree. Post processing handlers must implement the PostProcessListener interface.

The converter stores the event handlers in its PostProcessListeners property which is a list of PostProcessListener implementations. By default, the converter library creates and assigns post process handlers to perform these tasks

  • strip empty (whitespace-only) text nodes
  • strip empty span nodes
  • strip attribute-less span nodes
  • replace empty paragraph (<p>) nodes with <br> nodes

These default PostProcessListener implementations are located in the com.habarisoft.scroogexhtml.tidy package and use XPath to perform the DOM modification (see Stack Overflow example code).

Application code may create and add more post process listeners as needed.

On-line demo

The new demo page https://www.scroogexhtml.com/sxd6snap/ allows to upload and convert RTF files.


ScroogeXHTML for the Java™ platform 6.0 – RTF to HTML5 converter preview

Habarisoft released the first public preview of version 6.0 of its RTF to HTML5 converter library, ScroogeXHTML for the Java™ platform.

Major changes

  • only HTML5 and XHTML will be supported  – for other markup language versions, ScroogeXHTML 5.X is still available
  • new DOM based post processing event listeners
  • new support for embedded JPEG and PNG images with HTML Data URI scheme
  • improved support for RTF tables
  • improved support for monospace fonts
  • improved conversion of footnote
  • improved support for listtable (paragraph numbering)

Continue reading

ScroogeXHTML for the Java™ platform 5.5 – RTF to HTML5 converter

Habarisoft released version 5.5 of its RTF to HTML5 converter library, ScroogeXHTML for the Java™ platform. This is mainly a maintenance release.

Important changes

  • The converter does no longer generate span tags without attributes <span>text<span>
  • Fixed incorrect values of the ISO 8601 time stamp in the meta date header
  • Improved handling of empty font names, the converter now uses the specified default font instead of the ‘first’ font in the font replace list

Continue reading