Intro to eBooks for Journalists and News Publishers

“Ebook” is shorthand for at least three different file formats:

  • PDF (you’re likely familiar with this one). It’s been around for 10+ years, and almost all devices and browsers can render a PDF. Publishers have a great deal of presentation control in a PDF, but PDF renderers on mobile devices aren’t very sophisticated, which makes readability questionable.
  • ePub – that’s essentially a compressed folder of HTML and CSS files. It’s preferred by Apple, Barnes & Noble, and most everyone else except Amazon.
  • .mobi – Amazon’s file format, which was previously a non-human-readable binary file but, in its latest version (‘Kindle Format 8’), is very comparable to ePub 3.

In many ways you can think of ePub and .mobi files as an offline archive of a webpage. Like a webpage, ebooks can support video, complex styling, links, and scripting for complex interactions. Everything you would expect of a modern web experience – but all without a persistent internet connection.

You can think of PDF as, um, well, a frozen Word doc.
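To make the "offline archive of a webpage" idea concrete, here is a minimal Python sketch that zips a little HTML and CSS into an ePub-like layout and then lists what is inside. The file names under `OEBPS/` and the helper names `build_epub_like_archive` and `list_entries` are made up for illustration. This is not a complete, valid ePub: a real one also needs a package document (the `content.opf` that `container.xml` points to) and a few other pieces that are left out here.

```python
import io
import zipfile

# Pointer file that tells a reading app where the package document lives.
# The package document itself (OEBPS/content.opf) is omitted in this sketch.
CONTAINER_XML = """<?xml version="1.0" encoding="UTF-8"?>
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
  <rootfiles>
    <rootfile full-path="OEBPS/content.opf" media-type="application/oebps-package+xml"/>
  </rootfiles>
</container>
"""

# Hypothetical chapter: ordinary XHTML that links to an ordinary stylesheet.
CHAPTER_XHTML = """<?xml version="1.0" encoding="UTF-8"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Chapter 1</title>
    <link rel="stylesheet" type="text/css" href="style.css"/>
  </head>
  <body><h1>Chapter 1</h1><p>Hello, reader.</p></body>
</html>
"""

STYLE_CSS = "h1 { font-family: serif; }\n"


def build_epub_like_archive() -> bytes:
    """Return the bytes of a zip archive laid out like a stripped-down ePub."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        # The mimetype entry goes first and is stored without compression.
        archive.writestr("mimetype", "application/epub+zip",
                         compress_type=zipfile.ZIP_STORED)
        archive.writestr("META-INF/container.xml", CONTAINER_XML)
        archive.writestr("OEBPS/chapter1.xhtml", CHAPTER_XHTML)
        archive.writestr("OEBPS/style.css", STYLE_CSS)
    return buffer.getvalue()


def list_entries(archive_bytes: bytes) -> list:
    """List the file names inside a zip archive, in the order they were added."""
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as archive:
        return archive.namelist()


if __name__ == "__main__":
    for name in list_entries(build_epub_like_archive()):
        print(name)
```

The same `list_entries` idea works on a real `.epub` file: open it with `zipfile.ZipFile(path)` and you should see HTML, CSS, and image files much like a saved website.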

Technical publishers like The Pragmatic Programmers and O’Reilly Media (and essentially any publisher that doesn’t have a line of ebook readers) make their publications available in all 3 file formats as a way to serve all their customers.

The annoying thing is each ebook reader (whether a device or a software application) has its own presentation and functionality constraints. Some support color – others don’t. Some support tables of data or code samples or embedded fonts well – others not at all. In many ways, this is analogous to publishing a website where, despite the publisher’s intentions and the technical potential, presentation and experience are still completely up to the reader’s choice of vendor.

In many ways, the ebook retail space feels identical to the mobile application space. Each ebook retailer takes a substantial cut of the purchase price and may or may not have a completely opaque approval process that you may or may not be able to coordinate a market launch against. Thankfully, generating ebooks is very inexpensive compared to app development. There are a number of tools that can generate ebooks from pre-existing content: Adobe InDesign, Apple’s Pages, as well as many open-source toolchains like eBook Export for WordPress, Booktype, easybook, bookshop, Bookie, and likely more.

Content that’s primarily text will render fairly well across all ebook readers with these converter tools – some more manual/detailed tweaking may be required to really polish it. Again, similar to web development in this regard.

Unlike the web space, people are accustomed to paying some amount, however paltry, for ebooks (and mobile apps).

I see two opportunities for news publishers relative to ebooks:

The first is repackaging existing content into focused collections on a topic that serve a niche audience in a fuller, more comprehensive manner. A couple of examples of this are Nieman Lab’s “The Future of News As We Know It” series of epubs and, locally, the Star Tribune’s The Cookie Book: 10 Years of Winning Recipes from our Holiday Baking Contest.

The second is longer-form work that may not fit in a larger, more general-audience print publication. These are articles that really go in-depth and highlight journalistic expertise. Something so good that I’ll want to re-read it again and again. The definitive telling of an issue – that will likely take multiple sittings to finish. Recent examples include the Star Tribune’s In the Footsteps of Little Crow and the New York Times’ Snow Fall: The Avalanche at Tunnel Creek.

The thing is, web browsers are now technically sophisticated enough that they elegantly support offline access. In fact, in 2012 O’Reilly acquired the browser-based epub reader ibisreader.com and the company behind it – merging them into Safari Books Online, their on-demand content service.

This just leaves the bigger challenge of getting fans and customers comfortable with paying a meaningful amount for content.