05 June 2011

Cropping complete

As reported in March 2010, the microfiche images omit the cover pages for the January 1968 and January 1969 issues. I attended the Socialist Party of Great Britain's head office yesterday and picked up a spare copy of the January 1969 issue. There were no extra January 1968 issues remaining, but the issue did appear in the bound volumes in the archive, so I took a photocopy. The binding obscures about a centimetre of the left edge of the page, but my copy is better than nothing, I guess. Now my only problem is getting these two pages digitized, as I no longer have access to a scanner. I'll either have to find someone with a scanner, or see if I can photograph the covers myself.

With most of the pages now in place, it's time to start thinking again about how to "bind" them into PDF or DjVu documents. Since it's been a year since I last experimented with this, I downloaded the latest version of jbig2enc and its dependency, Leptonica. I discovered that jbig2enc doesn't compile with Leptonica 1.68, but only because the parameters to the findFileFormat() function have changed. This function is referenced once, in jbig2.cc, where it's used to check something involving multi-page TIFFs. I don't use jbig2enc to process TIFFs so I just commented out these lines, and then jbig2enc compiled fine.

My computer is now whirring away, generating three PDFs for each of the issues that I have up to the end of 1969: one with no upsampling, one with 2× upsampling, and one with 4× upsampling. It will probably be busy doing this all night. Once it's done, I'll examine the results to see what looks the best and what the file sizes are like. Watch this space for further analysis of the results…

3 comments:

  1. Dear Tristan Miller

    I'm happy to read again about your digitization project

    I also had problems in compiling jbig2enc with leptonica 1.68

    the latest commit to jbig2enc sources, seems not solve the problem, but only allow compilation without tiff support

    do you think it is possible restore tiff support in jbig2enc?

    ReplyDelete
  2. Sorry for the late reply. If you follow the link to the bug report in the post, you'll see that on 6 July mistydemeo claimed to have located the source of the problem and fixed it. I haven't tested the fix myself as I've got no need to upgrade from my own patched version.

    ReplyDelete
  3. thanks for reply

    I'm interested to read news about this digitization project

    I downloaded latest jbig2enc source code from git and leptonica libs 1.68 and I have successfully compiled jbig2enc binary with tiff support

    I have also seen that *rubypdf* posted a patch for *pdf.py* able to make pdf.py, able to generate the proper pdf page size

    - https://github.com/agl/jbig2enc/issues/15

    ReplyDelete