« Changing workflow with IGP:FLIP | Main | eScape Online Tutorial »

Saturday, February 21, 2009

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a01053628a377970b0105372217ed970b

Listed below are links to weblogs that reference eScape - Open Office to ePub Convertor:

Comments

Feed You can follow this conversation by subscribing to the comment feed for this post.

Dmitri Minaev

I've been looking for an EPUB authoring software for some months. Thank you for eScape. Do you plan a release for Linux?

Richard Pipe

Dmitri.

Linux version coming up very shortly. It will be a Deb if that is OK. We just a gentle walloping at Mobileread for the same reason and we are a Linux development house. Go figure! Please give us your feedback, complaints it's a different system to most, and we need a bit of feedback.

CNH

Only one snag. Export as XHTM(.html) in Open Office. File exported. Open Escape. Get input file. Selected exported html file. Escape replies on the lines 'select xhtml generated by Open Office ...'

So what's happening?

Richard Pipe

Thanks for the feedback.

That's our ommission on instructions. Sorry about that. The system is hard coded to look for an *.xhtml extension at present. The reason for this is a bit of histroy on browser and parsing behaviour.

The correct filename extension for XHTML is *.html. There is no official *.xhtml mimetype. When files are sent from a server, the browser will interprete .html using the SGML parser instead of XML parsing unless the server passes the mimetype application/xhtml+xml.

Mozilla in their wisdom wanted the browser to understand which parser to use from the file name for local file opening so they made up the *.xhtml convention, which fortunately Safari and Opera have adopted. Of course IE doesn't know a thing about this xhtml stuff and just acts dumb. If you double click the file, you can see the raw exported XHTML in Firefox, Safari/Chrome or Opera.

In eScape this is not really an issue, just a habit we have here of always saving XHTML as *.xhtml and we haven't communicated this clearly anywhere in the documentation. So please rename your file to *.xhtml and all should be OK, and use the *.xhtml extension option next time when exporting.

I will upgrade the documentation accordingly, and also in the next version we will fix this up so either *.html or *.xhtml can be loaded, and we will do our Open Office check internally. For interest, the reason this check exists is to prevent arbitrary XHTML/HTML files being loaded that make the application behave erratically!

Anonymous Coward

My OpenOffice Writer 3.0 doesn't even have an option to export as XHTML - if I use Save As instead of Export, it has "HTML (OpenOffice.org Writer Document)" (which claims to be HTML 4.0 Transitional, but actually isn't quite), and it uses upper-case tag/attribute names so isn't parsable as (case-sensitive) XHTML. Easily fixed with a pass through HTML Tidy, of course, but... ???

Richard Pipe

I have added a page on the tutorial on how to save as XHTML. You can opent the tutorial document here http://www.publisherdams.com/reader/content/c-0002184/?a=lc . Go to the last page titled originally "Saving the File as XHTML". Hope that clears up the problem.

Anonymous Coward

'fraid not - that says it's under Export, but it isn't. My Export filetype selector has:

PDF - Portable Document Format (.pdf)
-------------------------------
BibTeX (.bib)
LaTeX 2e (.tex)
MediaWiki (.txt)

that is all (I didn't count the dashes in the second line). Maybe you have some addon installed?

[Ah...after a bit of poking around, I found the answer: it's in the xsltfilter sub-package, which I don't have installed!]

Richard Pipe

Thanks for that info. I will put a note in the tutorial. "If you can't see the XHTML option, you probably don't have the xsltfilter sub-package installed". I found the option bit in the OO help file so will include that.

I am one of those close the eyes and install everything types. I use OO on Ubuntu and Vista, and around the office we have every version from 2.2 to 3.0. (about 40 workstations and 20 laptops all running OO on most Windows, and Linux versions available). We thought we had it reasonably covered with various tests. Guess we are all "Install everything" artists.

Again. Thanks for the feedback.

Jordi Mustieles

I've been playing with eScape and it's really great, but there are a couple of things that bug me:

1) Localization. The terms "Title page" and "Content" in the TOC seem hardcoded. Is there any way to translate that?

2) Support for images inside the text. This could be a deal breaker. Are you planning to add this in a near future, even in a simple way?

Also, while, we are at it, in the tutorial (section "Text file Preparation", points 8 through 13) it says to merge paragraphs together. But I need paragraphs in my text. What happens if I omit this step?

Richard Pipe

Jordi, Thanks for the feedback.

Last point first. That set of instructions is conditional for Gutenberg text. It's definitely ambiguous so I have corrected that. This little Regular Expression trick joins adjacent lines which are broken as individual paragraphs, and creates the paragraph break where there is an empty line. That is the only time it should be used. If your paragraphs are fine - jump these steps.

Localization - hard coded Title Page/Contents. We have had this issue pointed out by a kindly German user as well. The sin of language myopia is ours. We will get this fixed very shortly. The terms will be taken from the book content instead of an internal structure. You will be able to create your full TOC in any language.

Images. Various versions of the Open Office XSL convertors treat images differently (3.0 turns them into Base64) and we were looking for a "middle path". Our idea is to forget trying to get images into the ODT and just create references. I am creating a new blog post on the proposed approach and would like yours and anyone elses feedback. It makes the tool less elegant, but when using third party elements we have to work with them. The fact that they change frequently doesn't make it easier. We will also be introducing font embedding into the same version. This is a little easier.

Owl City

thanks for this article...
I really need something like this.

Bryan

Does not work in OSX 10.6.2

say's that it's not compatable with this computer when I try to install it with the manager.

XML Training

Nice post. Lot of inormation is there.Thanks for your innovative idea.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Blog powered by TypePad