PDF Generating Tool Support Forum

  Subject: Can pd4ml follow HTML links and merge it to one pdf?
   Posted: 12 May 2010, 13:30
I have an index.html (a local file) that links to other HTML pages in a folder. I want to generate a single PDF that contains all the linked HTML pages.

Just like when using Acrobat > Create PDF > From Web page > Get entire site.

I don't want to generate all the HTML pages separately and then merge them with iText.

Thanks in advance!


  Subject: Re: Can pd4ml follow HTML links and merge it to one pdf?
   Posted: 18 Jun 2010, 21:56
For the time being the feature is not supported.

However, PD4ML implements a proprietary <pd4ml:include> tag, which could help achieve the needed result.

Currently the tag is undocumented and we use it only in DXL->PDF conversion scenarios. I am afraid the tag has some limitations - I need to check.

But the idea is as follows: you may add a sequence of tags at the end of the main document:
<pd4ml:include src="support.htm">
<pd4ml:include src="about.htm">
</body>
</html>

The tags will be ignored by browsers, but PD4ML will include the referenced documents. Of course, in forthcoming versions we could automate the process and implicitly build the list from <a href=> tags.
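
Until such automation exists, the list could be built in a preprocessing step before the HTML is handed to PD4ML. What follows is only a minimal sketch under stated assumptions: the class name IncludeTagBuilder and its methods are hypothetical and not part of PD4ML, the regular expression handles only simple quoted href attributes, and the <pd4ml:include> tag is undocumented, so its behaviour should be checked against your PD4ML version. The code collects local .htm/.html link targets from the main document and appends one include tag per target just before </body>.

```java
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PD4ML: builds <pd4ml:include> tags from <a href> links.
class IncludeTagBuilder {

    // Matches href="..." or href='...' inside an <a ...> tag (simple quoted values only).
    private static final Pattern HREF = Pattern.compile(
            "<a\\b[^>]*?\\bhref\\s*=\\s*([\"'])([^\"'>]*)\\1",
            Pattern.CASE_INSENSITIVE);

    private static final Pattern BODY_END =
            Pattern.compile("</body\\s*>", Pattern.CASE_INSENSITIVE);

    /** Collects local .htm/.html link targets in document order, without duplicates. */
    static Set<String> collectLocalLinks(String html) {
        Set<String> links = new LinkedHashSet<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            String target = m.group(2).trim();
            int hash = target.indexOf('#');
            if (hash >= 0) {
                target = target.substring(0, hash); // drop the fragment, keep the file
            }
            String lower = target.toLowerCase(Locale.ROOT);
            boolean external = lower.contains("://") || lower.startsWith("mailto:");
            if (!external && (lower.endsWith(".htm") || lower.endsWith(".html"))) {
                links.add(target);
            }
        }
        return links;
    }

    /** Inserts one include tag per local link before the last </body>, or at the end if none. */
    static String appendIncludes(String html) {
        StringBuilder tags = new StringBuilder();
        for (String link : collectLocalLinks(html)) {
            // Tag syntax taken from the reply above; the tag itself is undocumented.
            tags.append("<pd4ml:include src=\"").append(link).append("\">\n");
        }
        Matcher m = BODY_END.matcher(html);
        int bodyEnd = -1;
        while (m.find()) {
            bodyEnd = m.start();
        }
        if (bodyEnd < 0) {
            return html + tags;
        }
        return html.substring(0, bodyEnd) + tags + html.substring(bodyEnd);
    }

    public static void main(String[] args) {
        String index = "<html><body><a href=\"support.htm\">Support</a> "
                + "<a href=\"about.htm\">About</a></body></html>";
        System.out.println(appendIncludes(index));
    }
}
```

The result of appendIncludes would then be passed to PD4ML in place of the original index.html. The sketch does not follow links inside the included pages, so nested pages would need either a recursive pass or manually added tags.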

