Seybold SF 2002
Mark Stephens (Managing Director)
www.idrsolutions.com
Who are IDRSolutions?
Established 1999.
Based in United Kingdom, resellers in Australia and USA.
Customers range from large multi-nationals to individuals.
Focus – systems integration & extracting content from pdf.
Why extract data from pdf files?
Retrieve content from pdf files.
Extract data from legacy systems using printed output, which can be easily converted into pdf.
xml
Extraction from pdf
Pdf files lack structure so the items on the page are not connected.
We develop algorithms to group the content from different types of page layout to meet customers’ requirements.
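To illustrate what "grouping" unconnected page items can mean, here is a minimal sketch in Java. It is not the algorithm used in our products: the Fragment class, the groupIntoLines method and the tolerance value are hypothetical names chosen for this example. It assumes each text fragment carries the x/y position of its baseline start, with y increasing up the page as in pdf coordinates, and it rebuilds lines of text by clustering fragments with similar y values.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class LineGrouper {

    /** Hypothetical text fragment: a string plus the position of its baseline start. */
    static final class Fragment {
        final String text;
        final double x;
        final double y;

        Fragment(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /**
     * Groups fragments whose y values lie within the given tolerance of the
     * first fragment of a line, then orders each line left to right.
     */
    static List<String> groupIntoLines(List<Fragment> fragments, double tolerance) {
        List<Fragment> sorted = new ArrayList<>(fragments);
        // Top of page first: pdf y coordinates increase upward.
        sorted.sort(Comparator.comparingDouble((Fragment f) -> -f.y));

        List<List<Fragment>> lines = new ArrayList<>();
        for (Fragment f : sorted) {
            List<Fragment> last = lines.isEmpty() ? null : lines.get(lines.size() - 1);
            if (last != null && Math.abs(last.get(0).y - f.y) <= tolerance) {
                last.add(f); // same line as the previous fragment
            } else {
                List<Fragment> line = new ArrayList<>();
                line.add(f); // start a new line
                lines.add(line);
            }
        }

        List<String> result = new ArrayList<>();
        for (List<Fragment> line : lines) {
            line.sort(Comparator.comparingDouble((Fragment f) -> f.x));
            StringBuilder sb = new StringBuilder();
            for (Fragment f : line) {
                if (sb.length() > 0) {
                    sb.append(' ');
                }
                sb.append(f.text);
            }
            result.add(sb.toString());
        }
        return result;
    }

    public static void main(String[] args) {
        // Fragments arrive in arbitrary order, as they often do in a pdf content stream.
        List<Fragment> page = List.of(
                new Fragment("world", 120, 700),
                new Fragment("Second", 50, 680.5),
                new Fragment("Hello", 50, 700.4),
                new Fragment("line", 110, 680));
        System.out.println(groupIntoLines(page, 2.0)); // prints [Hello world, Second line]
    }
}
```

Real page layouts (columns, tables, rotated text) need more than a single y tolerance, which is why the grouping rules have to be adapted to each type of layout.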
What do we offer?
Storypad
Enterprise – a high-end extraction and repurposing tool.
Personal – a low-end extraction tool for pdf.
Customized – versions to suit specific requirements.
A new LGPL library for pdf.
Cross-platform tools written in Java
Native Windows exe (dll ??)
Java Pdf Extraction Decoder Access Library
Routines to read and parse pdf files
Extraction of raw and scaled/clipped images
Extraction of text fragments as XML
Font information converted to XML metadata
Location on page of objects
Page Rasterizer
Examples included
Active development
Free of all dependencies – i.e. Acrobat SDK
LGPL license – no license fee, full source code
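As a rough picture of what "text fragments as XML" with font and location information could look like, the sketch below serializes one fragment by hand. It does not use the library's API or its real output format: the FragmentXml class, the toXml method, and the text element with its font, x and y attributes are all assumptions made up for this illustration.

```java
import java.util.Locale;

class FragmentXml {

    /** Escapes the five characters that are special in XML text and attributes. */
    static String escape(String s) {
        return s.replace("&", "&amp;")
                .replace("<", "&lt;")
                .replace(">", "&gt;")
                .replace("\"", "&quot;")
                .replace("'", "&apos;");
    }

    /** Builds a hypothetical XML element holding a fragment's text, font name and page position. */
    static String toXml(String text, String font, double x, double y) {
        // Locale.ROOT keeps the decimal separator a dot regardless of the machine's locale.
        return String.format(Locale.ROOT,
                "<text font=\"%s\" x=\"%.1f\" y=\"%.1f\">%s</text>",
                escape(font), x, y, escape(text));
    }

    public static void main(String[] args) {
        System.out.println(toXml("Profit & Loss", "Helvetica", 72, 700.5));
        // prints <text font="Helvetica" x="72.0" y="700.5">Profit &amp; Loss</text>
    }
}
```

Keeping the position and font as attributes means downstream tools can still regroup or restyle the fragments after extraction.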
LGPL and Open Source
Open Source offers ONE way to keep costs down, improve flexibility and match user requirements.
Examples – iText, Zope, JBoss, MySQL, Linux, Xpdf, Ghostscript, Apache, Samba, GIMP…
Free as in air, not beer
Access to the source code.
Right to modify the code.
No license fees required.
No limitations on usage.
Limited lock-in.
Commercial support/development available.
No support.
Cannot be passed off as your own work.
Acquisition cost – time to understand the software, modify it to meet requirements, test, support.
More details
Visit our websites at
www.idrsolutions.com
www.jpedal.org
Or come and see us…