AVForums

Our philosophy in our forums, reviews, podcasts and feature videos is to promote audio and visual excellence by gathering and sharing the best information and resources available.

Help

To begin please visit our help section »

Not a Member Yet?

It only takes a minute to start enjoying the benefits of AVForums membership, and it's free!

Member Log in

Scanned Doc. to CD/DVD?

Post Reply
Old 20-10-2005, 10:49 AM   #1
GJC GJC is offline
Senior Member
 
GJC's Avatar
Join Date: Mar 2002
Location: London
Experience Points:
7,417, Level: 20
Points: 7,417, Level: 20 Points: 7,417, Level: 20 Points: 7,417, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 104, Got 43
Posts: 1,550
Scanned Doc. to CD/DVD?

I'm researching various ways of transfering documents to CD/DVD. I need software and or hardware that will do this and allow me to search the contents of the scanned Document (preferably pdf) within the DVD.

E.G. The DVD will autorun, Page 1 will refer to page one on DVD and if possible the contents table of the document will be linked in the DVD so that the user can click the chapter and be taken to it.

Any ideas?

---

Thanks!
  Quote
Old 20-10-2005, 11:55 AM   #2
Prominent Member
 
Mac Man's Avatar
Join Date: Jan 2001
Experience Points:
9,703, Level: 23
Points: 9,703, Level: 23 Points: 9,703, Level: 23 Points: 9,703, Level: 23
Activity: 2.7%
Activity: 2.7% Activity: 2.7% Activity: 2.7%
Thanks: Gave 220, Got 240
Posts: 3,103
You mentioned scanned documents and PDFs

Are you talking about scanned images within a PDF document, or converting, say, a Word file to PDF. There is nothing in a bitmapped image to search on, apart from any meta data you embed within the file somehow.

I believe the files would have to contain actual text in order for them to be searchable/indexable.


Chris
  Quote
Old 20-10-2005, 12:10 PM   #3
GJC GJC is offline
Senior Member
 
GJC's Avatar
Join Date: Mar 2002
Location: London
Experience Points:
7,417, Level: 20
Points: 7,417, Level: 20 Points: 7,417, Level: 20 Points: 7,417, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 104, Got 43
Posts: 1,550
Thanks for the reply Chris.

Basically I want to scan 'a book' containing upto 1000 pages or for example then transfer the scanned images in order to create a published CD or DVD, it would effectively become an e-book/digital bible.

Cheers!
  Quote
Old 20-10-2005, 12:48 PM   #4
Senior Member
 
tomson's Avatar
Join Date: Jul 2000
Location: Berk'amsted
Experience Points:
7,527, Level: 20
Points: 7,527, Level: 20 Points: 7,527, Level: 20 Points: 7,527, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 88, Got 189
Posts: 1,900
have worked on branding documents where we've designed a printed version then transfered everything into an html version. All the content is displayed within a browser and has a full index and search facility. This can be put online or onto CD. Have also done similar using PDFs for distribution on CD - save out each page as a PDF then combine it all into a single document that's viewable in Acrobat. You can create thumbnail and text indexes and retain full search functionality.
  Quote
Old 20-10-2005, 12:48 PM   #5
Conspicuous Member
 
shahedz's Avatar
Join Date: Apr 2005
Experience Points:
13,120, Level: 27
Points: 13,120, Level: 27 Points: 13,120, Level: 27 Points: 13,120, Level: 27
Activity: 1.1%
Activity: 1.1% Activity: 1.1% Activity: 1.1%
Thanks: Gave 1,382, Got 722
Posts: 8,464
HI check these guys out www.pinpointdigital.co.uk they specialise in scannign and archiving and i think they can do what you need,
  Quote
Old 20-10-2005, 12:56 PM   #6
GJC GJC is offline
Senior Member
 
GJC's Avatar
Join Date: Mar 2002
Location: London
Experience Points:
7,417, Level: 20
Points: 7,417, Level: 20 Points: 7,417, Level: 20 Points: 7,417, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 104, Got 43
Posts: 1,550
Thanks for the info guys I'll read up on the info you've provided.
  Quote
Old 20-10-2005, 1:08 PM   #7
Prominent Member
 
Mac Man's Avatar
Join Date: Jan 2001
Experience Points:
9,703, Level: 23
Points: 9,703, Level: 23 Points: 9,703, Level: 23 Points: 9,703, Level: 23
Activity: 2.7%
Activity: 2.7% Activity: 2.7% Activity: 2.7%
Thanks: Gave 220, Got 240
Posts: 3,103
Quote:
Originally Posted by tomson
have worked on branding documents where we've designed a printed version then transfered everything into an html version. All the content is displayed within a browser and has a full index and search facility. This can be put online or onto CD. Have also done similar using PDFs for distribution on CD - save out each page as a PDF then combine it all into a single document that's viewable in Acrobat. You can create thumbnail and text indexes and retain full search functionality.
Tomson, that'll work for text based documents, but not PDFs that contain scanned images - you can't search through images as you would a Word or text based PDF - (assuming that's what's needed) - in order to find a word or phrase within a page.

Also PDF files, unlike .exe files will not (easily) be made to autostart from a CD. I did find this though http://www.cdmenupro.com/pdf_starter.htm that might do the job.

A lot of document management systems for scanned images rely on using a database application that assigns searchable keywords to the images. You could try doing a Google on document management solutions (or similar)

Chris
  Quote
Old 20-10-2005, 1:39 PM   #8
Senior Member
 
tomson's Avatar
Join Date: Jul 2000
Location: Berk'amsted
Experience Points:
7,527, Level: 20
Points: 7,527, Level: 20 Points: 7,527, Level: 20 Points: 7,527, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 88, Got 189
Posts: 1,900
Quote:
Originally Posted by Chris Lamle
Tomson, that'll work for text based documents, but not PDFs that contain scanned images - you can't search through images as you would a Word or text based PDF - (assuming that's what's needed) - in order to find a word or phrase within a page.
Indeed. From GJCs 'book' description I presumed the content would would be predominantly text based anyway.

But if you ensure every image has a description (much like meta tags , but visible) you can build up a fairly intuitive search facility that pulls up image descriptions in the results . Obviously it requires a fair bit of forward planning.
  Quote
Old 20-10-2005, 3:09 PM   #9
GJC GJC is offline
Senior Member
 
GJC's Avatar
Join Date: Mar 2002
Location: London
Experience Points:
7,417, Level: 20
Points: 7,417, Level: 20 Points: 7,417, Level: 20 Points: 7,417, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 104, Got 43
Posts: 1,550
Quote:
Originally Posted by tomson
Indeed. From GJCs 'book' description I presumed the content would would be predominantly text based anyway.

But if you ensure every image has a description (much like meta tags , but visible) you can build up a fairly intuitive search facility that pulls up image descriptions in the results . Obviously it requires a fair bit of forward planning.
The documents will be 95% text based, the images dont require data for search purposes.
  Quote
Old 21-10-2005, 11:08 AM   #10
Prominent Member
 
Mac Man's Avatar
Join Date: Jan 2001
Experience Points:
9,703, Level: 23
Points: 9,703, Level: 23 Points: 9,703, Level: 23 Points: 9,703, Level: 23
Activity: 2.7%
Activity: 2.7% Activity: 2.7% Activity: 2.7%
Thanks: Gave 220, Got 240
Posts: 3,103
Quote:
Originally Posted by GJC
Thanks for the reply Chris.

Basically I want to scan 'a book' containing upto 1000 pages or for example then transfer the scanned images in order to create a published CD or DVD, it would effectively become an e-book/digital bible.

Cheers!
I took it that you were scanning a book and creating bmap images of the pages. I couldn't see how a scanned book could be text based. Unless you're using OCR software on the pages afterwards. Or have I missed something here?

Only trying to helpfull on the details given
  Quote
Old 21-10-2005, 11:11 AM   #11
GJC GJC is offline
Senior Member
 
GJC's Avatar
Join Date: Mar 2002
Location: London
Experience Points:
7,417, Level: 20
Points: 7,417, Level: 20 Points: 7,417, Level: 20 Points: 7,417, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 104, Got 43
Posts: 1,550
Quote:
Originally Posted by Chris Lamle
I took it that you were scanning a book and creating bmap images of the pages. I couldn't see how a scanned book could be text based. Unless you're using OCR software on the pages afterwards. Or have I missed something here?

Only trying to helpfull on the details given
--

---

Apologies should have been more specific. I believe OCR software would be needed so that keywords can be searched?
  Quote
Old 21-10-2005, 11:49 AM   #12
Senior Member
 
tomson's Avatar
Join Date: Jul 2000
Location: Berk'amsted
Experience Points:
7,527, Level: 20
Points: 7,527, Level: 20 Points: 7,527, Level: 20 Points: 7,527, Level: 20
Activity: 0%
Activity: 0% Activity: 0% Activity: 0%
Thanks: Gave 88, Got 189
Posts: 1,900
Quote:
Originally Posted by GJC
I believe OCR software would be needed so that keywords can be searched?
Absolutely. Again, I assumed this was a given. Should have explained that.

Mental note to self: must stop assuming.
  Quote
Old 21-10-2005, 12:01 PM   #13
Prominent Member
 
Mac Man's Avatar
Join Date: Jan 2001
Experience Points:
9,703, Level: 23
Points: 9,703, Level: 23 Points: 9,703, Level: 23 Points: 9,703, Level: 23
Activity: 2.7%
Activity: 2.7% Activity: 2.7% Activity: 2.7%
Thanks: Gave 220, Got 240
Posts: 3,103
Quote:
Originally Posted by tomson
Absolutely. Again, I assumed this was a given. Should have explained that.

Mental note to self: must stop assuming.
There was a great quote in a film I saw once

"Assumption is the mother of a **** ups"

Just so true. After nearly 20 years in design I (try) not to assume anything of my clients and suppliers.

  Quote
Post Reply



Thread information and display options
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off