new search concepts – the hidden data internet librarian london 2007
DESCRIPTION
New Search Concepts – the Hidden Data Internet Librarian London 2007. Helle Lauridsen. Technology Manager [email protected]. Literature search challenge – why deep indexing? Normal A&I. We cannot search this. Abstract and title – the basic indexing. This can be very difficult to search…. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/1.jpg)
New Search Concepts – the Hidden Data
Internet Librarian London
2007
Helle Lauridsen. Technology [email protected]
![Page 2: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/2.jpg)
Literature search challenge – why deep indexing?
Normal A&I
We probably
don’t want to search
this
Abstract and title – the basic indexing
We don’t want to
search this
This can be very difficult to search…
We cannot search this
We cannot search this
We cannot search this
![Page 3: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/3.jpg)
![Page 4: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/4.jpg)
Why Index Tables And Figures?
• They contain important and valuable information• Figures and tables represent the distilled essence of
research – the closest thing to raw datasets• Researchers want access to data • They are invisible
![Page 5: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/5.jpg)
Reasons Why Data Are Hidden In Traditional Searches
1. Data variables do not appear in any index.– there are no indexing ‘hooks’ in title, abstract or caption for
“dissolved oxygen”, below.
2. A search of the full text bypasses the image files– text in tables & figures is considered an image, not
searchable text
Table 1. Depth, physico-chemical and sedimentological variables.
![Page 6: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/6.jpg)
What Researchers Currently Do
• Search for photographs and maps more than tables, figures or graphs
• Use Google Images most often
• Level of satisfaction with traditional searches consistently rated low
• locating objects is “difficult”
• “in general, academic figures, tables, and graphs are not available to search”
![Page 7: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/7.jpg)
From idea to reality
• An innovative Company • A Prototype database of 325,000 objects • In depth market research set up by Carol Tenopir from
Tennessee University• 60+ scientists, students and librarians• Lots of travelling and face to face meetings with
scientists• A White Paper• Agreements with major publishers
![Page 8: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/8.jpg)
In Depth Market Research: Participants
![Page 9: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/9.jpg)
Current Practices and Experiences
A highly experienced and computer literate test group
![Page 10: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/10.jpg)
Experiences with Tables and Figures Index
• . “I can find the tables and figures that I need quickly, [and] it can save me a lot of time. I can work more efficiently” (Post Doc, Biology)
• “It makes the search much quicker when it is focused” (Post Doc, Biology)
• that “the tables and figures are really helpful for scanning large sets of data first” (Post Doc, Oceanography).
• “[i]t takes less time to find the information I want and especially I would find this useful when making a presentation” (Student, Biology).
• “I could find relevant information more quickly and images that were useful for presentations and research” (Professor, Engineering).
![Page 11: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/11.jpg)
Experiences with Tables and Figures Index
• Quality of the tables was PARAMOUNT.• Rights – with proper attribution tables and figures can be
extracted directly from the database and used in teaching and other work.
• Linking to the full text was crucial since they would not use an image unless they were sure of the context.
• They wanted to see a list of articles as well as a list of relevant objects
• Overview at a glance right after searching, no unnecessary clicks
They also told us…
![Page 12: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/12.jpg)
Effectiveness of Tables and Figures Index
Surprisingly, even the small dataset in the prototype revealed the usefulness of a tables and figures index:
0
50
100
150
200
More Less Same Unsure
The time for a search using a traditional database would have been:
0
20
40
60
80
100
120
140
160
No Yes Not Sure
Would Information Be Found Without Tables & Figures Search Capabilities?
![Page 13: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/13.jpg)
From prototype to reality
The feedback from the market research sent the development team back to the drawing board to make the required changes:
![Page 14: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/14.jpg)
The Product Design Changed
The figure quality improved drastically
Publisher specific attribution
![Page 15: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/15.jpg)
Article Information
- CCC working on
securing permissions specific to
images
Permission Types
![Page 16: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/16.jpg)
The Product Design Changed – and improved
Pinky nails for quick overview
![Page 17: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/17.jpg)
Clear sharp images + mouseover information = quick overview
![Page 18: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/18.jpg)
Object Thumbnails
Article Descriptors
Object Descriptors
Links to Full-Text
![Page 19: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/19.jpg)
Machine-Assisted Indexing:
Subject, Taxonomic; Geographic, Statistical
Manual Indexing
Indexing Review
3. Indexing
Scan
OCR
XML or variant
PDF text
PDF image
Hardcopy
1. Article AcquisitionManual Image Zoning
2. Image Processing
Automated Image
Extraction
Creation of CSA Illustrata Index
![Page 20: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/20.jpg)
Process Patent Application
![Page 21: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/21.jpg)
Multi disciplinary
![Page 22: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/22.jpg)
… and the press wrote:
![Page 23: New Search Concepts – the Hidden Data Internet Librarian London 2007](https://reader035.vdocuments.site/reader035/viewer/2022070406/568142a5550346895daee274/html5/thumbnails/23.jpg)
Read more about it
• Jacso, P. (2007). CSA illustrata, gale virtual reference library, and cambridge journals. Online, 31(3), 57.
• Ojala, M. Searching scholarly tables, figures, graphs, and illustrations with CSA illustrata. Information Today, 2007(5/7/2007). Retrieved 5/7/2007,
• ProQuest CSA adds content to illustrata and illumina. Retrieved 5/7/2007, 2007, from http://newsbreaks.infotoday.com/wndReader.asp?ArticleId=35842#top
• Tenopir, C. (2007). When you just need a part. Library Journal, (6) • Tenopir, C., Sandusky, R. J., & Casado, M. M. (2006). The value of
CSA deep indexing for researchers. White Paper,
- or just at
http://info.csa.com/csaillustrata/
THANK YOU
Helle Lauridsen