IAEA International Atomic Energy Agency
Special Characters Implementation
Zbigniew Majewski
12th Joint INIS/ETDE Technical Committee Meeting 21-22 October 2009, Vienna, Austria
IAEA, 21-22 October 2009, Vienna. 12th INIS/ETDE Joint Technical Committee Meeting
Outcome of the 11th JTCM
• XML implementation for INIS output and a new input tool development should allow introduction of Unicode.
• Recommendation to develop a detailed plan covering the possible implications of Unicode implementation
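As a rough illustration of why an XML-based record format opens the door to Unicode, the sketch below writes and re-reads a small record with Python's standard `xml.etree.ElementTree`. The element names (`record`, `title`, `author`) and the sample values are hypothetical and are not taken from the actual INIS XML schema.

```python
import xml.etree.ElementTree as ET


def build_record(title: str, author: str) -> bytes:
    """Serialize a tiny record as UTF-8 encoded XML.

    Element names are illustrative only, not the INIS schema.
    """
    record = ET.Element("record")
    ET.SubElement(record, "title").text = title
    ET.SubElement(record, "author").text = author
    # encoding="utf-8" keeps non-ASCII characters as UTF-8 bytes
    # instead of replacing them with numeric character references.
    return ET.tostring(record, encoding="utf-8", xml_declaration=True)


def read_record(data: bytes) -> dict:
    """Parse the XML back into a plain dictionary of field values."""
    root = ET.fromstring(data)
    return {child.tag: child.text for child in root}


if __name__ == "__main__":
    xml_bytes = build_record("Rozpad α polonu", "Skłodowska-Curie, M.")
    print(xml_bytes.decode("utf-8"))
    print(read_record(xml_bytes))
```

The point of the round trip is that the multilingual title and author name survive unchanged, without any transliteration step.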
Problem
• INIS allows only the letters a-z and A-Z, digits and a few special characters
• The quality of INIS records is constrained by this limited character set
• Some abstracts, original titles, author names, conference titles and journal titles use multilingual characters
• Some INIS records need formulas in their abstracts
• Extra effort is required to strip the rich character set from electronic input
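The kind of information loss behind these points can be sketched with a small ASCII-folding routine. This is not the INIS procedure; it is a minimal example, assuming a plain ASCII target as a stand-in for the restricted INIS character set, whose exact definition is not given here. The function name `fold_to_ascii` is hypothetical.

```python
import unicodedata


def fold_to_ascii(text: str) -> str:
    """Reduce rich text to plain ASCII, losing information on the way."""
    # Split accented letters into base letter + combining mark.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks (accents, umlauts, ...).
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Anything still outside ASCII cannot be represented and becomes "?".
    return stripped.encode("ascii", "replace").decode("ascii")


if __name__ == "__main__":
    for sample in ("Müller", "Skłodowska", "α-decay"):
        print(sample, "->", fold_to_ascii(sample))
```

Accented letters degrade to their base letters, while characters with no ASCII base, such as "ł" or "α", are lost entirely, which is why author names and formulas suffer most.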
Impacts
• Storage
  • Databases and data exchange files
• Processing
  • QA (checking rules, authority validation)
  • Retrieval
  • External applications
• Presentation
  • HTML/XML enabled browsers
  • User interface using tool-specific data formats
Approach options
• Unicode-enabled storage based
  • Unicode encoding (binary representation) implemented in all layers (storage, processing and presentation)
  • Use of XML for interfaces (like Atomindex)
• Mark-up based
  • ASCII-based mark-up for Unicode characters implemented for storage and presentation
  • Processing modified to recognize mark-up or to become character agnostic
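The difference between the two options can be sketched in a few lines. The example below assumes UTF-8 as the binary Unicode encoding and XML-style numeric character references (`&#NNN;`) as the ASCII mark-up; the slides do not specify either choice, and the function names are hypothetical.

```python
import re


def to_utf8(text: str) -> bytes:
    """Option 1: store the text in a binary Unicode encoding (UTF-8 assumed)."""
    return text.encode("utf-8")


def to_ascii_markup(text: str) -> str:
    """Option 2: keep storage pure ASCII, marking up non-ASCII characters."""
    # "xmlcharrefreplace" turns each non-ASCII character into &#NNN;
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


def from_ascii_markup(marked: str) -> str:
    """Presentation layer: expand the numeric references back to characters."""
    return re.sub(r"&#(\d+);", lambda m: chr(int(m.group(1))), marked)


if __name__ == "__main__":
    name = "Skłodowska"
    print(to_utf8(name))
    marked = to_ascii_markup(name)
    print(marked)
    print(from_ascii_markup(marked))
```

With the mark-up option, a processing step that is unaware of the mark-up sees `&#322;` as six ordinary characters, so length checks, sorting and searching need to recognize the references or treat the text as opaque. Text that already contains a literal `&#NNN;` sequence would also need an escaping rule, which this sketch omits.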
Barriers
Processing step                  Software component         Unicode compatibility (Presentation / Processing / Storage)
BR preparation                   FIBRE                      - / - / -
                                 MET                        + / + / +
Submission to INIS Secretariat   e-mail, FTP, file system   + / + / +
Image processing                 Scanning/OCR               + / - / +
BR QA                            IDPS                       + / - / +
                                 Thesaurus                  + / + / +
                                 Journals                   + / + / +
                                 CAI                        + / + / +
INIS Products                    Atomindex                  +
                                 NCL Collection             +
                                 INIS DB on DVD             +
                                 INIS DB on Web             + / - / +
Actions
• Finalize upgrading the software platform used by INIS applications
• Modify FIBRE and IDPS to allow Unicode characters
• Extend use of XML as the INIS record format throughout the entire INIS process
• Agree on use of Unicode in Atomindex
• Replace the search engine to allow searches with Unicode characters
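One issue a Unicode-capable search engine has to handle is that the same visible text can be stored as different code point sequences. The sketch below shows one common way to compare query and record text, assuming NFC normalization plus case folding; it illustrates the idea only and says nothing about which search engine or matching rules INIS would adopt. The names `search_key` and `matches` are hypothetical.

```python
import unicodedata


def search_key(text: str) -> str:
    """Build a comparison key: canonical composition plus case folding."""
    composed = unicodedata.normalize("NFC", text)
    # Normalize again after folding, since folding can change composition.
    return unicodedata.normalize("NFC", composed.casefold())


def matches(query: str, field: str) -> bool:
    """Return True if the query occurs in the field, ignoring case and
    differences between precomposed and decomposed characters."""
    return search_key(query) in search_key(field)


if __name__ == "__main__":
    author = "Maria Skłodowska-Curie"
    print(matches("SKŁODOWSKA", author))
    # "e" followed by a combining acute accent vs. precomposed "É"
    print(matches("e\u0301cole", "École polytechnique"))
```

Applying the same key function at indexing time and at query time keeps the two sides consistent.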
Thank you!