Syntax space for qcodes
NewsML-G2 uses datatype called QCode to express controlled values. QCode consists of a scheme alias, followed by a colon (:) followed by a code from the scheme.
Currently, NewsML-g2 defines syntax space for the code part of a QCode as “all characters except white space (#x20 | #x9 | #xD | #xA)”.
This means that code part of a QCode could contain characters like ‘#’, ‘?’ and ‘/’, and that QCodes such as “sch:abc#de”, “sch:abc?de” and “sch:abc/de” are valid QCodes.
When code part of a QCode is appended to scheme URI mapped from a scheme alias “sch”, resulting conceptURI might not be legal HTML (or RDF) URI.
Scheme URI / code
On the other hand, some codes used in the business today (like Reuters RICs) could contain some of the “problematic” values.
I have several questions to the group:
1. would such conceptURIs break existing applications (what is with semantic-web aware applications as a special case)
2. what procedure should be recommended to make conceptURIs “semantic-web safe” (simple URLEncoding would probably not help, because it would leave “problematic” characters unchanged, because they are legal URI characters)
3. should IPTC change the rule for the syntax space for QCodes? How to deal with existing codes that use characters like ‘#’, ‘?’ and ‘/’ in that case ?
Please share with us your thoughts on the matter.
News Content WP Chair
Counsellor for IT system
strategy and development
tel: +385 1 48 08 800
fax: +385 1 48 08 820
Croatian News Agency HINA
Marulicev trg 16
- VTD-XML 2.5 is now released. Please go to https://sourceforge.net/project/showfiles.php?group_id=110612&package_id=120172&release_id=661376 to download the latest version.
Changes from Version 2.4 (2/2009)
* Added separate VTD indexing generating and loading (see http://vtd-xml.sf.net/persistence.html for further info)
* Integrated extended VTD supporting 256 GB doc (In Java only).
* Added duplicateNav() for replicate multiple VTDNav instances sharing XML, VTD and LC buffer (availabe in Java and C#).
* Various bug fixes and enhancements