Languages as RDF resources

Objectives

Make the languages as implicitely defined by tags and subtags in BCP 47 (RFC 4646 & RFC 4647) [1] and ISO 693 [2] available as proper RDF resources.

Use cases

Background

For backgorund discussions see the threads [4], [5] on public-esw-thes list. Language codes are defined in BCP 47 [1] which bases on ISO 639 [2]. Codes are managed by the ISO 639-2 Registration Authority [6], ISO 639-3 Registration Authority [7] and in the IANA Language Subtag Registry [8]. There are and/or have been some other systems of langauge tags (see [9]) that will not directly be covered by this proposal. See [10] for more information how to use BCP 47.

Issues identified so far

Management

Which authority is to define, maintain and host URIs? Since IANA/BCP 47 is specifying the tag grammar and subtag registry, it seems the best candidate. Base tag URIs may also be specified by the ISO 639 Registration Authorities.

Technical

BCP 47 defines a grammar to construct langtags from subtags. The list of subtag types and subtag values is authoritative, but the possible combinations defining langtags are open, and not all combinations make sense. So it seems difficult to define and maintain URIs for langtags, whereas URIs for subtags are easy to define. On the other hand combinations may be created dynamically to supply URIs for all possible combinations.

Proposed approaches

Detailed ontology with subclasses

The proposal [11] shows how this approach can be used to define Dublin Core metadata.

Plain SKOS without subclasses

See IANA Language Subtag Registry in SKOS for a first script to that converts IANA Language Subtag Registry to SKOS in RDF/XML based on this approach.

Bottom-up approach

http://www.lingvoj.org is harvesting URIs already used on the Web to declare languages as RDF resources, in order to make them available in the framework of the Linking Open Data project : http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData.

Open Issues

References

Languages as RDF Resources (last edited 2007-11-13 06:33:37 by BernardVatant)