A bit of background on RDFa
RDFa is a set of extensions to HTML and XHTML from W3C. With RDFa it is possible to use custom vocabularies to include machine readable data in web documents. In current web documents based on HTML or XHTML there are very limited ways of expressing information for machines. There are:
- HTML elements that express document structure (e.g. headings, lists, tables), and rethoric (em, strong),
- (broadly defined) HTML elements that express various technical terms (code, kbd, samp), and,
- the content itself.
As more business domains are moving online the need to exchange data in a more structured fashion will increase. Instead of publishing data twice, once in a web document for humans and once in a separate file for machines, RDFa makes it possible to include machine readable data in web documents. (In a way, this has been possible since the modularization of XHTML, but in practice, few developers seem to have used the extension mechanisms of XHTML).
An RDFa example
Consider the following XHTML markup:
My name is Jo Smith. I'm a distinguished web engineer at
You can contact me
<a href="mailto:email@example.com">via email</a>.
Most humans will be able to understand the information but for machines this markup is too vague to parse without ambiguity. By providing more information about the content we can reduce this ambigity. First we provide information about our vocabulary in the HTML element:
Then we can use the terms of that vocabulary to provide more information for machines:
<p class="contactinfo" about="http://example.org/staff/jo">
My name is
<span property="contact:fn">Jo Smith</span>.
distinguished web engineer
<a rel="contact:org" href="http://example.org">
You can contact me
<a rel="contact:email" href="mailto:firstname.lastname@example.org">
One of the major benefits is that there is a standard for the vocuabulary specification and it is machine readable. You can open the URI for the vcard vocabulary used above in your browser (you may have to “view source” to see it) and see more information about the terms of the vocabulary. Another big advantage is that you can create a vocabulary specification for your business domain yourself and publish on the web. You do not have to put it through some central authority.
We have now modified the markup in our document to make it useful for both humans and machines. The document still looks the same for sighted users that look at the information in their web browser. Apart from the added benefit for search engines and desktop applications (e.g. importing this information into your adressbook now becomes easier) I believe it will have interesting implications for assistive tools as well.
What if assistive software could use RDFa information?
Since the vocabuary is created in a machine readable format it should be possible to let assistive software such as screen readers load the vocabulary specification and provide more information for the user. If you look at the vocabulary specification for vcards used above each term has a label text. For the title the specification looks like this:
One of the simplest ways of using this information is of course to read it to the listener during the linearization of the page. Since a term can have a description too even more information could be provided to the user.
In practice, the sequence of events for a screen reader working on top of a web browser could look like this:
- Browser opens the web page.
- Screen reader parses the HTML and extracts references to all external vocabularies.
- External vocabularies are fetched and parsed for labels and descriptions.
- The screen reader announce that extended information exists and starts rendering the page.
So, by using RDFa to reduce ambiguity for machines it is likely that humans too can benefit from the added information. It will be very interesting to see what makers of assistive tools can come up with. What other use cases for RDFa with regards to accessibility can you see?
For more information on RDFa see: