Annotation

english | 한국어 HowTo [ID# 57107] +/-

We want to support annotation of any document, in a generalized way that can be supported by a unified aggregation and sharing system (where annotations/comments are similar to other objects in the object store). Media that should support annotation include documents and images; perhaps also any webpage or item viewed through a browser. In the extreme one can imagine adding notes to any moment in time using a laptop; associated as well as possible with a specific item with its own identifier, or a specific activity, or at least a combination of timestamp and screenshot and context.

We should support elegant libraries for displaying aggregated notes; levels of publicity (and perhaps ways to change this after the fact for clusters of notes) and ways to highlight annotations and reviews as they take place.

See also content stamping for a specific kind of annotation that supports reviewing.

What's an Annotation?

An annotation is any kind of data imposed onto another page/document/object. Generally you do not need the permission of the author to add these comments or discussion. You may share your annotations with other users, or they may be private.

An annotation may be:

A comment that applies to a specific range of text
Something directed at a coordinate location in a PDF or image
A comment applied to a document generally
A comment applied to another annotation (forming a threaded discussion)
A rating or recommendation
A copyedit intended for the author
No comment, but simply the highlighting of a range of text or a pointer to something in a PDF (indicating a vague sense of "this is important or interesting")

As a result there are many optional aspects to an annotation -- the comment text is optional, the text range is optional, tags are optional, ratings are optional, etc.

Desired Features

Querying and Aggregation

It is useful to aggregate annotations. In the simplest case, we want to retrieve annotations from several sources.

Automatically aggregated annotations can also be useful. An aggregator may pull together annotations from many sources and either republish a selection of the annotations. For example, the aggregator may drop what it judges to be spam, or only republish what it judges to be the most interesting annotations.

Identifying Targets

We weren't able to find any existing protocols for specifying target content, so we identified the two main use cases:

Commenting on a page as a whole (Digg-like).
Commenting on specific sections of a page.

Threading

RFC4685 covers ATOM threading in detail.

Rating

We settled on adding an <ann:rating>N</ann:rating> equivalent, which gives a user rating for the target page.

hReview was considered, but it seemed overkill for our needs.

Tagging/Categorisation

Publishing

Viewing Annotations

When annotations are separate from the underlying work, one can see a constellation of notes from many people. A few views which we want to readily support:

no comments
my own comments
comments from a group (myself/class/teachers)
all comments
new comments

We also want to limit the types of annotation viewed to an area of interest:

Point-and-click annotation associated with a spot on an image or page
Selection annotation associated with a string in a document or region in an image
Block annotation associated with a paragraph or block in a document or region in an image
Document-level annotation such as tags or reviews

API Proposals

Here are two proposals.

Original Annotation API Proposal by Ian Bicking and Joshua Gay
Comment Anywhere Annotation Protocol Proposal by Alec Thomas and Alan Green

XSS Security

We will be injecting other people's HTML into content. We must be sure this HTML does not contain dangerous stuff, like Javascript that itself calls XMLHttpRequests. We must be sure to scrub the HTML carefully. It is difficult to do this in Javascript, but that would be most secure (on the client when loading the comments). We could require XHTML, embedded in the Atom, to do this. Or, we could rely on server-side filtering of the HTML.

References

Server-side HTML filtering in lxml.html.clean -- Ian Bicking
We're working on a Atom store for tagging (a related kind of annotation) called TaggerStore -- it's in an early stage still -- Ian Bicking

Annotation

Contents

What's an Annotation?

Desired Features

Querying and Aggregation

Identifying Targets

Threading

Rating

Tagging/Categorisation

Publishing

Viewing Annotations

API Proposals

XSS Security

References

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

About OLPC

About the laptop

About the tablet

Projects

OLPC wiki

Tools