OurStoriesXML: Difference between revisions

From OLPC
Jump to navigation Jump to search
No edit summary
No edit summary
Line 1: Line 1:
This page documents the mark-up language used to keep meta data for interviews recorded by the [[Our_Stories]] activity.
This page documents the mark-up language used to keep meta data for interviews recorded by the [[Our_Stories]] activity.


Some fields will be automatically generated during interview recording, ie date and file reference (and probably language and country of origin / geo-data, as those could be automatically appended by the school servers). Other fields, such as description fields and keyword tags, are optionally entered from the interviewers, or filled in by reviewers or teachers going over the stories afterwards.
The fields would be useful for categorization and searching of the collected audio clips.


__TOC__
some of the fields would be automatically generated during interview recording, ie date and file reference (and probably language and country of origin / geo-data, as those could be automatically appended by the school servers). Other fields, such as description fields and keyword tags would be optionally entered from either the children recording the interviews, or filled in by the teachers or other volunteers reviewing the stories.

__FORCETOC__
== Quick Example ==
== Quick Example ==


Line 16: Line 14:




Desired fields as of 12/2007 include (quoting Curtis Chen):
Desired fields as of 12/2007:
* '''Title''' - this title will be used for the link text on the landing pages; if no title is available, we use "Story By <first name of speaker>;" if no name is available, we use "UNTITLED"
* '''Title''' - this title will be used for the link text on the landing pages; if no title is available, we use "Story By <first name of speaker>;" if no name is available, we use "UNTITLED"
* '''Name of Speaker(s)''' - if none available, we use "Anonymous"
* '''Name of Speaker(s)''' - if none available, we use "Anonymous"
* '''Name of Audio File''' - please avoid using non-alphanumeric characters in filenames (e.g., punctuation and spaces; dashes and underscores are okay)
* '''Name of Audio File''' - please avoid using non-alphanumeric characters in filenames (e.g., punctuation and spaces; dashes and underscores are okay)
* '''Name of Photo''' - if available; same filename notes as above
* '''Name of Photo''' - if available; same filename notes as above
* '''Source''' - name of organization which collected the recording, e.g., "UNICEF"
* '''Source''' - name of school/organization which collected the recording, e.g., "UNICEF", "Khairat School"
* '''Country'''
* '''Country'''
* '''City/Town'''
* '''City/Town'''
* '''Short description'''
* '''Short description'''
* '''Language'''
* '''Keyword tags'''
* '''Geodata''' (latitude & longitude)


All fields should contain UTF-8 text strings.
All fields should contain UTF-8 text strings.


=== Other fields ===
Other fields to be added:
* Name of server used to upload
* Fields to be added in next revision: language, country of origin, generic description text field, keyword tags
* Creation related : multiple authors, dubbing/translation in the original
* multiple authors
* Transcription and translation related : Transcribers, Translators
** Transcribers
** Translators
* General tags.
** Dubbing/translation notes
* Geodata (lat/long, other)
* Server used to upload


=== Other data to gather on OurStories ===
=== Other data to gather on OurStories ===
Line 50: Line 50:
== Interested participants ==
== Interested participants ==
* Allan Doyle
* Allan Doyle
* John Huang
* ...
* ...

Revision as of 18:14, 7 April 2008

This page documents the mark-up language used to keep meta data for interviews recorded by the Our_Stories activity.

Some fields will be automatically generated during interview recording, ie date and file reference (and probably language and country of origin / geo-data, as those could be automatically appended by the school servers). Other fields, such as description fields and keyword tags, are optionally entered from the interviewers, or filled in by reviewers or teachers going over the stories afterwards.

Quick Example

<?xml version="1.0" ?>
<ourstories version="0.1">
  <story date="2007-10-29 07:13:04" 
         oggfile="/home/olpc/ourstories/story_filename.ogg" 
         title="story_title"/>
</ourstories>


Desired fields as of 12/2007:

  • Title - this title will be used for the link text on the landing pages; if no title is available, we use "Story By <first name of speaker>;" if no name is available, we use "UNTITLED"
  • Name of Speaker(s) - if none available, we use "Anonymous"
  • Name of Audio File - please avoid using non-alphanumeric characters in filenames (e.g., punctuation and spaces; dashes and underscores are okay)
  • Name of Photo - if available; same filename notes as above
  • Source - name of school/organization which collected the recording, e.g., "UNICEF", "Khairat School"
  • Country
  • City/Town
  • Short description
  • Language
  • Keyword tags
  • Geodata (latitude & longitude)

All fields should contain UTF-8 text strings.

Other fields to be added:

  • Name of server used to upload
  • multiple authors
    • Transcribers
    • Translators
    • Dubbing/translation notes

Other data to gather on OurStories

  • Narrative flow, unstructured groupings
  • External tags / additions to the above (cf. how Connexions does it)

Parsing

Versioning

version 0.1 - 10/29/2007

  • Initial attempt to define a XML style meta-data mark-up language.
  • Fields available: date, oggfile, title as attributes of a story

Interested participants

  • Allan Doyle
  • John Huang
  • ...