User:Assim.deodia: Difference between revisions

From OLPC
Jump to navigation Jump to search
No edit summary
No edit summary
Line 7: Line 7:
== Listen and Spell ==
== Listen and Spell ==


I am working on [[Listen_and_Spell]] actvity as a part of my GSoC project under the mentor ship of Dafydd Harris. This section maintain the updates and the discussion regarding the project.
I am working on [[Listen_and_Spell]] activity as a part of my GSoC project under the mentor ship of Dafydd Harris. This section maintain the updates and the discussion regarding the project.


git Repository: [http://dev.laptop.org/git/projects/listen-spell git Repository]<br />
git Repository: [http://dev.laptop.org/git/projects/listen-spell git Repository]<br />
Line 14: Line 14:
== Weekly Updates ==
== Weekly Updates ==


===May 24th - May 31st ===
=== May 24th - May 31st ===


----
Initial discussion with daf on the project design which finalized

==== The Start ====
Initial discussion with daf on the project design which finalized
* Using ElementTree module of python as the parser for XML dictionary files.
* Using ElementTree module of python as the parser for XML dictionary files.
* Using SQLite as a temporary storage space for the parsed dictionary so that it is easy to access data without parsing the XML again.
* Using SQLite as a temporary storage space for the parsed dictionary so that it is easy to access data without parsing the XML again.


* Work Details
==== Work Details ====
** Started with reading about python especially ElementTree class
* Started with reading about python especially ElementTree class
** Made a small sized dictionary file from [http://www.ibiblio.org/webster/ gcide] and making a basic class which initially only parsed the file and display its content in more readable form.
* Made a small sized dictionary file from [http://www.ibiblio.org/webster/ gcide] and making a basic class which initially only parsed the file and display its content in more readable form.
** Modified the XMl file to remove all the characters starting with "#&" as they were unreadable by parsers and were giving errors of "Unidentified entity"
* Modified the XMl file to remove all the characters starting with "#&" as they were unreadable by parsers and were giving errors of "Unidentified entity"
** Applied for git repository.
* Applied for git repository.

=== June 10th - June 14th===


----
===June 10th - June 14th===


* Work Details
==== Work Details ====
** Extended the class structure to include some more functions
* Extended the class structure to include some more functions
** Read about SQLite
* Read about SQLite
** Added the SQLite interface to the dictionary class which parses the dictionary and stores all the data in a database. This class is available at the git repository.
* Added the SQLite interface to the dictionary class which parses the dictionary and stores all the data in a database. This class is available at the git repository.
** Fixed the error of "Unidentified entity" by adding the "DOCTYPE", which defines entities for xml files, to all the XML files.
* Fixed the error of "Unidentified entity" by adding the "DOCTYPE", which defines entities for xml files, to all the XML files.
** Again applied for git repository as it's not been created yet.
* Again applied for git repository as it's not been created yet.
** Repository created :). First git commit
* Repository created :). First git commit


* Issues to be tackled
==== Issues to be tackled ====
** The "DOCTYPE" has to be included in each mxl file seperately
* The "DOCTYPE" has to be included in each xml file separately and since the size of DOCTYPE section is large, it is adding to file size. It can in placed in one file and that file can be included in each xml dictionary.
* GCIDE dictionary's are categorized by starting letters which have to re-categorized with respect to difficulty level.
* The XML schema of the GCIDE dictionary are different from what required. It has to redesigned according to required schema.
* Merging of above class can be done only after the above tasl are completed.

Revision as of 12:54, 14 June 2008

Assim Deodia
@ assim.deodia at gmail.com
{{{2}}}
Nuvola apps edu miscellaneous.png This user is a Under Graduate student– Information Technology at Netaji Subhas Institute of Technology

Listen and Spell

I am working on Listen_and_Spell activity as a part of my GSoC project under the mentor ship of Dafydd Harris. This section maintain the updates and the discussion regarding the project.

git Repository: git Repository
Project Page: GSoC page

Weekly Updates

May 24th - May 31st


The Start

Initial discussion with daf on the project design which finalized

  • Using ElementTree module of python as the parser for XML dictionary files.
  • Using SQLite as a temporary storage space for the parsed dictionary so that it is easy to access data without parsing the XML again.

Work Details

  • Started with reading about python especially ElementTree class
  • Made a small sized dictionary file from gcide and making a basic class which initially only parsed the file and display its content in more readable form.
  • Modified the XMl file to remove all the characters starting with "#&" as they were unreadable by parsers and were giving errors of "Unidentified entity"
  • Applied for git repository.

June 10th - June 14th


Work Details

  • Extended the class structure to include some more functions
  • Read about SQLite
  • Added the SQLite interface to the dictionary class which parses the dictionary and stores all the data in a database. This class is available at the git repository.
  • Fixed the error of "Unidentified entity" by adding the "DOCTYPE", which defines entities for xml files, to all the XML files.
  • Again applied for git repository as it's not been created yet.
  • Repository created :). First git commit

Issues to be tackled

  • The "DOCTYPE" has to be included in each xml file separately and since the size of DOCTYPE section is large, it is adding to file size. It can in placed in one file and that file can be included in each xml dictionary.
  • GCIDE dictionary's are categorized by starting letters which have to re-categorized with respect to difficulty level.
  • The XML schema of the GCIDE dictionary are different from what required. It has to redesigned according to required schema.
  • Merging of above class can be done only after the above tasl are completed.