Projects/Automatic translation software: Difference between revisions

Moses toolkit

  http://www.statmt.org/moses

 1. Run a virtual machine of the OLPC (skip this step if you have an actual OLPC). I personally use VirtualBox (by Sun/Oracle; it is free and open source):
        http://www.virtualbox.org/
     Prepackaged VM files are available here:
        http://dev.laptop.org/pub/virtualbox/
     I use version 656 because my OLPC runs the same version, but take your pick.
 2. Download the model file

  http://www.statmt.org/matrix/

 1. Create a client-server application that runs the resource-intensive application on a server. Clients will be a Web browser or a custom Python app.
     - Skills required: Python, Apache, C++
 2. Fork the decoder source code to enable it to run on the OLPC. Minimize memory consumption and discard code the application is unlikely to use.
     - Skills required: C++
 3. Minimize the work the decoder has to do by using a greedy search instead of a beam search, or by using a very tight beam and other pruning thresholds.
     - Skills required: C++, statistical machine translation

 4. Different language pairs
 5. Speech-to-speech translation
 6. Integrating Optical Character Recognition (OCR) with translation
 7. Enable sharing of user vocabulary via the OLPC Mesh network
 8. Distributed training of data on the OLPC
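
Item 3 above proposes trading search quality for speed by shrinking the beam. The following is a minimal sketch of that trade-off, not the Moses decoder itself: it is a toy monotone word-by-word decoder in which `beam_size=1` is greedy search. The translation table, the bigram scores, and the names `TRANSLATIONS`, `BIGRAMS`, `UNSEEN`, and `decode` are all hypothetical and chosen only for illustration.

```python
import heapq
import math

# Hypothetical toy translation table: source word -> [(target word, probability)].
TRANSLATIONS = {
    "das": [("the", 0.6), ("that", 0.4)],
    "ist": [("is", 1.0)],
}

# Hypothetical toy bigram language model: (previous word, word) -> probability.
BIGRAMS = {
    ("<s>", "the"): 0.5,
    ("<s>", "that"): 0.5,
    ("the", "is"): 0.05,
    ("that", "is"): 0.9,
}
UNSEEN = 1e-4  # assumed probability for bigrams missing from the toy model


def decode(source_words, beam_size):
    """Translate word by word, keeping the beam_size best partial outputs.

    beam_size=1 is greedy search; larger values explore more hypotheses
    at the cost of more memory and CPU time.
    """
    beam = [(0.0, ["<s>"])]  # (log-probability, output so far)
    for word in source_words:
        candidates = []
        for score, output in beam:
            # Unknown source words are passed through unchanged.
            for target, prob in TRANSLATIONS.get(word, [(word, 1.0)]):
                lm = BIGRAMS.get((output[-1], target), UNSEEN)
                new_score = score + math.log(prob) + math.log(lm)
                candidates.append((new_score, output + [target]))
        # Pruning step: only the best beam_size hypotheses survive.
        beam = heapq.nlargest(beam_size, candidates, key=lambda c: c[0])
    best_score, best_output = beam[0]
    return best_output[1:], best_score  # drop the <s> start marker


if __name__ == "__main__":
    for size in (1, 2):
        words, score = decode(["das", "ist"], beam_size=size)
        print(size, " ".join(words), round(score, 3))
```

With this toy data, the greedy setting commits early to the locally best first word and cannot recover, while a beam of two keeps the alternative alive long enough for the language model to prefer it. On hardware as limited as the OLPC, the open question is how small the beam can get before translation quality drops too far.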

  http://www.statmt.org/moses/

  Moses Support

 430 MHz CPU (AMD Geode x86 processor)
 237 MB RAM
 1GB flash disk
 Linux OS

Revision as of 16:13, 2 May 2010

Contents

Getting started

Mission Statement and Objectives

Project Ideas

Progress

12th March 2009

3rd May 2009

31st May 2009

12th July 2009
