The Internet side Distributed Proofreaders (DP) was called in the year 2000 by Charles Franks in the life, in order to support the international Project good mountain.
Here one tries to regard by partitioning of in-scanned books into individual sides the working load as small as possible as an individual proofreader and according to the Brute Force method (means here: as large a number of editors as possible reads a book side of thousands of made available to the correction) as large a Pensum as possible to reach only in each case.
According to the same principle as when distribute counting (distributed computing) one proceeds. The crucial difference consists of the fact that a very large number of computers over Internet it is not linked with one another here but that of any size a number of humans over Internet available their cooperation makes and so that within a short time by their proof reading digitizes hundreds of books.
Three phases can fundamental be differentiated with respect to the expiration.
After call of the project a page of the book is indicated in each case. The scanned original side becomes in the upper screen half (as diagram) and indicated in the lower screen half the recognized ocr text. The Proofreader reads now the text of the original side and compares it with the ocr text (raw text). Scanfehler are corrected and supplemented special characters.
This actual proof reading ("proofing ") takes place in two rounds, whereby each side is worked on by two different participants. The second round only experienced proofreaders become certified.
In the third and fourth round formatting are added (e.g. italic writing, headings, footnotes). While the entrance hurdles are relative to the third round small, only experienced participants have entrance to the fourth round (second of formatting).
The unconnected sides raw text into a text document are combined automatically. In each case an experienced proofreader, who reached the status one "post office Processors ", completes the layout with the diagrams, i.e. it adapts these, improves these and/or supplements still possible gaps in the text. It examines the document for complete agreement with the original work. Finally it can produce except the mandatory title format still further formats, above all for HTML.
The project is terminated. The digitized work is published on the server by Project good mountain (not to confound with the commercial offerer project Gutenberg DE). Each Internet user can download and read now this work. The work is thereby to the whole world at the disposal.
In the process of the time Distributed Proofreading (DP) developed to the largest source of E-texts for the Project good mountain, so that Distributed Proofreaders became in the year 2002 official part of the Project good mountain. Up to now approx. 7,000 texts from literature and science in the Internet were again-published by Distributed Proofreading. Thus a substantial contribution is made with the elevation of a knowledge treasure of our culture and knowledge history.
http://www.pgdp.net - homepage of the founder Charles Franks. Worked on predominantly English texts.
http://dp.rastko.net/de - Distributed Proofreaders of Europe. Works on texts of all European languages.
We found here 2 articles.
D» Distributed Proofreaders» Document scanner |
We found here 5 related websites.
Index | Privacy | Terms Of Use | Sitemap | Feedback