Other formats:
BibTeX
LaTeX
RIS
@misc{905712,
  author = {Sojka, Petr},
  booktitle = {University of Portsmouth Computing Seminar},
  keywords = {mathematics knowledge management;DML-CZ;digitization workflow;digital libraries;pdfJbim;big2enc;PDF recompression},
  language = {eng},
  title = {Document Engineering for Digital Libraries (invited talk 5.11.2010,Portsmouth University Computing Seminar,UK)},
  url = {http://www.fi.muni.cz/usr/sojka/presentations/sojka-portsmouth-pres2010.pdf},
  year = {2010}
}
TY  - SLIDE
ID  - 905712
AU  - Sojka, Petr
PY  - 2010
TI  - Document Engineering for Digital Libraries (invited talk 5.11.2010,Portsmouth University Computing Seminar,UK)
KW  - mathematics knowledge management;DML-CZ;digitization workflow;digital libraries;pdfJbim;big2enc;PDF recompression
UR  - http://www.fi.muni.cz/usr/sojka/presentations/sojka-portsmouth-pres2010.pdf
L2  - http://www.fi.muni.cz/usr/sojka/presentations/sojka-portsmouth-pres2010.pdf
N2  - Several innovative document transformations and tools developed in the process of building the Digital Mathematical Library DML-CZ (http://dml.cz) are described. The main result is our new PDF re-compression tool, developed using an enhanced jbig2enc library. Together with pdfsizeopt.py by Péter Szabó, we have managed to decrease PDF storage size and transmission needs by 62%: using both programs, we reduced the original, already compressed PDFs to 38% of their size. We briefly describe the workflow and tools developed for creating the digital library. The batch digital signature stamper, document similarity metrics that use four different methods, a [meta]data validation process and math OCR tools represent some of the main [by]products. Such document engineering, together with Google Scholar indexing optimization, has led to the success of serving digitized and born-digital scientific math documents to the public in DML-CZ, and is also being employed in the European Digital Mathematics Library, EuDML.
ER  -
SOJKA, Petr. Document Engineering for Digital Libraries (invited talk 5.11.2010,Portsmouth University Computing Seminar,UK). In \textit{University of Portsmouth Computing Seminar}. 2010.