R 2010

Recompression of Bitmaps in PDF using JBIG2 format

HATLAPATKA, Radim and Petr SOJKA

Basic information

Original name

Recompression of Bitmaps in PDF using JBIG2 format

Name in Czech

Rekomprese bitmap v PDF s užitím JBIG2 formátu

Authors

HATLAPATKA, Radim (203 Czech Republic, guarantor, belonging to the institution) and Petr SOJKA (203 Czech Republic, belonging to the institution)

Edition

2010

Other information

Language

English

Type of outcome

Software

Field of Study

20206 Computer hardware and architecture

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

RIV identification code

RIV/00216224:14330/10:00049414

Organization unit

Faculty of Informatics

Keywords (in Czech)

rekomprese bitonální grafiky;PDF;jbig2enc;JBIG2;velikost PDF;optimalizace;EuDML;DML-CZ;OCR

Keywords in English

JBIG2;compression;jbig2enc;PDF;PDF size optimization;EuDML;DML-CZ;OCR

Technical parameters

Petr Sojka, FI MU Brno, Botanická 68a, 60200 Brno, CZ, tel. +420549496966

Tags

International impact
Změněno: 10/5/2013 13:23, doc. RNDr. Petr Sojka, Ph.D.

Abstract

V originále

Software for automatization of lossy (re)compression of scanned bitmaps to standard JBIG2 with adaptive optimalization level of lossiness and compression. It can be used on any PDF files that contain bitonal images (e.g. created by scanning during digitization). On these types of files the space gained by recompression is often more than 50%. In addition, the quality of the rendered page is often better than before recompression, as character candidates used during recompression are averaged over all representants recognized (by OCR techniques) in given PDF. It has been applied on 30,000+ pages of http://dml.cz and has been also used in the EuDML project. The application has been described in the peer-reviewed publications like Sojka, P., Hatlapatka, R.: Document Engineering for a Digital Library: PDF recompression using JBIG2 and other optimization of PDF documents. In Proceedings of DocEng 2010 conference. ACM, 2010. p.3-12, ISBN 978-1-4503-0231-9

Links

LA09016, research and development project
Name: Účast ČR v European Research Consortium for Informatics and Mathematics (ERCIM) (Acronym: ERCIM)
Investor: Ministry of Education, Youth and Sports of the CR, Czech Republic membership in the European Research Consortium for Informatics and Mathematics
1ET200190513, research and development project
Name: DML-CZ: Česká digitální matematická knihovna
Investor: Academy of Sciences of the Czech Republic, DML-CZ: Czech Digital Mathematical Library
250503, interní kód MU
Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union