Další formáty:
BibTeX
LaTeX
RIS
@inproceedings{959065, author = {Pomikálek, Jan and Suchomel, Vít}, address = {Brno, Czech Republic}, booktitle = {RASLAN 2011}, edition = {5}, editor = {Aleš Horák, Pavel Rychlý}, keywords = {character encoding; character encoding detection; charset; Unicode}, howpublished = {tištěná verze "print"}, language = {eng}, location = {Brno, Czech Republic}, isbn = {978-80-263-0077-9}, pages = {125-129}, publisher = {Tribun EU}, title = {chared: Character Encoding Detection with a Known Language}, url = {https://nlp.fi.muni.cz/raslan/2011/paper16.pdf}, year = {2011} }
TY - JOUR ID - 959065 AU - Pomikálek, Jan - Suchomel, Vít PY - 2011 TI - chared: Character Encoding Detection with a Known Language PB - Tribun EU CY - Brno, Czech Republic SN - 9788026300779 KW - character encoding KW - character encoding detection KW - charset KW - Unicode UR - https://nlp.fi.muni.cz/raslan/2011/paper16.pdf N2 - chared is a system which can detect character encoding of a text document provided the language of the document is known. The system supports a wide range of languages and the most commonly used character encodings. We explain the details of the algorithm, describe the process of creating models for various languages and present results of an evaluation on a collection of Web pages. ER -
POMIKÁLEK, Jan a Vít SUCHOMEL. chared: Character Encoding Detection with a Known Language. In Aleš Horák, Pavel Rychlý. \textit{RASLAN 2011}. 5. vyd. Brno, Czech Republic: Tribun EU, 2011, s.~125-129. ISBN~978-80-263-0077-9.
|