Masaryk University

Publication Records

    2022

    1. HA, Hien Thi and Aleš HORÁK. Information Extraction from Scanned Invoice Images using Text Analysis and Layout Features. Signal Processing: Image Communication. Elsevier, 2022, vol. 102, No 1, p. 1-11. ISSN 0923-5965. Available from: https://dx.doi.org/10.1016/j.image.2021.116601.
      RIV/00216224:14330/22:00125095 Article in a journal. English. Netherlands.
      Ha, Hien Thi (704 Viet Nam, belonging to the institution) -- Horák, Aleš (203 Czech Republic, belonging to the institution)
      Keywords in English: OCR; Information extraction; Scanned documents; Document metadata; Invoice metadata extraction; Metadata indexing

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 28/3/2023 09:56.

    2021

    1. HA, Hien Thi, Aleš HORÁK and Minh Tuan BUI. Contract Metadata Identification in Czech Scanned Documents. Online. In Ana Paula Rocha, Luc Steels and Jaap van den Herik. Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART. Portugal: The SciTePress Digital Library, 2021, p. 795-802. ISBN 978-989-758-484-8. Available from: https://dx.doi.org/10.5220/0010243807950802.
      RIV/00216224:14330/21:00121131 Proceedings paper. English.
      Ha, Hien Thi (704 Viet Nam, belonging to the institution) -- Horák, Aleš (203 Czech Republic, guarantor, belonging to the institution) -- Bui, Minh Tuan (704 Viet Nam)
      Keywords in English: Information Extraction; Scanned Documents; Document Metadata; Contract Metadata Extraction; Czech
      Type of proceedings: post-proceedings
      International impact: yes
      Reviewed: yes

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 23/5/2022 14:21.
    2. HA, Hien Thi and Aleš HORÁK. Who is Selling to Whom – Feature Evaluation for Multi-block Classification in Invoice Information Extraction. Online. In Karpov A., Potapova R. SPECOM 2021: 23rd International Conference on Speech and Computer. St. Petersburg, Russia: Springer, 2021, p. 250-261. ISBN 978-3-030-87801-6. Available from: https://dx.doi.org/10.1007/978-3-030-87802-3_23.
      Name (in English): Who is Selling to Whom – Feature Evaluation for Multi-block Classification in Invoice Information Extraction
      RIV/00216224:14330/21:00123275 Proceedings paper. English.
      Ha, Hien Thi (704 Viet Nam, belonging to the institution) -- Horák, Aleš (203 Czech Republic, belonging to the institution)
      Keywords in English: OCR; Invoice; Block type classification; Seller; Buyer; Delivery address
      International impact: yes
      Reviewed: yes

      Changed by: doc. RNDr. Aleš Horák, Ph.D., učo 1648. Changed: 10/10/2022 10:26.

    2019

    1. HA, Hien Thi. Approximate String Matching for Detecting Keywords in Scanned Business Documents. Online. In Ales Horak, Pavel Rychly, Adam Rambousek. Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2019. Brno, Czech Republic: NLP Consulting, 2019, p. 49-54. ISBN 978-80-263-1530-8.
      RIV/00216224:14330/19:00113733 Proceedings paper. English. Czech Republic.
      Ha, Hien Thi (704 Viet Nam, guarantor, belonging to the institution)
      Keywords in English: approximate string matching; Levenshtein distance; weighted edit distance; OCR; invoice
      Type of proceedings: pre-proceedings
      International impact: yes
      Reviewed: yes

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 15/5/2024 01:32.

    2018

    1. BUI, M. Tuan, R. DOSKOCIL, V. KRIVANEK, Hien Thi HA, Y. BERGEON and P. KUTILEK. Indirect Method Usage of Distance and Error Measurement by Single Optical Cameras. Advances in Military Technology. University of Defence, 2018, vol. 13, No 2, p. 209-221. ISSN 1802-2308. Available from: https://dx.doi.org/10.3849/aimt.01221.
      RIV/00216224:14330/18:00111380 Article in a journal. English. Czech Republic.
      Bui, M. Tuan (704 Viet Nam) -- Doskocil, R. (203 Czech Republic) -- Krivanek, V. (203 Czech Republic) -- Ha, Hien Thi (704 Viet Nam, belonging to the institution)
      Keywords in English: distance measurement; indirect method; measurement error; single optical camera; uncertainty

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 3/5/2020 13:27.
    2. OCRMiner (software)
      HA, Hien Thi, Aleš HORÁK, Marek MEDVEĎ and Zuzana NEVĚŘILOVÁ. OCRMiner. 2018.
      Name in Czech: OCRMiner
      Name (in English): OCRMiner
      RIV/00216224:14330/18:00101859 Software. English. Czech Republic.
      Ha, Hien Thi (704 Viet Nam, belonging to the institution) -- Horák, Aleš (203 Czech Republic, belonging to the institution) -- Medveď, Marek (703 Slovakia, guarantor, belonging to the institution) -- Nevěřilová, Zuzana (203 Czech Republic, belonging to the institution)
      Keywords in English: data mining; information extraction; text classification; OCR

      Changed by: doc. RNDr. Aleš Horák, Ph.D., učo 1648. Changed: 2/4/2019 13:34.
    3. HA, Hien Thi, Aleš HORÁK, Marek MEDVEĎ and Zuzana NEVĚŘILOVÁ. Recognition of OCR Invoice Metadata Block Types. In P. Sojka, A. Horák, I. Kopeček, K. Pala. Text, Speech, and Dialogue, 21st International Conference, TSD 2018. Switzerland: Springer International Publishing, 2018, p. 304-312. ISBN 978-3-030-00793-5. Available from: https://dx.doi.org/10.1007/978-3-030-00794-2_33.
      Name (in English): Recognition of OCR Invoice Metadata Block Types
      RIV/00216224:14330/18:00103049 Proceedings paper. English. Switzerland.
      Ha, Hien Thi (704 Viet Nam, belonging to the institution) -- Horák, Aleš (203 Czech Republic, guarantor, belonging to the institution) -- Medveď, Marek (703 Slovakia, belonging to the institution) -- Nevěřilová, Zuzana (203 Czech Republic, belonging to the institution)
      Keywords in English: OCR; scanned documents; document metadata; invoice metadata extraction
      Type of proceedings: pre-proceedings
      International impact: yes
      Reviewed: yes

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 30/4/2019 07:42.

    2017

    1. BUI, Minh Tuan, Radek DOSKOČIL, Vaclav KRIVANEK, Hien Thi HA, Yves T. BERGEON and Patrik KUTILEK. Indirect method to estimate distance measurement based on single visual cameras. Online. In Vaclav Krivanek. 2017 International Conference on Military Technologies (ICMT). Brno: IEEE, 2017, p. 695-700. ISBN 978-1-5386-1988-9. Available from: https://dx.doi.org/10.1109/MILTECHS.2017.7988846.
      RIV/00216224:14330/17:00108777 Proceedings paper. Informatics. English. United States of America.
      Bui, Minh Tuan (704 Viet Nam) -- Doskočil, Radek (203 Czech Republic) -- Krivanek, Vaclav (203 Czech Republic) -- Ha, Hien Thi (704 Viet Nam, belonging to the institution)
      Keywords in English: distance measurement; single visual camera; indirect method; uncertainty; measurement error
      Type of proceedings: post-proceedings
      International impact: yes
      Reviewed: yes

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 3/5/2020 11:12.
    2. HA, Hien Thi. Recognition of Invoices from Scanned Documents. Online. In Horák A., Rychlý P., Rambousek A. RASLAN 2017 Recent Advances in Slavonic Natural Language Processing. first. Brno, Czech Republic: NLP Consulting, 2017, p. 71-78. ISBN 978-80-263-1340-3.
      RIV/00216224:14330/17:00099020 Proceedings paper. Informatics. English. Czech Republic.
      Ha, Hien Thi (704 Viet Nam, guarantor, belonging to the institution)
      Keywords in English: classification; recognition; invoice; OCR; Czech
      Type of proceedings: pre-proceedings
      International impact: yes
      Reviewed: yes

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 18/5/2018 05:52.
Displayed: 27/5/2024 06:28