NOVÁK, David, Michal BATKO and Pavel ZEZULA. Large-scale similarity data management with distributed Metric Index. Information Processing and Management. ELSEVIER, 2012, vol. 48, no. 5, pp. 855-872. ISSN 0306-4573. Available from: https://dx.doi.org/10.1016/j.ipm.2010.12.004.
Other formats:
BibTeX
LaTeX
RIS
@article{988255,
  author = {Novák, David and Batko, Michal and Zezula, Pavel},
  title = {Large-scale similarity data management with distributed Metric Index},
  journal = {Information Processing and Management},
  volume = {48},
  number = {5},
  pages = {855--872},
  year = {2012},
  issn = {0306-4573},
  doi = {10.1016/j.ipm.2010.12.004},
  language = {eng},
  keywords = {Distributed data structures; Performance tuning; Similarity search; Scalability; Peer-to-peer structured networks; Metric space}
}
TY  - JOUR
ID  - 988255
AU  - Novák, David
AU  - Batko, Michal
AU  - Zezula, Pavel
PY  - 2012
TI  - Large-scale similarity data management with distributed Metric Index
JF  - Information Processing and Management
VL  - 48
IS  - 5
SP  - 855
EP  - 872
PB  - ELSEVIER
SN  - 0306-4573
KW  - Distributed data structures
KW  - Performance tuning
KW  - Similarity search
KW  - Scalability
KW  - Peer-to-peer structured networks
KW  - Metric space
N2  - Metric space is a universal and versatile model of similarity that can be applied in various areas of non-text information retrieval. However, a general, efficient and scalable solution for metric data management is still a resisting research challenge. In this work, we try to make an important step towards such management system that would be able to scale to data collections of billions of objects. We propose a distributed index structure for similarity data management called the Metric Index (M-Index) which can answer queries in precise and approximate manner. This technique can take advantage of any distributed hash table that supports interval queries and utilize it as an underlying index. We have performed numerous experiments to test various settings of the M-Index structure and we have proved its usability by developing a full-featured publicly-available Web application.
ER  -
NOVÁK, David, Michal BATKO and Pavel ZEZULA. Large-scale similarity data management with distributed Metric Index. \textit{Information Processing and Management}. ELSEVIER, 2012, vol.~48, no.~5, pp.~855-872. ISSN~0306-4573. Available from: https://dx.doi.org/10.1016/j.ipm.2010.12.004.