conference presentation
MarcXimiL : near duplicates detection (and similarity analysis)
2011
MarcXimiL is an open source tool which works on MARCXML records and calculates similarity indices between these records. After a short theoretical introduction, the tutorial will focus on how to install, parametrize and use the tool. This tool can be implemented in order to : * prevent creation of duplicates (similar records are shown during the validation process) * identify duplicates into batch files before ingest * find duplicates inside a collection * suggest to users similar records to the one found after a request * match related documents eg. preprints and articles * and so on.
Type
conference presentation
Author(s)
Krause, Jan
Date Issued
2011
Note
Tutorial session
Written at
EPFL
EPFL units
| Event name | Event place | Event date |
Geneva | June 22-24, 2011 | |
Available on Infoscience
June 26, 2011
Use this identifier to reference this record