Discovering duplicates and merging
Discovering duplicates
Descriptions in the database should be unique. Each person, publication or institution should be described only once. If we see different variants of the same description in the database, then they should be merged into one record. The merging procedure allows one to select from the duplicates the fields that should appera in the target unique record.
Causes of duplicates may include:
automatic downloading of data from other systems (like Scopus)
manual importing data from other systems
typos in the description
entering data by authors (autofilling)
other data recording
mistakes by editors entering descriptions.
One can easily idetify diplicates among objects of a given type. Having the identified one can indicate duplicates and then merge them. To do so, you can proceed as foolow:
on the editor panel select the type of objects you want to discover duplicates;
perform search of objects (preferably covering the whole set of the objects. In order to make the process more efficient you can select a subset, e.g. for the publications choose a range of publication dates
Select all the retrieved object (preferably) or a limited part
from the 3-dots menu select the option
On the window as below confirm
detect duplicates
On the screen as below identify the duplicates, check them and use the option
Merge Selected Records
Select the target record (as below) - it should be the one that loos more complete
Use the option
Interactive merge
On the left column you see the
Field names
, the middle column containsTarget values
, the right column shows the option values from other dulicate record(s).the triangle sign on the left column preceeding the field name means that the name is composed. it is strongly recomended that before transferring the fields from duplicate to target you “open” the composed fields, so that you can see all the components inthe field
the sign
x
at the value in the middle column can be used to remove the value from the targetThe sign
v
at the value in the middle column can be used to open the alternative fields to move the value to another fieldThe sign + at the value in the right column means that you can add it to the target record to the field as it is in the duplicate
The sign
v
at the value in the righ column means that before transferring to the target you can rename it (send to an alternative field)with the above functions you can build the target record
having completed building the target you can
Apply
Reset
orCancel
If you apply your specifications for the target (using Apply
) then on the screen as in p. 6 use the option merge selected records (also in history) is to merge the records.