Viewing Resolve Project Confidence
There are two important measures related to Golden Records. One is the Model Confidence, and the other is Data Quality. Let us look at Confidence first. The Confidence is shown separately for Entities Resolved and Entities Mastered. The Model Confidence in Resolve, and typically across the product is split into High, Medium and Low confidence records, which together give the combined confidence figure.
The Confidence for Entities Resolved is a measure of how confident the system feels about the grouping of individual records i.e. clusters generated. Similarly, the Confidence of Entities Mastered is a measure of how confident the system feels that the most representative attributes were picked in the clusters to generate ‘Golden records’.
The Compression Ratio is a measure of how many distinct records it could derive from the multiple Data Sets which in turn is an indication of the level of duplication and scattered nature of legacy data of the system. For example, a Compression Ratio of 1.6:1 means that on average there was only one distinct logical record for every 1.6 records concerning the entity in the project.