Integrity report for jacob_olie_sources.xlsx
Sheet: Sources rows: 3657 cols: 18
==============================================================================
Data rows: 3656
--- Outcome distribution (URL replacement succes) ---
TRUE 3640
FALSE 16
--- Anomaly: TRUE rows with no Beta URL: 0 ---
--- FALSE rows: 16 ---
no_beta_url 16
--- Anomaly: rows with EMPTY result cell: 0 ---
--- Anomaly: UUID mismatch / unparseable between Detail URL columns: 0 ---
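The UUID check above can be sketched as follows. This is a minimal illustration, not the script that produced this report. It assumes both Detail URL columns carry the record UUID somewhere in the URL; only the beta form (https://beta.archief.amsterdam/detail/<uuid>) is visible in this report, so the archief.amsterdam path used in the example is hypothetical, as are the function names.

```python
import re

# Canonical 8-4-4-4-12 hexadecimal UUID, matched anywhere in the string.
UUID_RE = re.compile(
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}",
    re.IGNORECASE,
)

def extract_uuid(url):
    """Return the first UUID found in url (lowercased), or None."""
    match = UUID_RE.search(url or "")
    return match.group(0).lower() if match else None

def uuid_status(beta_url, archief_url):
    """Classify a row as 'match', 'mismatch', or 'unparseable'."""
    beta_id = extract_uuid(beta_url)
    archief_id = extract_uuid(archief_url)
    if beta_id is None or archief_id is None:
        return "unparseable"
    return "match" if beta_id == archief_id else "mismatch"

# The beta URL is taken from this report; the second URL's path is hypothetical.
BETA = "https://beta.archief.amsterdam/detail/b4cb7738-ea59-d722-0a19-ec87c8021a5d"
ARCHIEF = "https://archief.amsterdam/detail/b4cb7738-ea59-d722-0a19-ec87c8021a5d"
print(uuid_status(BETA, ARCHIEF))
```

A row counts as an anomaly when the status is anything other than 'match'; the report found none.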
--- Anomaly: duplicate Filename values: 0 ---
--- Duplicate Source URLs (same Beeldbank URL on multiple rows): 94 ---
x11: http://stadsarchief.amsterdam.nl/archief/10019
x6: http://beeldbank.amsterdam.nl/afbeelding/10019A000790
x3: http://beeldbank.amsterdam.nl/afbeelding/10019A001037
x3: http://beeldbank.amsterdam.nl/afbeelding/10019A000723
x3: http://beeldbank.amsterdam.nl/afbeelding/010019001743
x3: http://beeldbank.amsterdam.nl/afbeelding/10019A001190
x3: http://www.waalseiland.nl/Kromme%20Waal/index.htm
x2: http://beeldbank.amsterdam.nl/afbeelding/010019000110
x2: http://beeldbank.amsterdam.nl/afbeelding/010019000069
x2: http://beeldbank.amsterdam.nl/afbeelding/010019000131
... and 84 more
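The figure 94 above reads as the number of distinct Source URLs that occur on more than one row (10 listed plus 84 more). A count of that kind can be sketched with collections.Counter. The function name and the sample column are illustrative assumptions, not taken from the script behind this report.

```python
from collections import Counter

def duplicate_values(values):
    """Return (value, count) pairs for values seen more than once,
    most frequent first. Empty and None cells are ignored."""
    counts = Counter(v.strip() for v in values if v and v.strip())
    dupes = [(value, n) for value, n in counts.items() if n > 1]
    # Stable sort: ties stay in first-seen order.
    dupes.sort(key=lambda pair: -pair[1])
    return dupes

# Hypothetical sample column, not the real spreadsheet contents.
sample = [
    "http://beeldbank.amsterdam.nl/afbeelding/10019A000790",
    "http://beeldbank.amsterdam.nl/afbeelding/10019A000790",
    "http://beeldbank.amsterdam.nl/afbeelding/010019000110",
    "",
    None,
]
for url, n in duplicate_values(sample):
    print(f"x{n}: {url}")
```

The same helper would apply unchanged to the Beta URL column in the next section.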
--- Anomaly: duplicate Beta URLs (same Memorix record from multiple rows): 103 ---
x6: https://beta.archief.amsterdam/detail/b4cb7738-ea59-d722-0a19-ec87c8021a5d
x3: https://beta.archief.amsterdam/detail/cb77a0a5-c7ab-9aa2-1d2a-f060f68b2b93
x3: https://beta.archief.amsterdam/detail/173c974d-cbd4-ed1c-2d8e-c0f31d84dd3c
x3: https://beta.archief.amsterdam/detail/2c4ef826-858e-af60-1fb4-b47c7c0669a8
x3: https://beta.archief.amsterdam/detail/53fed917-1337-92d6-7600-d50e14f79e39
x3: https://beta.archief.amsterdam/detail/73b68bef-9575-15a2-a4dc-eb6cffe5cac4
x3: https://beta.archief.amsterdam/detail/aa6703c3-762d-8c98-e880-61de4f9bdc42
x2: https://beta.archief.amsterdam/detail/dd91a7ed-533b-e4b3-7223-c7f32cf931f5
x2: https://beta.archief.amsterdam/detail/e3b04f4f-20e7-51e9-e23a-4b8b22dd704e
x2: https://beta.archief.amsterdam/detail/01aa2ccf-d63b-7ef3-cd4b-67325d4032fe
... and 93 more
--- Anomaly: rows with 'ERROR ...' in a metadata cell: 0 ---
--- Anomaly: TRUE rows with ZERO metadata fields populated: 0 ---
--- Anomaly: TRUE rows missing one or more required metadata fields: 75 ---
row 17: 'File:Amstel, foto 8 Jacob Olie (max res).jpg' missing=['Datering']
row 93: 'File:Onbekend (max res).jpg' missing=['Datering']
row 95: 'File:Noordzeekanaal, foto 8 Jacob Olie (max res).jpg' missing=['Datering']
row 125: 'File:Stadhouderskade 42 (max res).jpg' missing=['Datering']
row 126: 'File:Gemetselde wrongtrap in de Franekertoren in het nieuwe Fragmentengebouw.jpg' missing=['Datering']
row 251: 'File:Zandhoek, links het Wersterdok, bezoek Koningin Wilhelmina, 28 september 1904, foto 4 Jacob Olie (max res).jpg' missing=['Datering']
row 259: 'File:Paardrijden op de heide, Engels plaatje (max res).jpg' missing=['Datering']
row 262: 'File:Rokin 1-3B (oude nummers, rechts, vlnr) Jacob Olie (max res).jpg' missing=['Datering']
row 319: 'File:Laren (NH), foto 7 Jacob Olie (max res).jpg' missing=['Datering']
row 386: 'File:Dam-noordzijde, Afbraak Koopmansbeurs Jacob Olie (max res).jpg' missing=['Datering']
row 538: 'File:Onbekend foto 1 (max res).jpg' missing=['Datering']
row 581: 'File:IJ (Afgesloten), foto 72 Jacob Olie (max res).jpg' missing=['Datering']
row 582: 'File:IJ (Afgesloten), foto 74 Jacob Olie (max res).jpg' missing=['Datering']
row 602: 'File:Open Havenfront, foto 14 Jacob Olie (max res).jpg' missing=['Datering']
row 603: 'File:Open Havenfront, foto 15 Jacob Olie (max res).jpg' missing=['Datering']
row 629: 'File:Jacob Olie - Nieuwmarkt Amsterdam september 1902.jpeg' missing=['Datering']
row 670: 'File:IJ (Afgesloten), foto 73 Jacob Olie (max res).jpg' missing=['Datering']
row 694: 'File:Linnaeusstraat, foto 2 Jacob Olie (max res).jpg' missing=['Datering']
row 724: 'File:Abcoude, foto 76 Jacob Olie (max res).jpg' missing=['Datering']
row 877: 'File:Huidekoperstraat 22 (max res).jpg' missing=['Datering']
... and 55 more
--- Metadata coverage (out of 11 fields) ---
0 fields: 1
1 fields: 15
2 fields: 0
3 fields: 0
4 fields: 0
5 fields: 0
6 fields: 0
7 fields: 23
8 fields: 215
9 fields: 1023
10 fields: 2293
11 fields: 86
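The coverage table above is a histogram of how many of the 11 metadata fields are populated per data row, and its counts add up to the 3656 data rows. The sketch below shows one way to build such a histogram and checks that sum. It assumes rows are available as dictionaries keyed by field name; the function name and sample rows are hypothetical.

```python
from collections import Counter

def coverage_histogram(rows, fields):
    """Return a list where index n holds the number of rows
    with exactly n of the given fields populated."""
    hist = Counter()
    for row in rows:
        filled = 0
        for field in fields:
            value = row.get(field)
            # A cell counts as populated if it is not None and not blank.
            if value is not None and str(value).strip() != "":
                filled += 1
        hist[filled] += 1
    return [hist.get(n, 0) for n in range(len(fields) + 1)]

# Counts copied from the coverage table in this report (0 to 11 fields).
REPORTED = [1, 15, 0, 0, 0, 0, 0, 23, 215, 1023, 2293, 86]
assert sum(REPORTED) == 3656  # every data row falls in exactly one bucket

# Hypothetical two-field example.
demo_rows = [
    {"Titel": "x", "Datering": ""},
    {"Titel": "x", "Datering": "1902"},
    {},
]
print(coverage_histogram(demo_rows, ["Titel", "Datering"]))
```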
--- Per-field empty cells (out of 3656 data rows) ---
Gebouw (sk_gebouw) 3566
Geografische aanduiding (geografische_aanduiding) 1139
Beschrijving (dc_description) 342
Datering (dc_date) 91
Inventarissen (dc_source) 18
Titel (dc_title) 16
Documenttype (sk_documenttype) 16
Vervaardiger (sk_vervaardiger) 16
Collectie (dc_provenance) 16
Afbeeldingsbestand (identifier) 16
Rechthebbende (sr_rechthebbende) 1
--- Source URL value distribution (anomalies only; URLs not counted) ---
empty 13
non-URL value 9 (e.g. row 3601)
--- Anomaly: File URL (Commons) inconsistent with Filename: 0 ---
--- Anomaly: cells with leading/trailing whitespace (first 30): 0 ---
--- Anomaly: cells with control / non-printable characters (first 30): 0 ---
--- Anomaly: rows with EMPTY Filename: 0 ---
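The whitespace and control-character checks above could be implemented along these lines. This is a sketch under assumptions: "control / non-printable" is interpreted here as Unicode general category Cc only, which may be narrower or broader than what the report's script actually tested, and the function names are hypothetical.

```python
import unicodedata

def has_edge_whitespace(text):
    """True if the cell text starts or ends with whitespace."""
    return text != text.strip()

def has_control_chars(text):
    """True if the text contains any Unicode control character.

    Category "Cc" includes NUL, tab, carriage return, and newline.
    """
    return any(unicodedata.category(ch) == "Cc" for ch in text)

# Hypothetical cell values for illustration.
for cell in ["Jacob Olie", " Jacob Olie", "Jacob\x00Olie"]:
    print(repr(cell), has_edge_whitespace(cell), has_control_chars(cell))
```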
--- URL format validation per column ---
Beta Archief Amsterdam Detail URL bad: 0 (expected host=beta.archief.amsterdam)
Archief Amsterdam Detail URL bad: 0 (expected host=archief.amsterdam)
File URL (Commons) bad: 0 (expected host=commons.wikimedia.org)
Inventarissen (dc_source) bad: 0 (expected host=archief.amsterdam)
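A per-column host check like the one summarised above can be sketched with urllib.parse from the standard library. The expected hosts are copied from this report. Everything else is an assumption: the exact-match comparison, the http/https scheme requirement, the decision to skip empty cells (on the reading that they are counted under the per-field empty-cell figures), and the path in the demo URL.

```python
from urllib.parse import urlsplit

# Column name -> expected host, as listed in this report.
EXPECTED_HOSTS = {
    "Beta Archief Amsterdam Detail URL": "beta.archief.amsterdam",
    "Archief Amsterdam Detail URL": "archief.amsterdam",
    "File URL (Commons)": "commons.wikimedia.org",
    "Inventarissen (dc_source)": "archief.amsterdam",
}

def is_bad_url(value, expected_host):
    """True if a non-empty cell is not an http(s) URL on expected_host."""
    if value is None or not value.strip():
        return False  # assumption: empty cells are reported separately
    try:
        parts = urlsplit(value.strip())
        host = (parts.hostname or "").lower()
    except ValueError:
        return True  # malformed URL, e.g. an unbalanced IPv6 bracket
    return parts.scheme not in ("http", "https") or host != expected_host

url = "https://beta.archief.amsterdam/detail/b4cb7738-ea59-d722-0a19-ec87c8021a5d"
print(is_bad_url(url, EXPECTED_HOSTS["Beta Archief Amsterdam Detail URL"]))
```

An exact host comparison treats a value on archief.amsterdam as bad for the beta column, which keeps the two Detail URL columns from being swapped unnoticed.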
==============================================================================
End of report.