'OpenRefine + RDF: handling repeated occurrences
Converting XML to RDF using OpenRefine with the RDF-extension. Here's a sample record in OR.
I've been looking for a way to handle repeated occurrences, in this case two in animals - narrower
. This is what I got to:
@prefix ex: <http://example.com/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
ex:mammals a skos:Concept;
skos:narrower "ex:cats, ex:cows" .
In the RDF node pane, I'm mapping the narrowers to text, and use below GREL to get all the occurrence on one line with the ex:
prefix.
forEach(row.record.cells['animals - narrower'].value,v,'ex:'+v).join(', ')
It's almost what I want. Ideally, there should be no quotes:
ex:mammals a skos:Concept;
skos:narrower ex:cats, ex:cows .
In other words, I want to map to a URI type, but for better legibility, I want to keep the ex:
prefix, rather than the full URI. Also I want to keep them as a repeated objects to the one `skos:narrower' predicate, not a line with predicate + object for each occurrence.
I know how to get rid of the quotes using an editor. I'm just hoping there's a better way than the one I've found.
Also, I know how to spread repeated occurrences over individual new columns and map each new column, but in real life I would have to map over 100 columns to only a handful of target skos fields. That's not what automation is for.
Solution 1:[1]
Meanwhile, RDF-Transform emerged. It has the answer to this question. There is a slight difference in the way it works, exemplified by the highlighted mammals in this screenshot.
One caveat: I am seeing this issue running the extension in Windows 10 with with the JRE. The workaround that I found is to run OpenRefine with RDF-Transform plugin from WSL.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|---|
Solution 1 | RolfBly |