'DIKW Model Applied on PDF Files
I did some research on the DIKW pyramid (data, information, knowledge, wisdom). And am confused on how to apply it on a PDF file. For example, if I extract specific sentences from a PDF, am I extracting data or information? Does it matter if I label these sentences? For example if I label the extracted sentences as containing user review they are information and when they're just left with no label (context) they are data?
The question is mainly for a terminology issue and how to classify down different elements (words, sentences, letters, images, titles...) of a PDF into the relevant layer in the DIKW model. If you know a better information systems model please suggest it too.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|