'DIKW Model Applied on PDF Files

I did some research on the DIKW pyramid (data, information, knowledge, wisdom). And am confused on how to apply it on a PDF file. For example, if I extract specific sentences from a PDF, am I extracting data or information? Does it matter if I label these sentences? For example if I label the extracted sentences as containing user review they are information and when they're just left with no label (context) they are data?

The question is mainly for a terminology issue and how to classify down different elements (words, sentences, letters, images, titles...) of a PDF into the relevant layer in the DIKW model. If you know a better information systems model please suggest it too.



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source