'Extracting education from Text data
I have bunch of text data describing people's eduation. I have already done some basic NLP processing to those text data.
An example would be this : XXX received his MBA degree from ABC University and MSc degree from the University of XYZ.
I have already done some job that recongnize those university entities.
I would want to further refine the result to link their degree to university. Use the above result , i would like to have : ("MBA degree" , "ABC University") , ("MSc degree" , "University of XYZ").
One things i noticed would help is i observe that the "degree" keyword is rather specific , It mostly consist of : Phd , BA , B.S. , master , doctorate.
Is there any algorithm that can do the job ? I know there is a NLP are called relation extraction , but i have the task i wish to done is rather specific and really don't need a such complicated and resource-expensive technique.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|