Open Access Journal

ISSN: 2394-6849 (Online)

International Journal of Engineering Research in Electronics and Communication Engineering (IJERECE)

Monthly Journal for Electronics and Communication Engineering

Unstructured Text to DBpedia RDF Triples – Entity Extraction

Authors: Monika S G, Chiranjeevi S, Harshitha A, Harshitha M, V K Tivari, Raghevendra Rao

Date of Publication: 30th November 2017

Abstract: The use of data and information has grown significantly over the last few years. Information processing now faces the problem that data originates from multiple sources in an uncontrolled environment: it is gathered beyond the organization and generated by many people working outside it. The intent of this paper is to delve into this unformatted information and build a framework that makes the information more manageable and usable for the organization. Case in point: resumes submitted for particular positions should become searchable. With this framework, we try to solve that problem and provide suggestions on how to solve other, similar problems. We describe an end-to-end system that automatically extracts RDF triples describing entity relations and properties from unstructured text. The system is based on a pipeline of text processing modules that includes a semantic parser and a co-reference solver. By using co-reference chains, we group entity actions and properties described in different sentences and convert them into entity triples. We applied our system to over 114,000 Wikipedia articles and extracted more than 1,000,000 triples. Using an ontology-mapping system that we bootstrapped from existing DBpedia triples, we mapped 189,000 extracted triples onto the DBpedia namespace. These extracted entities are available online in the N-Triples format.
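
To make the co-reference grouping step concrete, the following is a minimal Python sketch and not the authors' implementation. It assumes a parser has already produced (mention, predicate, value) facts and a co-reference solver has produced chains of mentions. The function names, the toy data, and the example.org namespaces are all hypothetical placeholders chosen for illustration.

```python
# Toy co-reference chains: canonical entity -> mentions referring to it.
# (Hypothetical data; a real solver would also track sentence positions.)
CHAINS = {
    "Marie Curie": {"Marie Curie", "Curie", "She"},
}

# Toy facts as (subject mention, predicate, value), e.g. from a semantic parser.
FACTS = [
    ("Marie Curie", "bornIn", "Warsaw"),
    ("She", "won", 'the "Nobel Prize"'),
]


def resolve(mention, chains):
    """Return the canonical entity whose chain contains the mention."""
    for canonical, mentions in chains.items():
        if mention in mentions:
            return canonical
    return mention  # no chain found: treat the mention as its own entity


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_ntriples(facts, chains,
                resource_ns="http://example.org/resource/",
                property_ns="http://example.org/property/"):
    """Group facts by co-referent entity and emit one N-Triples line each."""
    lines = []
    for mention, predicate, value in facts:
        entity = resolve(mention, chains).replace(" ", "_")
        subject = f"<{resource_ns}{entity}>"
        prop = f"<{property_ns}{predicate}>"
        literal = f'"{escape_literal(value)}"'
        lines.append(f"{subject} {prop} {literal} .")
    return lines


if __name__ == "__main__":
    for line in to_ntriples(FACTS, CHAINS):
        print(line)
```

In this sketch both facts end up with the same subject IRI, because the pronoun mention resolves to the chain's canonical entity. Mapping the placeholder properties onto an actual ontology namespace would be a separate step.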
