Date of Award

Winter 12-2017

Document Type

Thesis

Degree Name

Bachelor Degree

Department

Computer Science

First Advisor

Bertan Karahoda

Language

English

Abstract

We live in the golden age of information. The World Wide Web contains a vast amount of unstructured text in different digital formats, including newswire, blogs, email communications, governmental documents, and chat logs. Some of the largest companies and organizations have created knowledge bases that represent semantic networks of facts, entities, and the relations between them. Although this area has been well researched for a considerable time, implementations of such knowledge extraction for the Albanian language are lacking. In this thesis we will try to create a general Albanian knowledge graph from unstructured text. Existing state-of-the-art approaches to relation extraction in other languages will be reviewed and applied. We will present the process of creating a knowledge base using natural language processing techniques together with graph modeling, storage, and retrieval. Finally, we will discuss important potential applications of such a knowledge base in industry and academia.

DOI

10.33107/ubt-etd.2017.1570
