Web-Platform developed from data obtained from KBRA(ARBK), based on business development depending on location
Session
Information Systems and Security
Description
Nowadays the term Big Data is very important as well as the visualization of this data. The data can be processed and therefore can be easily maneuvered to be modified and we can generate results through raw data. Raw data always contains different types of problems such as duplicate or duplicate data, deficient data that lacks specifics, which may display a weak pattern, and therefore the best techniques are used and reasonable enough to display satisfactory results through applications. Therefore, it is essential that the data be thoroughly cleaned before proceeding with the analysis step. The process that cleans a database depending on potential problems in the world of technology and programming is called data cleaning. Unfortunately, data cleansing is inevitable and primary, it is also a task that takes a lot of time to find the most appropriate way to minimize problems, this is due to the lack of information in the data that I have and have encountered in the data set of the Business Registration Agency in Kosovo.
In this work, a programming language that has been quite suitable for me is Python as a scripting programming language for developing data analysis in order to meet the requirements that fit the title of my work. The tool used through this language is able to identify potential data issues and report such results and recommendations so that users can clean the data smoothly and effectively with its help. Compared to existing data cleaning tools, this tool has been specially created to address tasks in the working machine and can find the optimal cleaning approach according to the data characteristics.
My paper aims to show a result in the use of tools for visualizing information, increase the readability and usability of data and finally how feasible it is to present results to unstructured data. The database analysis focuses on data maintained by the Kosovo Open Business Registration Agency (KBRA).
The results achieved are worked with a lot of dedication, and are results which contain detailed information so that the user has the opportunity to maneuver within the opportunities offered. At first glance the generated results serve as open information and not so detailed but through the options provided by the created site the user can have different display options to see in more detail about the information that the site has about the data on businesses in Kosovo. The analysis of this data was done through Geodata - data spread on a map of Kosovo where based on the filters are displayed results which are quite easy to read and offers maneuvering in the filters placed next to the attributes possessed by the dataset.
Keywords:
Data visualization, data cleaning, generating results based on requirements
Proceedings Editor
Edmond Hajrizi
ISBN
978-9951-550-47-5
First Page
1
Location
UBT Kampus,Lipjan
Start Date
30-10-2021 12:00 AM
End Date
30-10-2021 12:00 AM
DOI
10.33107/ubt-ic.2021.100
Recommended Citation
Abdullahu, Arta and Metin, Hasan, "Web-Platform developed from data obtained from KBRA(ARBK), based on business development depending on location" (2021). UBT International Conference. 29.
https://knowledgecenter.ubt-uni.net/conference/2021UBTIC/all-events/29
Web-Platform developed from data obtained from KBRA(ARBK), based on business development depending on location
UBT Kampus,Lipjan
Nowadays the term Big Data is very important as well as the visualization of this data. The data can be processed and therefore can be easily maneuvered to be modified and we can generate results through raw data. Raw data always contains different types of problems such as duplicate or duplicate data, deficient data that lacks specifics, which may display a weak pattern, and therefore the best techniques are used and reasonable enough to display satisfactory results through applications. Therefore, it is essential that the data be thoroughly cleaned before proceeding with the analysis step. The process that cleans a database depending on potential problems in the world of technology and programming is called data cleaning. Unfortunately, data cleansing is inevitable and primary, it is also a task that takes a lot of time to find the most appropriate way to minimize problems, this is due to the lack of information in the data that I have and have encountered in the data set of the Business Registration Agency in Kosovo.
In this work, a programming language that has been quite suitable for me is Python as a scripting programming language for developing data analysis in order to meet the requirements that fit the title of my work. The tool used through this language is able to identify potential data issues and report such results and recommendations so that users can clean the data smoothly and effectively with its help. Compared to existing data cleaning tools, this tool has been specially created to address tasks in the working machine and can find the optimal cleaning approach according to the data characteristics.
My paper aims to show a result in the use of tools for visualizing information, increase the readability and usability of data and finally how feasible it is to present results to unstructured data. The database analysis focuses on data maintained by the Kosovo Open Business Registration Agency (KBRA).
The results achieved are worked with a lot of dedication, and are results which contain detailed information so that the user has the opportunity to maneuver within the opportunities offered. At first glance the generated results serve as open information and not so detailed but through the options provided by the created site the user can have different display options to see in more detail about the information that the site has about the data on businesses in Kosovo. The analysis of this data was done through Geodata - data spread on a map of Kosovo where based on the filters are displayed results which are quite easy to read and offers maneuvering in the filters placed next to the attributes possessed by the dataset.