Representative sample extraction from web data streams
Scriney, Michael (ORCID: 0000-0001-6813-2630), Xing, Congcong, McCarren, Andrew (ORCID: 0000-0002-7297-0984) and Roantree, Mark
(2019)
Representative sample extraction from web data streams.
In: Database and Expert Systems Applications - 30th International Conference, DEXA 2019, Part I, 26-29 Aug 2019, Linz, Austria.
ISBN 978-3-030-27614-0
Smart or digital city infrastructures facilitate both decision support and strategic planning with applications such as government services, healthcare, transport and traffic management. Generally, each service generates multiple data streams using different data models and structures. Thus, any analysis requires an extract-transform-load (ETL) process of the kind normally associated with data warehousing to ensure proper cleaning and integration of heterogeneous datasets. In addition, data produced by these systems may be generated at a rate which cannot be captured completely using standard computing resources. In this paper, we present an ETL system for transport data coupled with a smart data acquisition methodology to extract a subset of data suitable for analysis.
Metadata
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Conference
Refereed:
Yes
Uncontrolled Keywords:
Data Warehousing; Data Mining; Data Analytics; ETL; Web Data
Database and Expert Systems Applications - 30th International Conference, DEXA 2019, Proceedings, Part I. Lecture Notes in Computer Science 11706. Springer. ISBN 978-3-030-27614-0
This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders:
Science Foundation Ireland (SFI) and the Department of Agriculture, Food and the Marine on behalf of the Government of Ireland under Grant Number 16/RC/3835
ID Code:
23659
Deposited On:
23 Aug 2019 09:22 by Michael Scriney. Last Modified 23 Aug 2019 09:22