Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Representative sample extraction from web data streams

Scriney, Michael orcid logoORCID: 0000-0001-6813-2630, Xing, Congcong, McCarren, Andrew orcid logoORCID: 0000-0002-7297-0984 and Roantree, Mark (2019) Representative sample extraction from web data streams. In: Database and Expert Systems Applications - 30th International Conference, DEXA 2019 {I}, 26-29 Aug 2019, Linz, Austria. ISBN 978-3-030-27614-0

Abstract
Smart or digital city infrastructures facilitate both decision support and strategic planning with applications such as government services, healthcare, transport and traffic management. Generally, each service generates multiple data streams using different data models and structures. Thus, any form of analysis requires some form of extract-transform-load process normally associated with data warehousing to ensure proper cleaning and integration of heterogeneous datasets. In addition, data produced by these systems may be generated at a rate which cannot be captured completely using standard computing resources. In this paper, we present an ETL system for transport data coupled with a smart data acquisition methodology to extract a subset of data suitable for analysis.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Data Warehousing; Data Mining; Data Analytics; ETL; Web Data
Subjects:UNSPECIFIED
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Institutes and Centres > INSIGHT Centre for Data Analytics
Published in: Database and Expert Systems Applications - 30th International Conference, DEXA 2019, Proceedings, Part I. Lecture Notes in Computer Science 11706. Springer. ISBN 978-3-030-27614-0
Publisher:Springer
Official URL:https://doi.org/10.1007/978-3-030-27615-7_26
Copyright Information:© 2019 Springer
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland (SFI) and the Department of Agriculture, Food and Marine on behalf of the Government of Ireland under Grant Number 16/RC/3835
ID Code:23659
Deposited On:23 Aug 2019 09:22 by Michael Scriney . Last Modified 23 Aug 2019 09:22
Documents

Full text available as:

[thumbnail of _Michael__Dawak2019 (2).pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
492kB
Metrics

Altmetric Badge

Dimensions Badge

Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record