Skip to main content
DORAS
DCU Online Research Access Service
Login (DCU Staff Only)
Representative sample extraction from web data streams

Scriney, Michael ORCID: 0000-0001-6813-2630, Xing, Congcong, McCarren, Andrew ORCID: 0000-0002-7297-0984 and Roantree, Mark (2019) Representative sample extraction from web data streams. In: Database and Expert Systems Applications - 30th International Conference, DEXA 2019 {I}, 26-29 Aug 2019, Linz, Austria. ISBN 978-3-030-27614-0

Full text available as:

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
492kB

Abstract

Smart or digital city infrastructures facilitate both decision support and strategic planning with applications such as government services, healthcare, transport and traffic management. Generally, each service generates multiple data streams using different data models and structures. Thus, any form of analysis requires some form of extract-transform-load process normally associated with data warehousing to ensure proper cleaning and integration of heterogeneous datasets. In addition, data produced by these systems may be generated at a rate which cannot be captured completely using standard computing resources. In this paper, we present an ETL system for transport data coupled with a smart data acquisition methodology to extract a subset of data suitable for analysis.

Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Data Warehousing; Data Mining; Data Analytics; ETL; Web Data
Subjects:UNSPECIFIED
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Initiatives and Centres > INSIGHT Centre for Data Analytics
Published in: Database and Expert Systems Applications - 30th International Conference, DEXA 2019, Proceedings, Part I. Lecture Notes in Computer Science 11706. Springer. ISBN 978-3-030-27614-0
Publisher:Springer
Official URL:https://doi.org/10.1007/978-3-030-27615-7_26
Copyright Information:© 2019 Springer
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland (SFI) and the Department of Agriculture, Food and Marine on behalf of the Government of Ireland under Grant Number 16/RC/3835
ID Code:23659
Deposited On:23 Aug 2019 09:22 by Michael Scriney . Last Modified 23 Aug 2019 09:22

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

Altmetric
- Altmetric
+ Altmetric
  • Student Email
  • Staff Email
  • Student Apps
  • Staff Apps
  • Loop
  • Disclaimer
  • Privacy
  • Contact Us